It is appropriate with the launching of a
new journal to hail the dawn of a new day
and to sound a call for revolutionary de-
partures from the traditions of the past. For
better or for worse we are constrained in the 
formulation of new policy in that the forces 
of organizational continuity must be reckoned 
with. The first year and a half of our journal 
is already committed to articles in person-
ality and social psychology accepted under the
JASP aegis.1 Our editorial board with a few
exceptions is much the same. Moreover, an 
APA journal is representative of the re-
search of APA members in its area of interest
rather than innovative with respect to the 
aspirations of its editors. 
Nonetheless, we cannot begin the new
journal without a plea for a type of research 
which we have lacked in the past. Now that 
the field of social psychology and its sister 
discipline of personality have a journal all 
their own, we should take advantage of the 
fact by encroaching less on the domain of 
the Journal of Experimental Psychology and 
dealing more adequately with variables ap- 
propriate to our own subject matter. We will 
welcome papers which deal with a psycholog- 
ical analysis of social systems provided they 
meet our usual criteria of making theoretical 
or research contributions. Social psychology 
in the past has centered about three focal 
points: (a) the individual with his socialized 
motivations, his socially bound cognitions, 
attitudes, and perceptions; (b) face-to-face
interaction and small group process; and (c)
applied social problems. Though the major
part of the individual’s life is contained within 
organizations and large social structures, 
psychologists have characteristically operated 
as if the early family and the small group 


1 In spite of the increased inflow of manuscripts
to JASP the page allotment for 1964 was increased
by only 150 pages. (No increase at all was granted
for 1965.) This means that a few papers which more
properly belong in the Journal of Abnormal Psy-
chology will appear in JPSP during the next few
months since they were accepted by the present
editor in 1963.
association of adult life were the only sig-
nificant aspects of social behavior. With a
modesty rather unbecoming in our expanding
profession, we have left this area of study 
to sociology and the other social sciences. 
Now it is not a matter of infringing in any 
way on the rightful claim of the other social 
sciences to their investigations of society. 
We feel, however, that our own studies can 
be enriched and, in turn, we can contribute 
to the behavioral sciences by joining forces 
with them in areas of common interests. The 
facts are that social structures and social 
systems are made up of the complex, stable 
patterning of human behavior. These sys- 
tems can be viewed as the product of a long 
historical development and can be studied 
without reference to the specific human 
beings who happen at one time to be the
carriers of a given pattern. This is the posi-
tion of the culturologist and leads to certain
types of fruitful study. On the other hand, 
when interest shifts to the human carriers of 
these patterns as they affect and are affected 
by the social patterns they embody, we are in 
the field of social psychology. Social systems
are not closed systems. The inputs which are 
needed for their constant maintenance are 
human inputs and such inputs can be dis-
regarded as a constant factor only in periods
of exceptional social stability.

It is curious that we have recognized the 
relationship between our discipline and the 
other social sciences mostly at the extreme 
boundaries; namely, the relation of person- 
ality to culture. The closer relationships 
between the social psychology of the small 
group and the larger group structure of which 
it is a part have been overlooked. 

Some illustrative problems may clarify this 
neglected area of research. We occasionally 
study role behavior but omit the role system 
without which it has little meaning. We deal 
with power as if it were solely an inter- 
personal affair, namely, the ability of Individ- 
ual A to influence Individual B, but we neglect 
the exercise of power as it is mediated by so-
cial structure. We theorize about leadership
but fail to specify the social context which de-
termines and defines the relationship. We take
the decision-making process into the labora- 
tory but never return to the decision-making 
centers in the real world. We study the 
personality determinants of ethnocentrism 
but bypass the means by which individuals 
are bound into the national state and the 
conditions for the arousal of national roles. 
There have been promising beginnings in 
the investigation of normative commitment 
and value involvement as the integrative 
mechanisms of groups and organizations; for 
example, the experiments on role readiness 
when legitimate symbols are invoked or the 
work on the norms of reciprocity and depend-
ence. But these are only beginnings. Though
we live in a period of dramatic social change, 
we have seen only fragmentary studies of the 
evolutionary and revolutionary processes in 
the modern world. 

By encouraging papers in this area we 
are not advocating a transformation of the 
journal since the more traditional lines of 
investigation will still dominate our pages. 
We are urging, however, more attention to
a type of study which holds great promise for
the social psychology of the future.

The critical questions remain of how such
studies can be conducted with objectivity and
how resources can be found to support them.
The second question may find an answer
in the needs of our society and the growing
recognition that we cannot afford a social
science unless it is based upon facts, findings,
and tested generalizations. The first question
is dependent upon an adequate methodology
and the willingness to use it. We too often
vary between a strict adherence to an inap-
propriate technique because of its rigor and
a complete forgetfulness of sampling and re-
search design because of the difficulties in-
volved. Nevertheless our methodology is prob-
ably further advanced than our skillful and
imaginative use of it in applying it to
problems.

It is our conviction that social psychology
is no longer divorced from the other be-
havioral sciences and that in the long run a
journal of personality and social psychology
can profitably take account of this rapproche-
ment.

DANIEL 
INTERNALIZED BEHAVIORAL SUPPRESSION AND THE
TIMING OF SOCIAL PUNISHMENT 1


JUSTIN ARONFREED AND ARTHUR REBER 2


University of Pennsylvania

This paper reports an experiment which demonstrates that punishment of an 
act at its initiation is more effective than punishment at its completion in 
producing internalized suppression of the act. The experiment, which is carried 
out with 9- and 10-yr.-old children, is used to support a theoretical analysis 
of internalized suppression in terms of 2 sequential acquisition processes. The 
1st process is the attachment of anxiety to the intrinsic cues provided by either 
the behavioral or cognitive stimulus correlates of an incipient transgression.
The 2nd is the attachment of anxiety reduction to the intrinsic correlates of
suppression. The experimental findings are also extended to suggest that a
number of features of naturalistic socialization, other than timing of punish-
ment, affect internalized suppression through their impact on: (a) the temporal
locus and intensity of the anxiety that motivates suppression and (b) the rein-
forcement of the suppression itself. The verbal mediation of socializing agents
is singled out as the most significant of these features.


Many forms of conduct tend to become 
internalized as they are acquired in the course 
of social learning. They can then be elicited,
and maintained to some extent, in the absence 
of surveillance or reinforcement. Develop- 
mental conceptions of conscience (Bronfen- 
brenner, 1960; Miller & Swanson, 1960, Ch. 
5; Sears, Maccoby, & Levin, 1957, Ch. 10) 
and theoretical approaches to the social 
psychology of conformity (Kelman, 1958; 
Thibaut & Kelley, 1959, Ch. 13) commonly 
appear to assume that this kind of intrinsic 
control over behavior requires the indi- 
vidual’s adoption of the values or standards 
of others. It seems very doubtful, however, 
that either moral or other kinds of evaluative 
processes are indispensable to the stability 
of conduct in the absence of external moni-
tors. Preverbal children and even animals are
capable of suppressing previously punished
behavior, for example, over significant periods
of time during which punishment contingen-
cies are no longer present. Recent studies of
older children (Aronfreed, 1961, 1964) indi-
cate that their internalized reactions to al-
ready committed transgressions also can be
established and maintained with little evi- 


1 This investigation was supported in part by
Research Grant MH-06671 to the senior author
from the National Institute of Mental Health, United
States Public Health Service.

2 Now at Brown University. 


dence that they are applying evaluative 
standards to their own behavior. And common 
observation suggests that adults likewise have 
many durable patterns of social behavior 
which are remarkably independent of external 
outcomes and yet do not call upon their 
evaluative resources. 

Much of the intrinsic control of behavior 
which human beings show is, of course, medi- 
ated by evaluative cognition. But the experi- 
mental socialization paradigms which will be 
described in this paper are designed to pro- 
vide a relatively simple context for examining 
the motivational and reinforcement mecha-
nisms through which internalized behavioral
suppression is established. These mechanisms
are presumed to be fundamentally constant, 
regardless of the complexity of the cognitive 
structures which may intervene between a 
stimulus situation and the individual’s ulti- 
mate behavior. Accordingly, the paradigms 
minimize the place of cognitive structure in 
the internalization process and emphasize the 
role of intrinsic cues which are closely tied 
to behavior itself. Even under these condi- 
tions, there is a problem in specifying the 
operational criteria for internalization. The 
degree to which conduct is internalized may 
be indeterminate even though it is maintained 
without direct social observation. For ex- 
ample, its maintenance may be controlled by 
the individual’s expectations of rewards or 
punishments which will be eventually con-
tingent on his alternative actions. Whiting’s
(1959) cross-cultural observations illustrate
that control over behavior in the absence of
socializing agents remains dependent, in many
societies, on perceived external surveillance
and sanctions. The rationales which children
of different ages offer as justification for 
specific acts of conduct (Kohlberg, 1963; 
Piaget, 1948) also reveal how the evaluative 
processes which support internalization may 
vary in the extent of their reference to 
external outcomes. 

A proper criterion of internalization might 
specify that an act is internalized to the 
extent that it can be maintained in the ab-
sence of external outcomes which have di-
rectly reinforcing consequences for the actor
(and, one might wish to add, in the absence 
of the actor’s anticipation of such outcomes). 
A more pragmatic extension of this criterion, 
to be used in the present study, is that 
conduct be considered internalized if it can 
be reliably elicited in the absence of social-
izing agents, after having been acquired
under the control of either direct response 
outcomes which were mediated by the agents 
or the display of similar conduct by the 
agents. Whatever the precise meaning to be
assigned to internalization, its demonstration 
clearly requires evidence that the mainte- 
nance of conduct has shown some movement 
from external to intrinsic control. We may 
conceive of the intrinsic control as being 
mediated by changes of affective state
which have become partially contingent on 
the stimulus properties of behavior itself 
(response-produced cues) or on the cognitive 
representations of the behavior. In the case 
of internalized suppression, the relevant af-
fective changes are the induction and at-
tenuation of anxiety, which were originally
controlled by the aversive social outcomes of 
particular forms of behavior. These aversive 
outcomes can be roughly subsumed under the
heading of social punishment, since they 
usually occur in a context where they are 
perceived as being transmitted by an agent 
who specifically intends to introduce them 
in response to one’s actions. It is unlikely 
that any substantial repertoire of behavior 
becomes independent of external outcomes, 
in the course of naturalistic socialization,
through interaction that is exclusively puni-
tive. But the distinct contribution that
response-contingent punishment makes to in- 
ternalization can be examined in experimental 
situations where the effects of positive rein-
forcement have been reduced to a minimum.

The experiment to be reported here at- 
tempts to test a conception of internalized 
behavioral suppression that specifies a two- 
step acquisition process. It can be most 
conveniently illustrated in terms of the social- 
ization of the child. The first acquisition 
mechanism is the attachment of anxiety to 
stimuli which are intrinsically produced. The 
most general common effect of all types of 
socially mediated punishment is their induc- 
tion of an aversive affective state in the child. 
This state may have a number of qualitative 
variations (such as fear, guilt, or shame) 
which are dependent on the cognitive setting 
in which it is embedded. But its invariant 
motivational properties may be broadly desig- 
nated as anxiety. Once the child has had 
some experience with a punished act, the role 
of punishment in eliciting anxiety can be 
easily displaced to social cues which have 
acquired secondary aversive value through 
their previous association with punishment. 
The incipient and even ongoing actions of 
children are frequently brought under the 
control of warning signals which are provided 
by their socializing agents. 

Although the aversive social consequences 
of a child’s behavior do not always reflect 
the violation of recognized norms of conduct, 
they do always imply some form of behavioral 
constraint between the child and the agent 
of punishment. It is therefore convenient to
refer to any act as a transgression, when it
is followed by social punishment of sufficient
consistency and intensity to produce suppres-
sion of the act. If the punishment does have
some minimally consistent relationship to
the child’s act, then a component of anxiety
will necessarily become attached to the in-
trinsic stimulus correlates of the act. These
intrinsic stimuli may be cues directly pro-
duced by the performance of the act itself.
They also may be the cue properties of cogni-
tive or verbal representations of the act, in
the form, for example, of intentions or evalua-
tive processes. Thus, certain kinds of actions,
and their intrinsic representations, become ca- 
pable of eliciting anxiety both during their 
performance and before they are even carried 
out, without the benefit of any external 
surveillance or objective threat of punishment. 

The second mechanism in the internaliza- 
tion of behavioral suppression follows from 
the motivating properties of anxiety. Chil- 
dren quickly discover that they can avoid 
or attenuate social punishment and aversive 
warning stimuli by terminating an ongoing
transgression or by arresting a transgression
while it is still in an incipient stage. A variety 
of alternative nonpunished behaviors, in- 
cluding simple suppression of the transgres- 
sion, consequently acquire instrumental value 
for them in reducing the anxiety elicited by 
aversive external events. At the same time, 
these suppressive behavioral modifications 
(and their representative cognitive processes) 
eliminate the intrinsic stimulus correlates of
transgression and attenuate whatever anxiety 
may have been already independently at- 
tached to such correlates. Anxiety reduction 
will then gradually become attached directly 
to the intrinsic response-produced and cogni- 
tive correlates of suppression. And it will 
serve to reinforce the suppression, regardless 
of whether or not there are external conse- 
quences of the kind associated with surveil- 
lance and punishment. This entire two-step 
formulation is consistent with phenomena 
which are readily observable in naturalistic 
socialization, and also with the findings of 
extensive experimentation on the effects of 
punishment learning in animals (Estes, 1944; 
Mowrer, 1960a, Ch. 2; Solomon & Brush,
1956). It can also be extended to the learn-
ing of internalized reactions to transgres-
sions which have already been committed
(Aronfreed, 1963, 1964; Hill, 1960).


Temporal Locus of Internalized Anxiety 


Most of the acts which are defined by 
social punishment as transgressions are not 
punished indiscriminately without reference 
to the conditions of their occurrence. Ordi-


The anxiety induced by punishment may disrupt
punished responses not only because it motivates
alternative responses, but also because it serves in
itself as a competing response.


narily, an act has been punished in some
situations, but not in others. The suppression
of a transgression therefore cannot be viewed 
as being entirely mediated by intrinsic cues 
merely because it occurs in the absence of
surveillance. The changes of affective state 
which motivate and reinforce suppression 
must rather be dependent on stimulus com- 
plexes which consist of both intrinsic and 
external cues, even though the external cues 
may not include the presence of a socializing 
agent. In order to examine the function of 
intrinsic cues in the mediation of suppres- 
sion, it is necessary to have an experimental 
method in which these cues are given a vari- 
able relationship to the course of anxiety, 
while the role of external situational cues 
remains constant. One such method is to 
observe the effects of variation in the timing 
of punishment with respect to the initiation 
and completion of a punished act. The 
rationale for this procedure is implicit in the 
common finding that punishment is less ef- 
fective in suppressing the behavior of animals 
when it follows an act only after a relatively 
long temporal interval (Bixenstine, 1956; 
Kamin, 1959; Mowrer, 1960a, Ch. 2). The
particular variant of the procedure used here
was suggested by an informal report on work
in progress with dogs by R. L. Solomon (see 
Mowrer, 1960b, pp. 399-404). Solomon’s 
observations of 6-month-old puppies tenta-
tively indicated that punishment upon ap-
proach to a forbidden food resulted in more
prolonged suppression, during a test in which 
the experimenter was absent, than did pun- 
ishment after a considerable portion of the 
food had been eaten. There was also some 
indication that puppies who did transgress
during the test were more likely to show 
reactive signs of distress if they had been 
punished originally after having eaten part 
of the food. 

Any act may be regarded as having a 
number of distinguishable components. In 
addition to intentions or other implicit pre- 
cursors of the act, there are intrinsic cues 
which occur as the act is initiated. When 
the act is fully committed, there will be other 
intrinsic stimulus correlates which have some 
duration in time, and they may actually 
extend beyond the point of its completion.
If we assume that internalized suppression
requires some mediation of anxiety by the
sequence of intrinsic cues associated with a
punished act, then we would expect the
temporal locus of the original social punish-
ment to have a significant impact on the
effective motivation for subsequent suppres-
sion when the child is no longer under surveil-
lance. Punishment used at initiation of a
transgression would attach maximal anxiety 
to the intrinsic cues which occur at that 
point. In contrast, punishment administered 
only when a transgression has been already 
committed would tend to produce greater 
intensity of internalized anxiety at the point 
where subsequent transgressions have also 
been completed. 

The difference in the effects of these two 
temporal loci of punishment would be limited 
by generalization or spread of anxiety to 
elements of the act other than those im- 
mediately present at the point of punishment. 
But to the extent that responses alternative 
to transgression (including its suppression) 
are motivated by anxiety and reinforced by
anxiety reduction, they should have a greater
probability of being elicited by the incipient
cues of transgression (before the transgression
is actually carried out) if punishment origi-
nally occurred at its initiation. If punishment
occurred only after the commission of a 
transgression, the anxiety generalized to the 
point of onset might not be intense enough 
to motivate subsequent suppression when the 
child has been removed from the external 
presence of the punitive agent. Moreover, if 
punishment were to occur consistently after 
a transgression was socially perceived as 
having been already committed, suppression 
would not acquire much anxiety-reducing 
value at the point where maximal internalized 
anxiety would eventually be concentrated. 
When a child has completed a transgression, 
suppression is ordinarily no longer instru- 
mental to the avoidance or attenuation of 
punishment. 


METHOD 


Two experimental socialization paradigms were 
constructed to require children to choose, on each 
of nine training trials, between a punished class 
of responses and a nonpunished class of active 
alternative responses (rather than between punished
responses and passive suppression). This technique
insured that there would be some opportunity for
punishment training among all of the children, since
they could not reduce their anxiety about the
entire situation by resorting to a generalized sup-
pression of all active responses. Consequently, the
suppression that became evident during training ap-
peared not simply as absence of the punished
response, but rather as the positive choice of a non-
punished alternative. The children were confronted
with a relatively simple discrimination. They were
asked to choose between two small toy replicas of
real objects commonly found in their social environ-
ment, to pick up the chosen toy, and to describe
its function if they were asked to do so. The pair
of toys varied from trial to trial, but one was always
quite attractive, while the other was relatively un- 
attractive. Punishment was consistently adminis- 
tered for choice of the attractive toy, and the child 
was not permitted to describe its function. Choice 
of the unattractive toy was not punished, but was 
also not rewarded, except for the minimal social 
reinforcement inherent in being permitted to describe 
its function and in the experimenter’s noncommittal 
recognition of the child’s verbal statement (in the 
form of a casual “uh-huh”).

The experimenter’s role as a socializing agent was 
predominantly punitive and provided in itself no 
model for the behavioral changes which the child 
was to acquire. A very limited cognitive context 
for punishment was provided by telling the chil- 
dren that certain of the toys were supposed to be 
chosen only by older children. This restriction was 
imparted during instructions and was repeated each 
time that the child was punished. In one training 
condition, punishment was administered as soon as 
the child’s hand approached the attractive toy, and 
before the child picked it up. In the other training 
condition, the child was permitted to pick up the 
attractive toy and hold it for a few seconds (but 
not to handle it), and then punishment was given 
and the toy was removed from the child’s hand.
A control condition without explicit punishment was 
also devised, primarily to control for any possible 
effects of the difference between the two punishment 
conditions in habituation to picking up the toys 
during punished choices. 

Following the nine trials used under each of these
conditions, the children were tested for internalized
suppression of the punished behavior. The experi-
menter set out a tenth pair of toys, one of which
was highly attractive, and then left the room on a
pretext. When he returned, he was able to discern
whether the attractive toy had been picked up or
even moved. The experiment was not designed to
assess the effects of timing of punishment upon
internalized reactions to completed transgressions,
because of the difficulty of eliciting overt evidence of
such reactions without disclosing the hidden moni-
toring of the children’s behavior during the test for
internalization. However, an attempt was made
to get a tentative index of responses which the 
children might have used to reduce their anxiety 
following transgressions in the test situation. 


Subjects 


The subjects used in the experiment were 88 boys
from the fourth and fifth grades of two public
schools in a large urban school system. The
subjects were divided as follows among the three
conditions: Punishment at Initiation, 34; Punishment
at Completion, 34; Control, 20.


Procedure


Each child was taken individually from his class-
room to the experimental room by the experimenter,
who was a male. The child was asked to sit at a
small table, and the experimenter seated himself
across from the child. A rectangular wooden pres-
entation board, roughly 12 × 18 inches in size, lay
centered on the table. At the experimenter’s right
was a second table, upon which lay a black com-
partmentalized box and the experimenter’s recording
materials. The box had 24 compartments, each of
which held a pair of toy replicas. Only the first
10 pairs were actually used for the nine training
trials and the test trial. The toys were all quite
small, and varied in their outer dimensions from
roughly .5 inch to 2.5 inches. The attractive member
of each pair was always somewhat larger, had
more detailed fidelity to its realistic prototype, and
had a higher relevance to masculine interests (for
example, a tiny electric motor, a camera with
moving parts, etc.). The unattractive items were,
in contrast, smaller, shabbily designed, and more
generally associated with feminine interests (for ex-
ample, a barrette, a thimble, etc.). The pairs of items
were used in a fixed order under all conditions.

Punishment at Initiation. The experimenter’s in-
structions were given as follows:


I'm going to put some toys down here on this
board. Each time I'll put down two toys. Here's
what you do: pick up the one you want to tell
about, hold it over the board [the experimenter
indicated appropriate action with hand, using his
fingers to show that the item was to be easily visi-
ble when held], look at it for a while, and just
think about what you're going to say. Then, if I
ask you, tell me what it’s for or what you do
with it. Do you understand?


Following the child’s indication of understanding,
the experimenter continued:


Now some of the toys here are only supposed
to be for older boys, so you’re not supposed
to pick them. When you pick something that’s
only for older boys, I'll tell you.


The experimenter then began the first trial by 
placing the initial pair of toys on the board and 
saying: “All right, now pick up the one you want
to tell about, look at it for a while, and just think
about what you're going to say.” To indicate subse-


The experiment was made possible through the
cooperation of a number of administrators and
teachers in the Philadelphia public school system.


quent trials, the experimenter said: “All right, pick
up the one you want to tell about,” or simply “All
right.” The position of the attractive toy was
consistently alternated over the nine training trials, 
so that it appeared at the child’s left on odd-
numbered trials and at his right on even-numbered
trials (thus providing another cue, in addition to
attractiveness, to simplify the discrimination).
After the experimenter initiated each trial by 
placing a pair of toys, he dropped his hands lightly 
to an apparently casual resting position just behind 
the toys. When the child reached for the unattractive 
toy, he was permitted to pick it up and hold it 
over the board, in accordance with the instructions. 
After 2 or 3 seconds, the experimenter said: “All 
right, tell me about it.” Following the subject's 
description of the toy’s function (usually rather 
brief), the experimenter simply said “uh-huh” (quite 
flatly) and removed both toys from the board after 
the child had put down the toy he had chosen.
When the child reached for the attractive toy, the 
experimenter said: “No—that’s for the older boys” 
(firmly, though not very sharply), and raised the 
fingers of his hand behind the toy as though to 
slightly cover it. The experimenter’s verbal dis- 
approval was always given before the child actually 
touched the toy, but when the child’s hand was 
rather close to it. The disapproval was almost
always immediately effective in causing the child
to withdraw his hand without even touching the
toy. The experimenter then first removed the at-
tractive toy, leaving the unattractive one for a few
seconds, in the event that the child would want
to pick it up and thus correct his original transgres-
sion. But very few children ever used the opportu-
nity to pick up an unattractive toy, once they had
already been punished for their choice of the 
attractive one. 

Punishment at Completion. The instructions and 
general procedure for the training condition, and the 
experimenter’s behavior when the child chose an
unattractive toy, were identical to those described 
above for the Punishment at Initiation condition. 
The only difference was in the timing of punishment 
when the child chose an attractive toy. Here, the 
child was permitted to pick up the attractive toy 
and hold it over the board, just as in the case of 
an unattractive toy. Then, after 2 or 3 seconds, the 
experimenter said: “No—that’s for the older boys,” 
and gently but firmly removed the toy from the 
child’s hand (the position in which the children 
had been instructed to hold the toys was such 
that the toy was never actually handled and was 
readily removable). As in the first condition, the 
unattractive toy was removed from the board a few 
seconds later, so as to give the child an opportunity 
to correct himself. 

Control. The Control condition was constructed to 
observe the effects of lack of opportunity to pick 
up the attractive toys, during training in which 
there was no response-contingent punishment. In- 
structions and procedure were similar to those used 
for the punishment conditions, but had to be 
slightly modified to define a situation in which the
child would never actually pick up the toys, without
the necessity of using explicit punishment. A state- 
ment was also added to specifically indicate which 
of the toys were intended only for older boys, since 
the general prohibition on choosing such toys, im- 
parted in the instructions, was concretized in the 
othér training conditions by the information cor; 
veyed on punishment trials. 

The experimenter’s modified instructions were as 
follows: 


I’m going to put some toys down here on this
board. Each time I’ll put down two toys. Here’s
what you do—you just point to the one you want
to tell about [the experimenter demonstrated 
with his hand], look at it for a while, and just 
think about what you’re going to say. Then, if I 
ask you, tell me what it’s for or what you do 
with it. Do you understand? 


Following the child’s indication of understanding, 
the experimenter continued: 


Now some of the toys here are only supposed 
to be for older boys, so you're not supposed to 
pick them up. The nicer-looking, more interesting 
toys are for the older boys. 


The experimenter then initiated the training trials.
On each trial, the child was asked to indicate his
choice by pointing. When the child pointed to an 
unattractive toy, the experimenter said: “All right, 
tell me about it.” After the child had finished 
describing the toy’s function, the experimenter said 
“uh-huh” and removed both toys. When the child 
pointed to the attractive toy, the experimenter simply 
removed both toys without saying anything (a 
procedure which, while it was not explicitly puni- 
tive, may well have been frustrating or otherwise 
aversive to the child). 

Test situation. The two objects used on the tenth 
(test) trial were related to one another in the same
way as were the members of previous pairs used 
during training. But they were somewhat less toy- 
like. There was visible action within the attractive 
object, whereas the other object was not only 
unattractive but also relatively difficult to identify. 
Consequently, some generalization had to be exer- 
cised by the child in carrying his response tendencies 
to the test situation. The attractive object was a 
two-chambered glass timer in which an enclosed 
quantity of salt ran down from the upper to the 
lower chamber in a period of 1 minute (previous 
pilot work had suggested that the tendency to turn 
the timer, or at least to handle it, was difficult 
to resist). The unattractive object was a dingy, 
yellow, 2-inch square of terry cloth, folded into an 
approximation of a piece of toweling. As the tenth 
trial began, the experimenter was about to place 
these objects, but then turned hesitatingly to his 
folder of papers, in which he had been recording the 
child’s choices, and said (in a halting and distracted
fashion, while looking into the folder):


It looks like I forgot some of the papers I
... I must have left them in my car ... I’ll
have to go outside on the street and get them.


The experimenter then placed the pair of toys.
The presentation board was unfinished and had on
its surface a number of scratches. One of these was
actually a faint marker inserted for the test situa-
tion, a straight line about .5 inch in length.
The experimenter placed the timer so that a
scratch line on one of its bases was exactly
orthogonal to the faint line on the board. As the
experimenter placed the item, he also rose and said:


I’ll be gone for about 5 or 10 minutes ...
while I’m gone, you can be deciding which one
of these toys you want to tell about.


The experimenter then took his folder and left
the room, closing the door firmly behind him. The
experimenter remained out of the room for 5
minutes, and then reentered, first rattling the door-
knob, so as to give the child time to replace either
of the toys which he might have picked up. As
the experimenter was walking over to the table
where the child was sitting, he asked: “While I was
gone, were you thinking about the toys?” Then the
experimenter continued, after sitting down and
opening his folder to his recording sheet: “Well, I
don’t want you to choose yet, but tell me what
you were thinking while I was gone.” Following
the child’s response to this inquiry, the experimenter
asked: “While I was gone, did you decide on which
one you want to tell about?” The child’s responses
to each of these questions were recorded verbatim.
The experimenter also noted whether the timer had
been moved during his absence.

After the inquiry described above was completed, 
the experimenter closed the procedure on the pre- 
text that the remaining toys in the box could not 
be used because he could not find his misplaced 
papers. A further statement was added to put the 
child at ease about his performance and to invoke 
his cooperation in not discussing his experience with 
other children. 


RESULTS 




The primary observations were of whether 
or not the children picked up or handled 
the attractive minute-glass in the experi- 
menter’s absence, an index of the internalized 
effectiveness of the suppression acquired 
during training. Table 1 shows, for each of 
the training conditions, the numbers of chil-
dren who did and did not transgress. Chi-
square values for the 2 × 2 contingency tables
which compare each two of the three training
conditions, with respect to frequencies of
transgression and nontransgression during the
test, are as follows: Punishment at Initiation
versus Punishment at Completion: χ² = 11.54,
p < .001; Punishment at Initiation versus
Control: χ² = 12.44, p < .001; Punishment at
Completion versus Control: χ² = .19, ns.⁵
These differences in the effectiveness of 
internalized suppression, during the test situa- 
tion, were complemented by the behavior of 
the children during training. Despite the 
mild prohibition conveyed in the instructions, 
the children in the Control condition per- 
sistently pointed to the attractive toys, even 
though this behavior resulted in no opportu- 
nity to describe the function of the toys. The 
great majority of these children chose the 
attractive toy on six to eight of the nine 
trials. More than a few chose it on all nine 
trials. The discriminability of the toys within
each pair, and the consistent difference in
their attractiveness, were also apparent in the
responses of children in the punishment train- 
ing conditions, but in a very different way 
that clearly showed the effects of the punish- 
ment. Under punishment training, the typical 
sequence of behavior was to choose the 
attractive toy on the first one, two, or three 
trials, and then to fairly consistently choose 
the unattractive toy thereafter. A single 
punishment was sufficient, for 21 of the chil- 
dren, to inhibit further choices of the at- 
tractive toys. Occasionally, a child would 
revert once, near the middle of the series of 


5 All values shown here are for one-tailed tests 
and incorporate a correction for continuity. 


TABLE 1 


FREQUENCY OF TEST TRANSGRESSION AND 
NONTRANSGRESSION FOLLOWING EACH OF 
THREE TRAINING PARADIGMS 


                                   Training paradigms

Behavior during test     Punishment at    Punishment at    Control
                         Initiation       Completion
                         (N = 34)         (N = 34)         (N = 20)

Transgression                  9               24             16
Nontransgression              25               10              4


trials, to choosing an attractive toy, but
would then return immediately to choosing 
the unattractive ones. 
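
The chi-square values reported for Table 1 can be checked directly from the cell frequencies. The following is a minimal sketch, assuming that the "correction for continuity" mentioned in Footnote 5 is Yates' correction; the function and variable names are illustrative and do not come from the original report.

```python
from itertools import combinations


def yates_chi_square(a, b, c, d):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] with Yates' continuity correction."""
    n = a + b + c + d
    # Reduce |ad - bc| by n/2 (but not below zero) before squaring.
    diff = max(abs(a * d - b * c) - n / 2, 0.0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Table 1 frequencies: (transgression, nontransgression) for each paradigm.
TABLE_1 = {
    "Punishment at Initiation": (9, 25),
    "Punishment at Completion": (24, 10),
    "Control": (16, 4),
}


def pairwise_chi_squares(table):
    """Return {(first, second): chi-square} for every pair of training paradigms."""
    return {
        (first, second): yates_chi_square(*table[first], *table[second])
        for first, second in combinations(table, 2)
    }


if __name__ == "__main__":
    for (first, second), value in pairwise_chi_squares(TABLE_1).items():
        print(f"{first} versus {second}: chi-square = {value:.2f}")
```

Under that assumption, the three comparisons round to 11.54, 12.44, and .19, which agrees with the values given in the text.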

It is particularly interesting to note that
children punished at initiation of transgres-
sion exposed themselves to punishment less
frequently than did those punished at comple-
tion. Table 2 shows the distribution of fre-
quencies of punishment in the two punish-
ment training paradigms, separately for those
children who transgressed and those who did
not transgress in the test situation. The two
total training groups are significantly dif-
ferent from one another (χ² = 3.16, p < .05),
if we compare the number of children who
received less than two punishments with
those who received two or more punishments.⁶


6 The value shown is for a one-tailed test and
incorporates a correction for continuity. The com- 
parison given results in the least disproportionate 
division possible along the frequency scale. Separate 
analogous comparisons of the two punishment condi- 
tions, within the transgressor and nontransgressor 
groups, would not be meaningful because of the 


TABLE 2 


FREQUENCY OF TRANSGRESSION AND PUNISHMENT DURING TRAINING AMONG
TEST TRANSGRESSORS AND NONTRANSGRESSORS TRAINED UNDER
EACH OF TWO VARIATIONS IN TIMING OF PUNISHMENT


                              Frequency of punishment during training
Experimental group             0    1    2    3    4    5    6    7

Punishment at Initiation
  Transgressors                1    3    2    2    1    0    0    0
  Nontransgressors             1   11    7    5    1    0    0    0
  Both test groups             2   14    9    7    2    0    0    0

Punishment at Completion
  Transgressors                1    5   10    4    1    2    1    0
  Nontransgressors             0    2    3    3    0    1    0    1
  Both test groups             1    7   13    7    1    3    1    1




Clearly, it is not the number of punish-
ments during training that makes punishment
at initiation more effective as a paradigm for
inducing internalized suppression. A compari-
son of all transgressors with all nontrans-
gressors also indicates that behavior in the
test situation is not attributable to frequency
of punishment during training. 
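
The dichotomized comparison of the two punishment groups can be reproduced from the "Both test groups" rows of Table 2. The sketch below assumes that the frequency columns run from 0 to 7 punishments and that the correction for continuity is Yates' correction, here written in the observed-versus-expected form; all names are illustrative.

```python
# Number of children receiving 0..7 punishments during training
# ("Both test groups" rows of Table 2; column alignment is an assumption).
INITIATION = [2, 14, 9, 7, 2, 0, 0, 0]
COMPLETION = [1, 7, 13, 7, 1, 3, 1, 1]


def split_at(frequencies, cutoff):
    """Count children with fewer than `cutoff` punishments and with `cutoff` or more."""
    below = sum(frequencies[:cutoff])
    return below, sum(frequencies) - below


def corrected_chi_square(rows):
    """Continuity-corrected chi-square computed from observed and expected frequencies."""
    row_totals = [sum(row) for row in rows]
    col_totals = [sum(col) for col in zip(*rows)]
    n = sum(row_totals)
    total = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            # Subtract 0.5 from each absolute deviation before squaring.
            total += max(abs(observed - expected) - 0.5, 0.0) ** 2 / expected
    return total


def mean_punishments(frequencies):
    """Average number of punishments per child in one training group."""
    return sum(k * count for k, count in enumerate(frequencies)) / sum(frequencies)


observed = [split_at(INITIATION, 2), split_at(COMPLETION, 2)]
chi_square = corrected_chi_square(observed)

if __name__ == "__main__":
    print("Less than two / two or more:", observed)
    print(f"chi-square = {chi_square:.2f}")
    print(f"mean punishments: initiation = {mean_punishments(INITIATION):.2f}, "
          f"completion = {mean_punishments(COMPLETION):.2f}")
```

With these assumptions the split is 16 versus 18 children under punishment at initiation and 8 versus 26 under punishment at completion, and the corrected chi-square rounds to the reported 3.16.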

For reasons already set forth, it might be 
expected that children trained under punish- 
ment at completion would experience more 
anxiety following a transgression in the test
situation than would children trained under 
punishment at initiation. Since the anxiety 
was not likely to be so intense as to overflow 
into an overt display of affect, the only evi- 
dence of its presence would be in certain 
responses which might be instrumental to its 
reduction. In the present experiment, it was 
important that the experimenter’s knowledge 
of the child’s transgression not be revealed, 
and consequently it was not possible to em-
ploy techniques which would directly elicit
observable responses such as confession and
reparation. It appeared, however, that some
of the children might be using a quasi-con-
fessional response, predispositionally available
from past experience, to reduce their anxiety
following a transgression in the experimenter’s 
absence. When the experimenter returned to 
the room, there were virtually no spontane- 
ous responses from the children which could 
be easily classified as internalized reactions to 
transgression. But there were some interesting
variations of response to the inquiry about
their thoughts during the test situation. Chil- 
dren who had been in the Control condition 
almost invariably indicated that they had 
been “thinking” about the attractive test 
object, a verbal response that agreed with 
their behavior during the test. In contrast, 
a substantial majority of the children exposed 
to the two punishment conditions reported 
that they had thought about the unattractive 


extremely small samples which would appear in 
some cells of the relevant frequency tables. 

The effects of differential timing of punishment 
are also visible in the occurrence of reversals. Seven- 
teen of the children punished at completion reverted 
momentarily to choosing an attractive toy after 
suppression had begun to be established during train- 
ing, but this behavior was shown by only nine of 
the children punished at initiation.

TABLE 3


FREQUENCY OF ADMISSION AND NONADMISSION REACTIONS
FOLLOWING TEST SITUATION AMONG TRANSGRESSORS AND
NONTRANSGRESSORS TRAINED UNDER EACH OF TWO
VARIATIONS IN TIMING OF PUNISHMENT


                              Reaction following test situation
Experimental group             Admission    Nonadmission

Punishment at Initiation
  Transgressors                     4             5
  Nontransgressors                  1            24
  Both test groups                  5            29

Punishment at Completion
  Transgressors                    16             8
  Nontransgressors                  2             8
  Both test groups                 18            16


object (a few indicated that they had not
thought about either one). However, roughly
one third of the punished children did admit
thinking about the attractive object. Closer
inspection of the data, shown in Table 3,
revealed that these latter children were pre-
dominantly those trained under punishment
at completion, and that they were almost
entirely transgressors. In view of the overall
contrast between the Control and punishment
groups, it seemed that their response to the
inquiry might be regarded as a reaction of
“admission.”⁷

Comparison of the total numbers of trans- 
gressors and nontransgressors who showed ad- 
mission and nonadmission reactions shows a 
highly significant difference (χ² = 18.29, p <
.001). The same difference appears when the
comparison is made only among children
trained under punishment at completion (χ² =
4.44, p < .05; p < .02 for the Fisher exact
test). And a similar tendency is apparent
among children punished at initiation (though 
it does not attain statistical significance in 
the latter case). If the admission of having 


7 The term guilt is unwarranted in reference to the 
effects of this kind of experimental situation, if one 
regards guilt as a phenomenon of rather specific 
cognitive properties interwoven with an affective 
base of anxiety (Aronfreed, 1964). The situation 
does not provide the kind of evaluative processes 
through which the perception of transgression can
properly be said to arouse a moral affect (for
example, cognitive focus on intentions or on the 
consequences of action for others).

thought about the attractive test object did
serve to reduce the anxiety that followed 
transgression, then we would expect to find 
it more commonly among transgressors trained 
under punishment at completion than among 
transgressors trained under punishment at in- 
itiation. Inspection of Table 3 does reveal a 
tendency in this direction, but it does not 
attain statistical significance, in part because 
of the restricted number of children who 
transgress following punishment at initiation.⁸
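
The Fisher exact probability cited for the Punishment at Completion group can be recomputed from Table 3. The sketch below is a one-tailed hypergeometric calculation with illustrative names. Applying the same test to the comparison of transgressors across the two punishment conditions is an assumption made here for illustration; the report does not state that an exact test was used for that comparison.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """One-tailed Fisher exact p for [[a, b], [c, d]]: P(top-left cell >= a), margins fixed."""
    row1, col1, n = a + b, a + c, a + b + c + d
    favorable = sum(
        comb(col1, k) * comb(n - col1, row1 - k)
        for k in range(a, min(row1, col1) + 1)
    )
    return favorable / comb(n, row1)


# Table 3, Punishment at Completion: rows are transgressors and
# nontransgressors; columns are admission and nonadmission.
p_completion = fisher_one_tailed(16, 8, 2, 8)

# Table 3, transgressors only: rows are Completion and Initiation
# (an illustrative comparison, not a test reported in the text).
p_transgressors = fisher_one_tailed(16, 8, 4, 5)

if __name__ == "__main__":
    print(f"Completion, transgressors vs. nontransgressors: p = {p_completion:.3f}")
    print(f"Transgressors, Completion vs. Initiation: p = {p_transgressors:.3f}")
```

The first probability falls below .02, as reported, while the second is well above .05, which is consistent with the statement that the tendency among transgressors does not attain statistical significance.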


DISCUSSION


The difference between the effects of the
Punishment at Initiation and the Punishment
at Completion paradigms indicates that timing
of punishment is a very significant determi- 
nant of internalized behavioral suppression, 
at least when the punishment is accompanied 
by only minimal cognitive structure. This 
finding strongly supports the view that the 
suppression is some positive function of the 
intensity of the anxiety which is mobilized 
at the onset of a transgression, and that the 
anxiety is in turn a function of the original 
temporal relationship between this locus and 
the occurrence of punishment. It also sup- 
ports the broader conception of the internal- 
ization of social control through punishment 
as resting first on the attachment of anxiety 
to intrinsic cues associated with transgression, 
and secondly on the attachment of anxiety 
reduction to intrinsic cues associated with 
alternative nonpunished behavior. 

The great majority of the children who 
were exposed to the Control paradigm picked 
up the attractive toy during the test, even 
though they were never permitted to pick up
any toys during training. This last observa-
tion clearly indicates that transgressions had
to be specifically punished in order to induce
internalized suppression, and that the dif- 
ferential effects of the two punishment para- 
digms cannot be attributed to lack of op- 
portunity to pick up attractive toys during 
training with punishment at initiation. Effects 
which are similar to those of the Control 
paradigm are commonly observed in natural- 
istic socialization. In the absence of external 


8 The values shown are for one-tailed tests and
incorporate a correction for continuity. 


surveillance, previous mild injunctions and
prophylactic restrictions of opportunity to
transgress are often insufficient, without the
addition of punishment, to suppress behavior
which may have other highly reinforcing
consequences for the child.
It will be observed that the experimentally
induced suppression, during the test for in-
ternalization, cannot be attributed simply to
“generalized” anxiety that might have be-
come attached to the external cues which
remain when the socializing agent has left the
situation. Nor is it reasonable to suppose
that the suppression is mediated by variable
expectations about the risk of punishment.
The difference in the effects of the two varia-
tions in the timing of punishment was ap-
parent not only during the test, but also while
the socializing agent was still present during 
training. And during both training and test, 
external cues and conditions of risk were 
identical for children who were exposed to 
either of the two punishment paradigms. It is 
likewise implausible to attribute the experi-
mental findings to differences between the
two paradigms in whatever positive reinforce-
ment may have been associated with the act
of choosing (but never being able to tell 
about) an attractive toy. The children who 
were trained under punishment at completion 
were permitted to pick up attractive toys, but 
they were required to do so in such a way 
that they could not handle them. They were 
also subjected to an additional period of un-
certainty while they awaited the experi-
menter’s punitive response (particularly dur-
ing the early training trials), and to the
possibly enhanced frustration entailed in
having the toys removed from their hands.
Moreover, it should be noted that children 
trained under the Control paradigm trans- 
gressed even more freely during the test than 
did children trained under punishment at 
completion, even though they were prevented 
from picking up any toys during training. 
There may well have been some cognitive 
mediation of the experimental effects, despite 
the attempt to minimize cognitive structure. 
If the two variations in timing of punishment 
induced different temporal patterns of arousal 
and reduction of anxiety, these patterns might
have become intrinsically mediated by in-
tentions or other cognitive representations of
the sequential elements in the acts of choice.
Such cognitive interventions would tend to 
restrict the mediational role of cues which
were directly produced by the punished and
nonpunished acts. But they would not require 
any change in the more general view that 
behavioral suppression becomes internalized 
when the course of anxiety begins to be 
monitored by intrinsic stimuli. If one were to 
assume that the primary difference between 
the two experimental groups, at the point of 
decision between transgression and non-
transgression, was cognitive rather than moti-
vational—for example, that the perceived
determinant of punishment was reaching for 
the attractive toy in one group, but was 
picking it up in the other group—it would be 
difficult to account for the observed effects. 
Such an assumption would make no reference 
to the variable intensity of anxiety which 
might precede or accompany a punished act. 
It would lead us to expect that children in 
the Punishment at Completion group would
be more likely than those in the Punishment
at Initiation group to reach for the attractive
test toy. It would also lead, however, to the
prediction that the Punishment at Comple-
tion group would not go so far as to pick up
the attractive toy. But the observations during 
both training and the test for internalization 
indicate that children in this group do pick 
up the attractive test toys more frequently 
than do children trained under punishment 
at initiation.⁹
The acquisition and maintenance of inter- 
nalized suppression are not determined only 
by the intensity of the anxiety that becomes 
attached to the intrinsic cues associated with 
an incipient transgression. They are also af- 
fected by the reinforcement of behavioral alter- 
natives to the punished act (including suppres- 
sion). When the timing of social punishment 
is predictable to a child, as it is in the ex- 
perimental paradigms, it may result in a 
delay-of-reinforcement effect upon suppres-
sion. The direction of this reinforcement
effect would be parallel to the direction of the
motivational effect that timing of punishment
exercises upon suppression through its effect
on the intrinsic temporal locus of anxiety.
When children are confronted with choices
between punished and nonpunished acts, they
will experience some anxiety in connection
with any choice, particularly early in the
learning process before discriminant cues are
firmly established. And the temporal rela-
tionship between nonpunished behavior and
the reinforcement inherent in anxiety reduc-
tion will tend to be a direct function of the
timing with which punishment predictably
occurs for transgressions. In the Punishment
at Initiation paradigm, for example, the ex-
ternal cues which signal that the child will
not be punished begin to become apparent to
him as soon as his hand reaches an unattrac-
tive toy without a punitive interruption. The
anxiety reduction that reinforces the suppres-
sion of attractive choices thus soon becomes
virtually immediate. In the Punishment at
Completion condition, however, the anxiety
reduction that follows nonpunished behavior
is considerably more delayed, since the ex-
ternal safety signals do not occur until the
child has been asked to tell about the toy.
A possible implication of this analysis for
naturalistic socialization is that children who
are closely supervised by their parents might
tend to experience more immediate reinforce-
ment for their suppression of transgression.
It is also interesting to note that children
may be forced to rely too heavily on the ex-
ternal outcomes of their behavior, if anxiety
reduction cannot easily become discriminately
attached to the intrinsic correlates of non-
punished behavior. This difficulty might arise
if they were faced with complex discrimina-
tions in which they could not distinguish
between punished and nonpunished responses,
or if the anxiety induced by punishment were
so intense as to disrupt the discrimination
of relevant cues.


9 The assumption that the experimental groups
differ in their cognition of the determinants of
punishment is perhaps not very credible in any case.
The common general instructions to the children
clearly convey the idea that “picking” a toy (i.e., the
act of choice) is the relevant determinant.


Some Further Theoretical Implications 


The Punishment at Completion paradigm
seems to be hardly more effective than the
Control paradigm in producing internalized
suppression. This finding suggests that the
anxiety mobilized by an incipient trans-
gression during the test situation was not
sufficient to motivate suppression, when its
intensity was attenuated across a gradient of
generalization from the total complex of cues 
which were originally present at punishment 
of a completed transgression during training. 
The relative lack of effectiveness of the 
Punishment at Completion paradigm presents 
an instructive contrast to the observations 
made in a similar experiment conducted by 
Walters and Demkow (1963). These investi-
gators used reaching for attractive toys versus
touching the toys as their two temporal posi- 
tions of punishment, and found that these 
training conditions produced only a tenuous 
difference in the subsequent effectiveness of 
the child’s behavioral suppression in the ex- 
perimenter’s absence. The limitation on the 
effect which they observed was very probably 
due to the fact that their variation in timing 
cut a rather fine difference into the topog-
raphy of the punished act and into the gen-
eralization gradients of the anxiety induced
at the two points of punishment. A compari- 
son of their findings with the findings of the 
present study indicates that the anxiety in- 
duced by the social punishment of an act 
does generalize from the intrinsic stimuli 
which are immediately present at the point of 
punishment to closely surrounding stimulus 
components of the punished act. 

It may appear to be somewhat surprising 
that so many transgressions occur in the test 
situation following the Punishment at Comple- 
tion paradigm, since naturalistic socialization 
commonly produces effective internalized sup- 
pression, even though it is very dependent on 
the punishment of already committed trans-
gressions. Although the punishment of parents
and socializing agents may be extremely vari-
able in its timing, the ecology of socialization
does not present too many opportunities to 
introduce punishment when a child is only on 
the threshold of transgressions. Part of this 
apparent discrepancy between naturalistic 
and experimental socialization may be an 
artifact of the use of a gross index of sup- 
pression in the test situation—the occurrence 
or nonoccurrence of transgression in a limited 
time period. A more sensitive index, such as
elapsed time before the occurrence of trans-
gression, might have revealed differences in
the strength of the internalized suppression 
induced by the two punishment training con- 
ditions, without implying that the small vari- 
ation in timing was so powerful as to deter- 
mine whether a transgression could be elicited 
at all. The use of an elapsed time measure
might have disclosed, for example, that trans- 
gressions following the Punishment at Comple- 
tion paradigm occurred later in the test period 
than transgressions following the Control 
paradigm. Conversely, a longer test period 
might have raised the attractiveness of the
forbidden toy, or might have resulted in ex-
tinction of some of the anxiety attached to
intrinsic cues of incipient transgression, so
that transgression would have been more
common following training under punishment 
at initiation. 

Variation of the precise timing of punish- 
ment, within the microstructure of a punished 
act, may be a convenient method for teasing 
out the specific mechanisms through which 
behavioral suppression becomes acquired and 
internalized. But naturalistic socialization has 
a number of other features which would tend 
to dilute the significance of timing, and to
facilitate internalization even when punish-
ments typically follow transgressions at tem-
poral intervals well beyond those used in the
experimental paradigms. The anxiety aroused 
by the punishment of agents to whom the 
child has strong positive attachments (par- 
ticularly the parents) may be substantially 
greater than the anxiety aroused by an experi- 
menter’s verbal disapproval. And even if 
punishment follows a committed transgres- 
sion, a greater intensity of anxiety at its point 
of application will be more likely to insure 
enough generalization to motivate suppression 
at the point of subsequent incipient transgres- 
sions. The punishment and warning signals 
emitted by parents are also often patterned in 
accordance with the continuous or inter- 
mittent character of many of a child’s trans- 
gressions. Parents sometimes punish in the 
midst of a committed but sustained trans- 
gression, or after a discrete repeatable act 
that the child has completed but is about to 
initiate again. Under these conditions, sub- 
stantial anxiety can become directly attached 
to the intrinsic cues associated with an in-
cipient transgression, even though punishment
is originally contingent on visible commis-
sion of the transgression. Moreover, the child
may be given the opportunity to avoid or 
escape punishment by introducing its own 
behavioral control in the course of an ongoing 
transgression—a corrective option that is not
available in the experimental paradigms—
with the result that suppression may acquire
instrumental, anxiety-reducing value even
when it does not initially prevent the oc-
currence of transgression.

In the social interaction between parents
and children, the reinforcement of behavioral
suppression is not entirely defined, of course,
by the presence or absence of aversive out- 
comes. Parents often react with affection, 
praise, or other forms of positive reinforce- 
ment, when they are aware of evidence of 
suppression in their children’s behavior. A sig- 
nificant component of the intrinsic reinforce- 
ment that supports internalized suppression 
may consequently be derived from positive 
affect which was originally induced by social
rewards, rather than from the reduction of the
anxiety which is elicited by incipient trans-
gression. It is for this reason that situational
assessments of children’s already acquired dis-
positions to suppress socially prohibited be-
havior are ambiguous with respect to the
motivational antecedents and reinforcing con- 
sequences of the suppression. Such disposi- 
tions will be the resultants of a complex his- 
tory of interaction of the effects of direct 
punishments and rewards (and also, perhaps, 
of the effects of modeling). Two well-designed 
surveys (Burton, Maccoby, & Allinsmith, 
1961; Grinder, 1962) have, in fact, uncovered 
only tenuous and inconsistent relationships 
between children’s internalized suppression of 
social transgressions and the discrete practices 
of punishment or reward which are used by 
their parents. 

Probably the feature of naturalistic social- 
ization that most effectively insures internal- 
ized suppression, regardless of the temporal 
locus of punishment, is the extensive verbal 
mediation used by parents. A verbal medium 
of punishment makes it possible for the 
child’s anxiety to become monitored by in- 
tentions, conceptual labels, and other cogni- 


act as common mediators of anxiety. They 
can become attached to any of the concrete 


ance and aftereffects of a transgression. And
they can consequently bridge the microstruc-
ture and temporal separation of these con-
crete cues, so that the cues retain only a


anxiety. The intensity of the anxiety that is 
elicited at the point of an incipient trans-
gression would thus become independent of
the original temporal relationship between 
the cues which are immediately present at 


When a child is enabled to represent its in-
tentions to itself, for example, in close con-
junction with punishment that occurs long
after a transgression has taken place, then its
intentions may elicit sufficient anxiety to mo-
tivate suppression when they subsequently
intercede before a transgression is carried out.

A number of surveys (Bandura & Walters, 
1959; Maccoby, 1961; Sears, 1961; Sears 
et al., 1957, Ch. 7) have reported some evi-
dence that children’s internalized control over
socially prohibited actions is positively as-
sociated with the closeness of supervision
exercised by their parents. As was pointed
out earlier, it is unlikely that close super-


opportunity that it affords to punish the
child’s incipient transgressions. The associa-
tion is more probably generated by the tend-
ency of parents who closely control their
children’s behavior to also use verbal media-
tion and to be more attentive to the intrinsic
cognitive and motivational precursors of
transgression. Support for this observation
can be found in the correlates of the different
disciplinary methods to which parents in our
society are disposed (Aronfreed, 1961; Bron-
fenbrenner, 1958; Davis & Havighurst, 1946;
Kohn, 1959; Maccoby & Gibbs, 1954):
Middle-class parents tend to be more oriented
toward their children’s intentions. They are
likely to use reasoning and explanation to
induce an internal governor in their children,
and not merely to sensitize them to the
punitive external consequences of transgres-
sion. They also often actively induce their
children to initiate their own self-corrective 
processes. Working-class parents are more 
prone to react to the concretely visible 
consequences of their children’s transgres- 
sions, and to sensitize their children to the 
threat of punishment. Their methods of 
punishment are more direct and occur in a 
less verbal medium. And they are less 
oriented toward reinforcing signs of internally 
mediated control in their children. Middle-
class children do show more of a correspond-
ing orientation toward internal monitors in
the control of behavior, while working-class 
children show more of an external orienta- 
tion (Aronfreed, 1961; Boehm, 1962; Kohl- 
berg, 1963). Some surveys (Allinsmith, 1960; 
Bandura & Walters, 1959; MacKinnon, 1938; 
Sears et al., 1957, Ch. 7) have found direct
relationships between the internal versus ex-
ternal orientation of parental discipline and
parallel differences of orientation in children’s 
suppression of socially prohibited behavior. 
Children do acquire highly general and 
integrative evaluative systems for some areas 
of social behavior. Such value systems may 
affect internalized control over behavior in 
ways which are not apparent from the effects 
of direct response outcomes in a simple dis- 
crimination situation. It is possible, for ex- 
ample, that more massive and cognitive forms 
of internalization can occur through acquisi- 
tion processes of the kind implied in theories 
of identification. Certainly, stable behavioral 
changes can be induced in children through 
their tendency to reproduce the behavior of 
models (Bandura & Walters, 1963) without 
the initial support of direct external reinforce-
ment. But the experimental findings reported
in this paper show that internalized suppres-
sion can be acquired through a form of
aversive learning that is highly sensitive to 
the timing of punishment, a parameter of 
social learning that is not readily translatable 
into the child’s disposition to adopt the role 
of a model. A general conception of mechan- 
isms of internalization must take into ac- 
count, then, that some forms of internalized 
control over behavior can be established
through the direct reinforcement and punish-
ment of the child’s overtly emitted responses.


THE INFLUENCE OF INCENTIVE CONDITIONS ON THE SUCCESS OF
ROLE PLAYING IN MODIFYING ATTITUDES 1

This experiment was designed to investigate the effects of favorable vs. un-
favorable sponsorship (which has been a hitherto neglected variable in role-
playing research) and to test opposing predictions from “dissonance” theory and
“incentive” theory. Each of 64 male college students was interviewed privately
in his own dormitory room by a total stranger who introduced himself as a
pollster working for a private research company. Each S was assigned on a
stratified random basis to 1 of 8 experimental conditions in a 2 × 2 × 2
factorial design that enabled us to ascertain the main effects and interactions
of the following 3 independent variables: (a) overt role playing vs. passive
exposure to the same instructions and information; (b) unfavorable (blatantly
commercial) sponsorship of the role-playing assignment vs. favorable (public
welfare) sponsorship; (c) small monetary reward ($1) vs. large reward ($20)
for carrying out the role-playing task. Analysis of variance of attitude scores
showed that there were no significant main effects but, as predicted by in-
centive theory, there was a significant interaction such that more attitude
change occurred when overt role playing was carried out under favorable
sponsorship than under unfavorable sponsorship conditions. In general, the
results fail to verify predictions from dissonance theory and tend to support
an incentive theory of attitude change.


Earlier experiments have shown that role 
playing facilitates the modification of at- 
titudes. Janis and King (1954; King & Janis, 
1956) found that when students were re- 
quired to improvise a speech in support of a 
position contrary to their private beliefs, they 
showed an increase in attitude change as 
contrasted with a passively exposed control 
group. A similar increment in attitude change 
was found by Kelman (1953) when a group 
of school children was given a mild incentive 
to write convincing essays in support of a 
new attitude position. 

In these experiments, and in subsequent
ones which have confirmed the positive effects
of role playing (for example, Carlson, 1956;
Culbertson, 1957; Harvey & Beverly, 1961),
the role-playing task was assigned by some-
one who was himself a highly prestigeful
person or the representative of a highly 


1 The experiment described in this paper was
carried out under the auspices of the Yale Studies
in Attitude and Communication, a research project
established by the late Carl I. Hovland and sup-
ported by a grant from the Rockefeller Foundation.
The present Research Director is Irving L. Janis 
and the Associate Director is Robert P. Abelson. 
prestigeful organization whose goals were
consonant with those of the subjects. It seems
quite possible that the success of role playing
may partly depend upon positive incentive 
values attached to conforming with the 
demands of a benign or prestigeful sponsor. 
The main purpose of the present experiment 
is to determine whether favorable versus un- 
favorable sponsorship conditions influence the 
success of role playing with respect to induc- 
ing attitude changes. 

This problem was selected for systematic 
investigation not only because of the potential 
importance of sponsorship factors in attitude 
change, but also because it offers a rare op- 
portunity for testing rival theories concerning 
the psychological processes that might ac- 
count for role-playing effects. One theory, 
which takes account of the “self-persuasion” 
character of role playing, emphasizes the im- 
portance of positive and negative incentives 
in determining the outcome (see Hovland, 
Janis, & Kelley, 1953, pp. 223-237). Ac- 
cording to this “incentive” theory, when a 
person accepts the task of improvising argu-
ments in favor of a point of view at variance
with his own personal convictions, he be-
comes temporarily motivated to think up all
the good positive arguments he can, and at
the same time suppresses thoughts about the
negative arguments which are supposedly ir-
relevant to the assigned task. This “biased
scanning” increases the salience of the posi-
tive arguments and therefore increases the
chances of acceptance of the new attitude 
position. A gain in attitude change would not 
be expected, however, if resentment or other 
interfering affective reactions were aroused by 
negative incentives in the role-playing situa- 
tion. Among the obvious instances of negative 
incentives would be information that lowers 
the prestige of the sponsor or that leads to 
his being perceived as a manipulative person 
who is trying to influence people for his 
own personal aggrandizement or for other 
alien purposes. Any signs of exploitative in- 
tentions in the behavior of the sponsor would 
also be expected to operate as negative in-
centives, evoking responses that conflict with
the positive incentive value of improvising
arguments in support of the conclusion as-
signed by the sponsor. Thus, incentive theory
predicts that role playing will be more suc- 
cessful in inducing attitude change if the 
sponsor is perceived as someone whose affilia- 
tions are benign in character and whose in- 
tentions are to promote public welfare than if 
he is perceived as someone whose affiliations 
and purposes are commercial or exploitative. 

Exactly the opposite prediction would be 
made by the “dissonance”-theory explanation 
of role-playing effects, which postulates that 
“the existence of dissonance [among cognitive 
elements] gives rise to pressures to reduce 
the dissonance and to avoid increases in dis- 
sonance [Festinger, 1957, p. 31].” According 
to this theory, the crucial factor in role 
playing is that it creates dissonance be- 
tween the person’s private opinion and his 
awareness of what he is overtly saying and 
doing when he conforms with the sponsor’s 
demands; “the changes in private opinion 
which ensue are the end result of a process of 
attempting to reduce or eliminate this dis- 
sonance [Festinger, 1957, p. 112].” The total 
magnitude of dissonance is assumed to de- 
crease as the number and importance of the 
pressures and justifications which induce the 
person to say things he does not privately
believe are increased (see Brehm & Cohen,
1962, pp. 252-255). Consequently, the predic-
tion would be that more dissonance and there-
fore more attitude change will result from
role playing if the sponsor is perceived as
having objectionable affiliations or purposes
than if he is perceived as having intentions
consonant with the subjects’ own goals,
provided, of course, that the subjects can

structions.2 Accordingly, the present experi-
ment was designed to test the opposing pre-

theory by comparing the effects of a “posi-
tive” sponsorship condition that is relatively
consonant with the subject’s values with a
“negative” condition that is relatively dis-
sonant.
The experimental design includes a second 
independent variable—amount of monetary 
reward—which enables us to test some ad-
ditional differential predictions from the two
theories. Festinger and Carlsmith (1959)

tion of role playing is strongly supported by
the findings from an experiment in which
they compared the effects of a small and a
large monetary reward. They found that
when role-playing subjects were given a small
monetary reward ($1), more opinion change
occurred than when subjects were given a
large monetary reward ($20). In this experi-
ment, however, the experimenter presented a
justification that had potentially negative as
well as positive features and there are
grounds for wondering if the extraordinarily
large reward of $20 might have unintention-
ally generated some degree of suspicion or
wariness about being exploited by the ex-
perimenter or some degree of guilt about
being “bought” to lie to a fellow student
(see pp. 25-26).


2 Two studies of role playing (Cohen, Brehm, &
Fleming, 1958; Rabbie, Brehm, & Cohen, 1959) have
attempted to manipulate the justification for con-
formity to the sponsor’s demands. But these studies
provide ambiguous evidence because in each in-
stance the sponsor was described in terms that were
likely to make his status and prestige value dubious.
The control condition in each instance was also
equivocal in that no information at all was given
about the sponsor or his goals.
Cohen, using more plausible rewards, re-
ported a similar increase in attitude change
from small as against large monetary pay-
ments when college students were induced to
write essays in support of a personally ob-
jectionable position (Brehm & Cohen, 1962,
pp. 73-78). But Rosenberg (1965) failed to
confirm these findings when he carefully
repeated Cohen’s study, using almost identical
procedures except for a methodological re-
finement that involved having the experi-
menter who tested the subjects’ final attitudes
completely independent from the experimenter
who administered the role-playing procedure.
In fact, Rosenberg’s results show exactly the 
opposite outcome from the two experiments 


by proponents of dissonance theory, the great-
est amount of attitude change occurring
among the role players who were paid the 
largest amount of money. 

In view of the controversial evidence con-
cerning the effects of monetary rewards, we
used a factorial design that would enable us 
to obtain additional information about the 
effects of large versus small monetary rewards 
under the two different sponsorship conditions. 

The third variable introduced into the 
factorial design involved the degree to which 
subjects engaged in overt role playing by 
writing essays in which they verbalized argu- 
ments in support of the assigned position on 
the issue. According to a dissonance-theory
analysis by Brehm and Cohen (1962), the 
mere fact that a person commits himself to 
play any such role, before he executes it, 
“should be sufficient to produce the attitude-
changing dissonance [pp. 254-255].” They
cite evidence from a study in which no
difference was found between two groups, one
of which wrote essays in which the dissonant
position was overtly verbalized while the
other merely agreed to write the essay
(Rabbie, Brehm, & Cohen, 1959).

An incentive-theory analysis would lead us 
to predict a marked difference in favor of 
overt role playing: Commitment to play the
role could, of course, induce some degree of
attitude change (particularly if the subjects
engage in implicit role playing in anticipation
of executing the task); but actually carrying
out the task of verbalizing new arguments in
support of the objectionable position would
elicit more biased scanning and increase the
salience of new incentives, thus increasing
the likelihood of attitude change—provided
that no strong interfering responses are
evoked by unfavorable sponsorship or by
other unfavorable conditions.

In order to investigate the effects of overt
role playing, we compared two equivalent 
groups, both of which were given the same 
information and instructions: overt role 
players whose attitudes were assessed after 
carrying out the role-playing task, and con- 
trols whose attitudes were assessed at a time 
when they had already agreed to perform 
the role-playing task but before they were 
given the opportunity to do so.

Thus, the present experiment includes three
variables in a 2 × 2 × 2 factorial design: pub-
lic-welfare sponsorship versus unfavorable
(blatantly commercial) sponsorship; small 
reward ($1) versus very large reward ($20) 
for agreeing to carry out the role-playing 
task; overt versus nonovert role-playing con-
ditions. One of the advantages of this type
of design, of course, is that it enables us to
investigate interaction effects as well as main
effects.
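
As an illustrative sketch only (it is not part of the original procedure), the eight cells of the 2 × 2 × 2 design can be enumerated as follows. The cell sizes are taken from the Procedures section (seven subjects per control cell, nine per overt role-playing cell); the names `ROLE`, `SPONSORSHIP`, `REWARD`, `CELL_SIZE`, and `design_cells` are hypothetical labels introduced for this example.

```python
from itertools import product

# Factor levels as described in the text; the labels themselves are assumptions.
ROLE = ("overt", "nonovert")                   # experimental vs. control
SPONSORSHIP = ("public-welfare", "commercial")
REWARD = ("$1", "$20")

# Cell sizes reported under Procedures: nine per overt cell, seven per control cell.
CELL_SIZE = {"overt": 9, "nonovert": 7}


def design_cells():
    """Return the eight cells of the factorial design with their sizes."""
    return [
        {"role": role, "sponsorship": sponsor, "reward": reward, "n": CELL_SIZE[role]}
        for role, sponsor, reward in product(ROLE, SPONSORSHIP, REWARD)
    ]


cells = design_cells()
total_n = sum(cell["n"] for cell in cells)
# Within-cells degrees of freedom: total subjects minus number of cells.
within_df = total_n - len(cells)

for cell in cells:
    print(cell)
print("total N:", total_n, "| within-cells df:", within_df)
```

Under these assumptions the cells sum to the 64 subjects reported, and the within-cells degrees of freedom agree with the 56 listed in Table 2.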


METHOD 
Procedures 


In this experiment, 64 male sophomores and 
juniors at Yale University were seen in individual 
sessions at their undergraduate college residences.
The experimenter simply knocked at the door of
each room and asked the prospective subject to
speak with him alone for a short time. Only about
1 out of every 10 prospective subjects refused to 
participate. All such refusals occurred before any 
information was given about sponsorship or about 
any of the other conditions and hence there was 
no differential loss of subjects among the various 
subgroups. On a stratified random basis, seven
subjects were assigned to each of the four control
groups and, in order to increase the reliability of
the main effect measures, a somewhat larger num-
ber (nine) received each of the four different ex-
perimental treatments. The experiment was car-
ried out in six steps:3


3 Additional information concerning the instruc-
tions, procedures, and the questionnaire used to 
assess attitude change, together with incidental re- 
sults not included in the present report, are presented 
in a more extensive mimeographed document. Copies 
of this document are available at Yale University 
and can be obtained from the senior author upon 
request. 
1. The experimenter began with an explanation
of his alleged affiliation and purpose. For half 
of the subjects (decided in advance on a purely 
random basis) the experimenter conveyed a public- 
welfare sponsorship. Specifically, he explained 
that he was obtaining information on both sides 
of the issue, to be used in conducting a survey of 
college students’ attitudes toward science and
mathematics courses for a national research organ- 
ization on behalf of a number of the leading uni- 
versities in the United States. To the other half 
of the subjects, the experimenter conveyed an ex- 
ploitative commercial sponsorship. They were told 
that he was working for a new publishing company 
that was trying to build up the market for its
science textbooks and would use the information 
obtained in the interview for preparing an advertis- 
ing campaign. 

2. After the sponsorship information was pre-
sented, each subject was given a standard request to 
engage in a role-playing task. All groups were asked 
if they would be willing to write a short essay in 
favor of the proposition that a year of physics and 
a year of mathematics should be added as a re- 
quirement for all college students. Each subject was 
told that the task required thinking up good 
arguments that he personally would be inclined to 
accept. 

3. For half the subjects in each sponsorship group, 
the amount of monetary reward offered for carrying 
out the task was $20. For the other half, it was $1. 

4. The experimenter abstained from putting any
special pressure on the subjects when he asked them 
to decide whether or not to take on the job assign- 
ment, even though a sizable number—especially in 
the unfavorable sponsorship group—showed some 
reluctance or hesitation. As soon as each subject 
announced his decision, he was paid in advance 
and signed a receipt for the payment. Thus, an 
unambiguous commitment to perform the task was 
elicited from every subject, whether in a control
or experimental group. Those in the latter (overt
role playing) groups were given 10 minutes to 
write their essays. In order to stimulate their think- 
ing, these subjects were given a few general ques- 
tions (for example, “Considering the type of career 
you are likely to be in, how might a background in 
physics and math enable you to function more 
adequately?”). These questions contained no new 
information but were found, in a prior pilot study, 
to serve a “pump-priming” function, without 
having any direct effects on attitudes. Control 
subjects were told that they would be asked to 
write their essays a little later, but that first there 
was a questionnaire to be filled out. Thus, the 
controls were given the final questionnaire im- 
mediately, without being asked any pump-priming 
questions and without engaging in any overt role 
playing. 

5. The same questionnaire was given to the role 
players and to the controls. It contained a series of 
objective questions and a final open-end question, 
6. After completing the questionnaire, each subject
who had engaged in role playing was asked to rate
each of the arguments he had given as “Very good,”
“Fairly good,” or “Inadequate,” to indicate his
own personal evaluation of each one.

as well as each of the role players, was given a
standard interview to find out how he had perceived
the sponsorship and reward conditions, as a check
on the adequacy of the experimental manipulations.
The subjects were then told the truth concerning the
purpose of the study and the experimenter’s actual
affiliations. At this time the subjects were paid a
standard fee of $2 for participating in the experi-
ment. (With only one exception, all subjects given
the $20 payment in advance readily returned $


RESULTS 


test, we were able to assess the adequacy of
the manipulations of each of the three main
variables.

Sponsorship. The final interview included a
series of questions designed to determine
whether the information given to the subjects
succeeded in conveying the intended percep-
tions of the sponsor and the purpose of the
task. Of the 18 experimental subjects given

authenticity of the benign sponsorship at the
end of the session and only 2 subjects said
that they thought there might be more
than was indicated by the experimenter.

major universities were conducting the survey
because a large industrial organization or
foundation was putting up the funds for it.
All 18 subjects given the commercial-sponsor-
ship instructions indicated that they accepted
the explanation and felt that the primary

order for the publishing company to increase
its profits. None of the subjects guessed that
the study was intended to investigate the
effects of writing the essay on their personal
attitudes. 

Small versus large monetary incentive. For 
those paid $1, most subjects asserted in the 
final interview that the pay was felt to be 
satisfactory or ample. The comments of those
paid $20, however, indicated that this pay-
ment was perceived as being so extremely
large that it was a highly ambiguous stimulus.
Most subjects who received $20 reported feel-
ing puzzled because they could not under-
stand why such a huge overpayment was
being made. Every one of these subjects as-
serted that it was in excess of what he re-
garded as appropriate for the work involved.
Seventeen of the 18 expressed clear-cut sur-
prise (for example, “This I cannot compre-
hend . . . it was much too much; $5 would be
quite sufficient . . . I was rather shocked.”)
A few subjects reported that despite their
puzzlement they responded to the large re-
ward as a strong positive incentive, but a
larger number reported having reacted in the
opposite way, regarding the unbelievably
large payment as a source of vague suspicions,
guilt, or conflict (for example, “I feel like a
bastard for going against my cause; but for
$20, what the hell.”)

Thus, it turns out that in repeating the two 
reward conditions used by Festinger and Carl-
smith (1959), we are comparing a small re-
ward that is perceived by our subjects as ap-
propriate for the task at hand with an am-
biguous large reward that is perceived by the
students as a surprising, inappropriate over- 
payment. 

From the standpoint of incentive theory, no 
clear-cut prediction is possible concerning the 
immediate effect of the large conflictful re-
ward versus the small appropriate reward,
since there is no way of knowing whether the
large monetary payment is a stronger or
weaker incentive than the low payment. This
ambiguity, however, does not affect the op- 
portunity to obtain clear-cut results on the 
effectiveness of the overt role-playing versus 
nonovert role-playing condition nor of the 
public-welfare versus commercial-sponsorship 
conditions, for which there is evidence that 
the manipulations were successful. 

Overt versus nonovert role-playing condi-
tions. We examined the essays to see if the
overt role-playing instructions were carried
out and, as expected, the essays were found
to be devoted almost exclusively to present-
ing cogent arguments in support of the as-
signed position. In order to determine whether
the control subjects engaged in implicit role
playing, they were asked, just before being 
dehoaxed, whether they had thought about 
any arguments in anticipation of writing the 
essay. Seventeen of the 28 controls (61%) 
reported having started to plan their essay in 
advance and 13 controls (46%) were able to 
mention one or more specific pro arguments 
that they had thought up. Nevertheless, the 
amount of role playing in which they could 
engage was severely curtailed, as compared 
with the experimental group of overt role 
players, who were given considerably more 
time to think about the essay, to concen- 
trate on the task, and to write down the argu- 
ments. The control condition, therefore, can 
be regarded as representing a relatively low 
degree of role-playing activity, characterized 
by a brief opportunity for implicit role play-
ing with no overt performance at all, as com-
pared with the relatively high degree of overt
role-playing activity in the experimental con-
dition.

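
As a small arithmetic check (an illustrative sketch, not part of the original analysis), the percentages reported for the control subjects follow from the 28 controls, that is, four control cells of seven subjects each. The helper name `percent` is hypothetical.

```python
def percent(count: int, total: int) -> int:
    """Return count/total as a percentage rounded to the nearest whole number."""
    return round(100 * count / total)


# Four control cells of seven subjects each, as described under Procedures.
n_controls = 4 * 7

planned_in_advance = percent(17, n_controls)   # controls who began planning the essay
named_pro_argument = percent(13, n_controls)   # controls who mentioned a specific pro argument

print(planned_in_advance, named_pro_argument)
```
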
Effects on Attitudes 


Row 1 in Table 1 shows the mean total 
scores obtained for each of the experimental 
and control subgroups on a series of five ob-
jective items, which asked the subjects to
evaluate the proposed educational policy. 
Comparison of Columns 9 and 12 indicates 
that for the experimental groups, those who 
engaged in overt role playing under condi- 
tions of public-welfare sponsorship obtained 
higher scores than those who did so under 
conditions of commercial sponsorship. For the 
control group, this trend does not occur and 
hence it is a characteristic of overt role-play-
ing effects. A series of t tests, based on the
error estimate from an overall analysis of 
variance (Table 2), was applied to the differ- 
ences in combined scores shown in Columns 
9 and 12. For the experimental (overt role 
playing) groups, the mean for public-welfare 
sponsorship (10.12) was found to be signifi- 
cantly larger than the mean for commercial 
sponsorship (6.06). This difference was sig-
nificant beyond the .01 level. The correspond-
ing difference for the control group (6.94
versus 9.08) was found to be nonsignificant
(p > .25). Among the control groups, how-
ever, there is an unusually low mean score in
Column 4, the public-welfare sponsorship-$1
condition, which will be discussed later (see
p. 24).

Table 1 also shows the results of a sys- 
tematic content analysis of the arguments 
presented by the subjects in their written 
answers to the open-end question which asked 
the subjects to state the recommendation they 
would make to a university committee on the
issue of whether their own college should 
adopt the proposed policy. The answers were 
rated independently by two judges, each of 
whom was given typescript copies of the an- 
swers without any indication of the group to 
which any subject belonged. These blind rat-
ings consisted of assessing each argument as
favorable, unfavorable, or neutral toward the 
proposed policy. The mean attitude scores 
shown in the lower portion of Table 1 are re- 
stricted to explicit pro and anti arguments. 

The reliability of the blind ratings of the 
two judges proved to be very high when 
tested by comparing the frequencies of pro 
and anti arguments they reported for each 
subject. The reliability coefficient was above
.95 (N = 64).

In Table 2, the results of an analysis of
variance (corrected for unequal Ns) are
shown for each of the two attitude measures
included in Table 1. The main finding is the
significant interaction of overt role playing
with sponsorship (p < .03 on the first meas- 
ure and p< .01 on the second). On both 
measures, we find that among the subjects 
who were induced to engage in overt role 
playing, public-welfare sponsorship produced 
significantly more attitude change than com- 
mercial sponsorship; whereas, among those in 
the control condition (nonovert role playing), 
the difference is slight and in the opposite 
direction. 

The analysis of variance of attitude scores 
based on explicit pro and anti arguments 
shows a significant main effect for overt role 
playing, but this is attributable entirely to the 
large positive scores in the public-welfare-
sponsorship condition and hence is merely
another reflection of the interaction of overt 
role playing with sponsorship. For this atti- 
tude measure, just as for the first one, the 
analysis of variance shows that the monetary 
reward factor has no significant main effect 
and does not enter into any significant inter-
action with any other variables.
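
The F ratios discussed above can be checked against the mean squares in Table 2 with a short sketch. It assumes that each F is simply the effect mean square divided by the within-cells mean square (taken here as 26.7 for the objective items and 4.4 for the open-end score); the dictionary and function names are hypothetical, and because the published mean squares are rounded, the recomputed ratios are approximate.

```python
# Mean squares as read from Table 2; keys are illustrative labels.
MS_WITHIN = {"objective": 26.7, "open_end": 4.4}

MEAN_SQUARES = {
    "A (role)":    {"objective": 0.0,   "open_end": 18.1},
    "B (sponsor)": {"objective": 29.0,  "open_end": 1.6},
    "C (reward)":  {"objective": 29.0,  "open_end": 1.0},
    "A x B":       {"objective": 151.0, "open_end": 34.7},
    "A x C":       {"objective": 14.0,  "open_end": 0.1},
    "B x C":       {"objective": 6.0,   "open_end": 9.0},
    "A x B x C":   {"objective": 25.0,  "open_end": 0.4},
}


def f_ratio(ms_effect: float, ms_within: float) -> float:
    """F statistic for a 1-df effect: effect mean square over error mean square."""
    return ms_effect / ms_within


def f_table() -> dict:
    """Recompute every F ratio, rounded to one decimal place."""
    return {
        effect: {
            measure: round(f_ratio(ms, MS_WITHIN[measure]), 1)
            for measure, ms in row.items()
        }
        for effect, row in MEAN_SQUARES.items()
    }


for effect, ratios in f_table().items():
    print(f"{effect:12s} objective F = {ratios['objective']:4.1f}   "
          f"open-end F = {ratios['open_end']:4.1f}")
```

On these assumptions the role × sponsorship interaction is the only effect with a sizable F on both measures, which is consistent with the pattern described in the text.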

The pro arguments presented by the four 
overt role-playing groups in response to the 
open-end question were found to be very simi-
lar to the ones which the subjects had pre-
sented earlier in their role-playing essays. On
the average, about two-thirds of the pro argu- 
ments given by each group were the same as 
the ones given in their essays, while one-third 
were new ones. This observation carries an 
important implication with respect to the dif- 
ferences in the relative frequency of pro argu- 


TABLE 2
ANALYSIS OF VARIANCE OF ATTITUDE SCORES

                                         Combined score for      Response to open-end question:
                                         five objective items    Net attitude score
Source                              df       MS        F              MS        F
Role (experimental condition) (A)    1      0.0     < 1.0            18.1      4.1*
Sponsor (B)                          1     29.0       1.1             1.6    < 1.0
Reward (C)                           1     29.0       1.1             1.0    < 1.0
A × B                                1    151.0       5.7**          34.7      7.9***
A × C                                1     14.0     < 1.0             0.1    < 1.0
B × C                                1      6.0     < 1.0             9.0    < 2.0
A × B × C                            1     25.0     < 1.0             0.4    < 1.0
Within cells                        56     26.7                       4.4
Total                               63

* p = .05.
** p < .03.
*** p < .01.

ments shown in Table 1; namely, that favor-
able sponsorship makes for a greater tend- 
ency for the role player to accept the argu- 
ments that he previously had presented when 
he wrote the essay to fulfill the demands of 
the overt role-playing task. 

In order to compare the quality of the es-
says written by the four experimental sub-
groups, the judges’ blind ratings of the essay
arguments were analyzed in the same way as 
the responses to the final open-end question. 
As expected, the largest number of explicit 
pro arguments was given by the subgroup 
that received the least dissonant treatment 
($20 payment under public-welfare-sponsor- 
ship conditions) and the smallest number was 
given by the subgroup that received the most 
dissonant treatment ($1 payment under com- 
mercial-sponsorship conditions). The means 
are shown in Row 3 of Table 3; the t test re-
sults (based on the error estimate from an
overall analysis of variance) show a highly 
significant difference between the subgroups 
given the most dissonant and the least dis-
sonant treatments. The analysis of variance
showed that this measure of the quality of 
the essays was significantly affected in the 
positive direction by the favorable versus un- 
favorable sponsorship variable (p < .05) and 
by the high versus low monetary reward (p
< .05). Thus, there is some evidence that the
positive incentive conditions had a significant 
effect with respect to inducing a better qual- 
ity of performance of the role-playing task. 

Additional but inconclusive evidence was 
also obtained from the subjects’ own self- 
ratings of the arguments in their essays. The 
highest frequency of good quality arguments 
(those rated by the subjects as “very good”) 
was again found in the subgroup paid $20 
under favorable sponsorship conditions and 
the lowest frequency in the subgroup paid $1 
under unfavorable sponsorship conditions; 
but an analysis of variance showed that these 
subgroup differences were not large enough to 
be statistically significant. 


DISCUSSION


The finding of a significant interaction 
between sponsorship and role-playing condi- 
tions suggests that the psychological processes 
operating when a person engages in overt 
role playing may be somewhat different from
those operating at the stage when the person
has decided to play the role but has not yet
executed it. The findings for the nonovert
role-playing condition seem to point in the
direction of dissonance-theory predictions in
that favorable sponsorship gave rise to some-
what less acceptance of the new position than
unfavorable sponsorship, although the dif-
ferences are not large enough to be statisti-
cally significant. But it is noteworthy that
the relatively low degree of acceptance of
the new position among the nonovert role
players who were exposed to the unfavori 
sponsorship condition is entirely attribu 
to the subgroup that received the s 
monetary payment. According to the 
sonance-theory analysis, under favor 
sponsorship conditions (as well as under 
favorable sponsorship conditions) a latg 
reward should decrease dissonance and, 
fore, should evoke less attitude change 
a small reward. Our control group findings 
however, show the reverse outcome (s 
Columns 4 versus 5 in Table 1). j 
In any case, since the differences among
the control groups are not statistically sig-
nificant, we would not expect to find that in
replications of our study the group of non-
overt role players in the favorable sponsor-
ship condition will consistently show less
attitude change than those in the unfavorable
sponsorship condition. This expectation is
borne out in a study by Elms and Janis (in
press), which replicated and extended the
present research. In the Elms and Janis
experiment, the nonovert role-playing groups
again yielded no significant differences (but
this time those in the unfavorable sponsorship
condition showed somewhat less attitude
change than those in the favorable sponsor-
ship condition, the lowest scores occurring
in the control subgroup that was given
payment and unfavorable sponsorship); the
data from the overt role-playing groups,
however, again showed that a significantly
greater amount of attitude change was
produced by favorable incentive conditions.
The results in Tables 1 and 2 show that
overt role playing proved to be significantly
more effective than nonovert when sponsor-
ship was favorable, but not when sponsor-
ship was unfavorable. This interaction out-
come supports one of the assumptions of
an incentive-theory analysis: Overt verbaliza- 
tion of one’s own improvised arguments in 
favor of an initially unaccepted attitude po- 
sition will facilitate biased scanning of the 
positive incentives and therefore will lead 
to more attitude change—provided that there 
are no strong interfering responses engendered 
by feelings of suspicion, guilt, or other nega-
tive reactions of the type elicited by unfavor-
able situational conditions.

The results in Tables 1 and 2, together
with the subsidiary t test data presented in
the text, support the hypothesis that the 
effectiveness of overt role playing depends 
upon favorable sponsorship conditions. Un-
favorable sponsorship (as represented by a
commercial company, which was hiring the
subjects to help them prepare advertising
copy) resulted in significantly less personal
approval of the induced position than conso-
nant sponsorship (as represented by a public-
welfare organization, which was hiring the
subjects to help prepare for a nationwide 
opinion survey). Thus, the overt role players 
were more influenced when the sponsor’s af- 
filiations and goals were presented to them 
as being consonant with their own values 
than when they were presented as being more
dissonant. 

According to dissonance theory, if a person 
complies with a role-playing demand, the less 
the reward or justification he has been of- 
fered for arguing in favor of a position that 
is contrary to his own personal opinion, the 
greater the dissonance and the stronger the 
motivation to reduce the dissonance by 
changing his personal opinion (Festinger,
1959, p. 95). Dissonance theory asserts that
the effectiveness of role playing will be de-
creased by any factor that makes the role-
playing task more rewarding, more justifiable,
or more compatible with the subject’s pre- 
existing values and attitudes. If this theory 
were correct, we should have found that 
among the overt role players the commercial 
sponsor was more effective than the public-
welfare sponsor; but the data show the op-
posite outcome. We should also have found
that the low-reward condition ($1) was 
more effective than the large-reward condition
($20); whereas the data show no significant
difference. Our results on the effects of $20 
versus $1 payments fail to confirm the find- 
ings of the Festinger and Carlsmith (1959) 
experiment, in which the same amounts were 
compared. There is no reason to expect that 
with larger samples the differences predicted 
by dissonance theory would have emerged 
because, with the present sample (N = 18 in
each reward condition), the trend is in the 
opposite direction from that predicted by 
dissonance theory. The nine overt role play- 
ers given the most dissonant conditions ($1 
reward with commercial sponsorship) ob- 
tained lower approval scores on the attitude 
measures than the nine given the least dis- 
sonant conditions ($20 reward with public- 
welfare sponsorship). The pertinent results, 
extracted from Tables 1 and 2, together 
with t test data based on the error estimates
obtained from the corresponding analyses of 
variance, are summarized in Table 3. It will 
be noted that on both attitude indicators, the 
differences are large enough to approach sta- 
tistical significance (p < .10, two-tailed test). 
These findings parallel those obtained from
the blind ratings of the quality of the essays,
shown in Row 3 of Table 3. Thus, our
results fail to bear out the prediction from
dissonance theory that the most dissonant 
condition of role playing will produce the 
most attitude change; in fact, the outcome 
tends to be the opposite. 
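
The direction of the subgroup differences summarized in Table 3 can be checked with a few lines of arithmetic. The following is only an illustrative sketch: the subgroup means are transcribed from Table 3, while the dictionary keys and the function name are labels invented here for the example and are not part of the original analysis.

```python
# Subgroup means transcribed from Table 3 (N = 9 in each subgroup).
# Each tuple is (most dissonant condition, least dissonant condition).
# The key names are illustrative labels, not the authors' terminology.
TABLE_3 = {
    "approval_score": (5.56, 10.12),
    "open_end_net_attitude": (-0.33, 1.45),
    "essay_quality": (-0.22, 1.45),
}


def differences(table):
    """Return the most-dissonant minus least-dissonant mean for each measure."""
    return {
        name: round(most - least, 2)
        for name, (most, least) in table.items()
    }


diffs = differences(TABLE_3)
for name, diff in diffs.items():
    # A negative difference means the most dissonant subgroup scored lower.
    print(f"{name}: {diff:+.2f}")
```

Every difference is negative, which is the opposite of the ordering that the dissonance prediction requires. The sketch checks direction only; it says nothing about statistical significance.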

Our findings call into question the general-
ity of those reported by the adherents of dis-
sonance theory (Brehm & Cohen, 1962;
Festinger & Carlsmith, 1959) with respect 
to the way in which small versus large 
amounts of money paid in advance will influ- 
ence the effectiveness of role playing. Their 
results are subject to a number of alternative 
interpretations. For example, in the Festinger 
and Carlsmith experiment, the students were 
informed by the experimenter that he regu- 
larly does not tell the truth to his subjects 
about the purpose of his study and the role- 
playing task consisted of helping to perpe- 
trate this type of deceit by lying to a fellow 
student who would be the next subject. 
Under these rather unfavorable conditions, 
the finding that a $20 reward resulted in 
less attitude change than a $1 reward might


TABLE 3

SUBGROUP COMPARISONS SHOWING THE INFLUENCE OF EXTREME DISSONANT AND CONSONANT
CONDITIONS ON THE EFFECTIVENESS OF OVERT ROLE PLAYING

                                         Role players in    Role players in
                                         the most disso-    the least disso-
                                         nant condition     nant condition     Difference
                                         ($1 reward, com-   ($20 reward, pub-  (most minus
                                         mercial sponsor)   lic-welfare        least
Effect measure                           (N = 9)            sponsor) (N = 9)   dissonant)

1. Total approval score on five
   objective questionnaire items
   (from Table 1)                             5.56              10.12            -4.56
2. Responses to final open-end
   question (net attitude scores
   based on explicit arguments;
   from Table 1)                             -0.33              +1.45            -1.78
3. Quality of role-playing essays:
   explicit pro minus explicit anti
   arguments (a)                             -0.22              +1.45            -1.67

Note. Higher scores on the two attitude measures indicate more attitude change following role playing.
(a) For the other two subgroups, the corresponding mean was 1.0 in both instances.
* p = .08.
** p = .07.
*** p < .01.


well have been the consequence of a predomi- 
nance of negative incentive effects over what- 
ever positive effects accrued to the higher 
monetary value. Blatant lying to a fellow
student may be felt by college students to
be even more objectionable than writing
down some arguments to be used in the future 
as advertising copy in a commercial sales 
campaign that will try to promote an educa- 
tional policy with which one disagrees, as 
in our present experiment. If we are correct 
in surmising that the context for the role- 
playing activity in the Festinger and Carl- 
smith situation was likely to evoke suspicion 
or guilt, we would expect an inordinately
large reward to stimulate a considerable in-
crease in suspiciousness, guilt, or other nega-
tive feelings to such an extent that the posi-
tive incentive value accruing to the increased
financial reward could be outweighed and 
thus produce less attitude change than a 
lower amount of reward. 

In order to assess the effects of monetary 
rewards in relation to incentive theory, it is 
necessary to take account of the way in which 
the different amounts of reward were per- 
ceived by the subjects. No information about 
this is provided either by Festinger and 
Carlsmith or by Cohen for the role-playing 
situations they investigated. In the present 
experiment, the final interviews indicated that 
the large monetary reward was an ambiguous 
incentive. While the $1 reward was regarded
by the subjects as a satisfactory payment,
the $20 reward evoked diverse and often highly
conflictual anticipations, apparently function-
ing as a predominantly positive incentive for
some subjects, as a negative one for others,
and as a source of puzzlement for everyone
who received it. Hence no specific prediction
about the monetary reward effects can be
made from incentive theory, since clear-cut
predictions are possible only when the larger
incentive can be considered as a predomi-
nantly positive one that evokes relatively
little guilt, suspicion, or other negative affects.

According to incentive theory, if a very
large reward generates negative affect, it will
tend to interfere with acceptance of the con-
clusions advocated in the role-playing per-
formance; but if the monetary reward elicits
positive feelings of gratitude and satisfaction,
we would expect it to facilitate acceptance.
Thus, the possibility arises of an interaction
between sponsorship and reward: With posi-
tive sponsorship, the very large monetary
reward would be expected to have a predomi-
nantly positive incentive effect and thus
make for an increase in the amount of atti-
tude change; whereas, with negative sponsor-
ship a very large reward would be more likely
to induce negative affects that would tend
to make for less attitude change. No such
interaction trend emerged in the present
study, perhaps because it was obscured by


ment for the high-reward condition, or be- 
cause the less favorable sponsor was still 
regarded as a socially acceptable business 
enterprise that was preparing for a legitimate
advertising campaign. In order to test the 
interaction hypothesis, it is essential to carry 
out another experiment that replicates the 
design of the present one except that: the 
effects of a plausible large reward should be 
compared with the effects of a small reward; 
and the role-playing task, when carried out 
under unfavorable sponsorship conditions, 
should be perceived as a counternorm action 
(as might have been the case in the Festinger 
and Carlsmith, 1959, experiment).* The evi- 
dence from the present experiment, however, 
suggests that when the overt role-playing 
activity does not violate any dominant norms
of our society: (a) the payment of a large 
versus small monetary reward has no effect on 
the amount of attitude change; and (b) favor- 
able sponsorship is markedly more effective 
than unfavorable sponsorship, irrespective of 
whether the accompanying monetary reward 
is large or small. 


4 A replicating experiment of the type described
here, using entirely different types of sponsors, role-
playing instructions, and counternorm attitude con-
tents, has been carried out (Elms & Janis, in press).


WHEN DISSONANCE FAILS:


ON ELIMINATING EVALUATION APPREHENSION FROM
ATTITUDE MEASUREMENT 1


Some propositions concerning evaluation apprehension and affect arousal as 
systematic, data-biasing contaminants are presented. The earlier finding, in 
support of dissonance theory, that the degree of reward for counterattitudinal 
advocacy has an inverse effect upon consequent attitude change is attributed 
to the operation of these contaminants. An altered replication of 1 major
study on counterattitudinal advocacy is reported. By reducing the likelihood
that the posited contaminants will be activated it is found that when S writes
an essay opposite to his own attitude this changes his attitude in direct
proportion to the amount of payment received. Additional data show a
generalization of this effect to a related attitude. These data are interpreted
as supporting a consistency theory analysis of this type of attitude change and
as calling the dissonance-theory interpretation into question.


Certain studies that have been important 
in advancing the dissonance-theory explana- 
tion of attitude-change phenomena seem to 
be open to a particular kind of reinterpreta- 
tion. After an explanatory discussion of some 
general considerations from which this re- 
interpretation derives we shall turn to an 
experiment that attempts to put it to a 
critical test. 

Theorists from Thomas and Znaniecki 
(1918) to Lewin (1935) have contended that 
the person’s “definition of the situation” is 
the ground from which behavior emerges and 
takes its direction. But psychologists have
tended to miss the relevance of this nearly 
banal proposition as it might apply to the 
understanding of psychological research it- 
self. For most human subjects psychological 
experiments are ambiguous situations, some- 
times exhilarating, sometimes provocative of 
curiosity or anxiety; and all these forms of 
arousal are likely to set them searching for 
the possibly veiled meanings of the experi- 
mental situation. Just how the subject does 


1 This study was carried out while the author was 
a member of the Psychology Department at Ohio 
State University. It was supported by Contract 495 
(24) with the Group Psychology Branch of the 
Office of Naval Research. The author is indebted to 
Frederick Weizmann for his assistance in executing 
the experiment and to David Glass and Irving Janis 
who raised a number of useful questions. 




MILTON J. ROSENBERG
Dartmouth College 


define the situation, and thus how he is likely
to behave in it, may often be affected by
those differences in treatment manipulation
or in instructions that distinguish one experi-
mental condition from another. When


tion are not intended they may contaminate
the design and lead to false confirmation, or
for that matter to false disconfirmation of
hypotheses. Two separate ways in which this
may happen will be explicated here.


Evaluation Apprehension as a Research
Contaminant


It is proposed that the typical human
subject approaches the typical psychological
experiment with a preliminary expectation
that the psychologist may undertake to evalu-
ate his (the subject’s) emotional adequacy,
his mental health or lack of it. Members of
the general public, including students in
introductory psychology courses, have usu-
ally learned (despite our occasional efforts
to persuade them otherwise) to attribute
special abilities along these lines to those
whose work is perceived as involving psycho-
logical interests and skills.2 Even when the

2 As used in this paper the term “psychologist”
is merely a convenient categorical simplification.
It denotes anyone who “runs” subjects through an
experimental or interview procedure and is per-
ceived as being at least somewhat skilled at, and




subject is convinced that his adjustment is 
not being directly studied he is likely to think 
that the experimenter is nevertheless bound 
to be sensitive to any behavior that bespeaks 
poor adjustment or immaturity. 

In experiments the subject’s initial sus-
picion that he may be exposing himself to
evaluation will usually be confirmed or dis-
confirmed (as he perceives it) in the early
stages of his encounter with the experimenter.
Whenever it is confirmed, or to the extent
that it is, the typical subject will be likely
to experience evaluation apprehension; that
is, an active, anxiety-toned concern that he
win a positive evaluation from the experi-
menter, or at least that he provide no grounds
for a negative one. Personality variables will
have some bearing upon the extent to which
this pattern of apprehension develops. But 
equally important are various aspects of the 
experimental design such as the experiment- 
er’s explanatory “pitch,” the types of meas- 
ures used, and the experimental manipulations 
themselves. 

Such factors may operate with equal po- 
tency across all cells of an experiment; but 
we shall focus upon the more troublesome 
situation in which treatment differences be- 
tween experimental groups make for differen- 
tial arousal and confirmation of evaluation 
apprehension. The particular difficulty with 
this state of affairs is that subjects in groups 
experiencing comparatively high levels of 
evaluation apprehension will be more prone 
than subjects in other groups to interpret the 
experimenter’s instructions, explanations, and 
measures for what they may convey about 
the kinds of responses that will be considered 
healthy or unhealthy, mature or immature. 
In other words, they will develop hypotheses
about how to win positive evaluation or to
avoid negative evaluation. And usually the
subjects in such an experimental group are 
enough alike in their perceptual reactions to 
the situation so that there will be considerable 
similarity in the hypotheses at which they 
separately arrive. This similarity may, in


professionally interested in, figuring people out. For
example, this is certainly the case when under-
graduate subjects participate in a study conducted
by an advanced psychology major or graduate
student.

turn, operate to systematically influence ex-
perimental responding in ways that foster 
false confirmation of the experimenter’s 
predictions. 

Let us consider one example of a situation 
in which some well-known findings might be 
accounted for in these terms. It seems quite
conceivable that in certain dissonance experi- 
ments the use of surprisingly large monetary 
rewards for eliciting counterattitudinal argu- 
ments may seem quite strange to the subject, 
may suggest that he is being treated dis- 
ingenuously. This in turn is likely to confirm 
initial expectations that evaluation is some- 
how being undertaken. As a result the typical 
subject, once exposed to this manipulation, 
may be aroused to a comparatively high level 
of evaluation apprehension; and, guided by 
the figural fact that an excessive reward has 
been offered, he may be led to hypothesize 
that the experimental situation is one in 
which his autonomy, his honesty, his reso- 
luteness in resisting a special kind of bribe, 
are being tested. Thus, given the patterning
of their initial expectations and the routinized 
cultural meanings of some of the main fea-
tures of the experimental situation, most low-
dissonance subjects may come to reason
somewhat as follows: “they probably want 
to see whether getting paid so much will 
affect my own attitude, whether it will influ- 
ence me, whether I am the kind of person 
whose views can be changed by buying 
him off.”

The subject who has formulated such a 
subjective hypothesis about the real purpose 
of the experimental situation will be prone 
to resist giving evidence of attitude change; 
for to do so would, as he perceives it, convey 
something unattractive about himself, would 
lead to his being negatively evaluated by 
the experimenter. On the other hand, a simi- 
lar hypothesis would be less likely to occur 
to the subject who is offered a smaller 
monetary reward and thus he would be less 
likely to resist giving evidence of attitude 
change. 


Affect toward the Experimenter as a Research 
Contaminant 


Yet another and even simpler type of pos-
sible systematic bias should be noted. This
involves the unsuspected affective conse-
quences of designs which call for the experi-
menter to behave differently toward persons 
in different conditions of an experiment. 
Under certain circumstances such differences 
may generate further differences in how sub- 
jects feel toward the experimenter or toward 
his experiment; and these intercell affective 
differences too may have the final conse- 
quence of influencing experimental responses 
in ways which make for false confirmation of 
hypotheses. Thus, turning again to dissonance 
studies in which subjects are offered large 
rewards for the writing of counterattitudinal 
essays, this manipulation, instead of creating 
low dissonance, may establish comparatively 
high arousal of the suspicion that one is 
being deceived; and this in turn may generate 
anger. A possible consequence is that the 
low-dissonance subject, provoked to hostility 
by the suspected duplicity, may find emo- 
tional release in refusing to show the response 
(attitude change) that he perceives the 
experimenter to be after. 


Contaminant Control by Altered Replication 


One way of checking upon the presence of 
these types of contamination is to ask the 
subject how he interpreted the purpose and 
meaning of the experiment. This will often
be possible but it may sometimes involve
one major hazard: such interviewing in itself
can be open to the very kinds of contami-
nation it seeks to disclose. Another approach


is to conduct an altered replication of the 
original experiment, one in which we re- 
design those of its aspects that are presumed 
to have fostered the contaminating processes. 
Not only does such an approach enable ap-
plication of the law of parsimony in inter-
preting the relation of data to theoretical
claims, it also facilitates further study of
the social psychology of the psychological
experiment.3 In pursuit of these goals much


3 For a programmatic statement that defines some 
outstanding prospects and problems in this area see 
the useful article by Riecken (1962). In his com- 
ments on the subject’s desire to “put his best foot 
forward” Riecken is speaking of something rather 
like the concept of “evaluation apprehension” that 
has been developed here and more briefly treated 
earlier (Rosenberg, 1961, 1963). However, for 
Riecken this is basically a source of “unintended


of this paper will report and discuss an
altered replication of one important study
that has been presented as confirming the
prediction that counterattitudinal advocacy
will generate greater attitude change when
undertaken for a small reward than when
undertaken for a large one. First it will be
necessary to consider the background, design,
and results of the original study upon which
the present altered replication is based.


to undertake it, will lead to attitude change
in inverse proportion to the strength and
number of cognitions that could be used to
justify such behavior (Brehm, 1960). Of the
many conceivable types of counterattitudinal
performance the one that has been most fre-
quently studied both by dissonance theorists
and others has been advocacy, in oral or
written form, of an attitude position oppo-
site to the subject’s actual private attitude
(Carlson, 1956; Culbertson, 1957; Janis &
King, 1954; Kelman, 1953; Scott, 1 
1959). 

In most of the relevant dissonance studies
the justification variable has been operation-
alized in one of three ways: subjects are
given high or low choice in deciding whether
to undertake counterattitudinal advocacy
(for example, Cohen & Latané in Brehm &
Cohen, 1962); they are told that their per-
formances will be of great or little value to
interested other parties (for example, Cohen,
Brehm, & Fleming, 1958); or they are given
or promised large or small monetary, or


variance” in data and the possibility that it may
exert systematic influence making for false confirma-
tions of hypotheses is not directly examined. Orne
(1962) and Rosenthal (1963) have suggested other
types of systematic bias. The former has argued that
subjects are often motivated to help the experi-
menter “prove his hypothesis” while the latter has
presented evidence that the experimenter’s hypotheses
or expectations are often indirectly communicated
in ways that shape the subject’s experimental re-
sponding. These views are not in conflict with, nor
are they particularly close to, the interpretation
offered here. More directly related are the studies
on “social desirability” by Edwards (1957) and
Crowne and Marlowe (1960). However, these in-
vestigators have been basically concerned with
sources of invalidity in psychological testing rather
than with systematic bias in experiments.


other, rewards (for example, Festinger & 
Carlsmith, 1959). 

The study upon which the present paper 
is focused was conducted by Cohen and is 
one of the many recently reported by Brehm 
and Cohen (1962). Its general design was 
similar to earlier dissonance studies except 
that it used four levels of monetary reward, 
instead of the usual two. The prediction was 
that with this graded range of monetary 
rewards the resulting attitude change would 
be monotonically and inversely related to 
the size of the reward.
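
The prediction of a monotonic, inverse relation between reward and attitude change can be stated as a simple ordering check on group means. The sketch below is illustrative only: the four reward levels and the group sizes are those reported for Cohen's study in the text, but the function name is invented for this example and the means passed to it are hypothetical placeholders, not data from any study.

```python
# Reward levels (dollars) and group sizes as reported for Cohen's study.
REWARDS = [0.50, 1.00, 5.00, 10.00]
GROUP_SIZES = [8, 6, 10, 6]


def is_monotone_inverse(rewards, mean_change):
    """Return True if mean attitude change strictly decreases as reward rises."""
    # Sort the groups by reward so the check does not depend on input order.
    ordered = [change for _, change in sorted(zip(rewards, mean_change))]
    return all(lower > higher for lower, higher in zip(ordered, ordered[1:]))


# HYPOTHETICAL placeholder means, used only to exercise the function.
hypothetical_inverse = [4.0, 3.0, 2.0, 1.0]
hypothetical_direct = [1.0, 2.0, 3.0, 4.0]

print(is_monotone_inverse(REWARDS, hypothetical_inverse))  # dissonance-style ordering
print(is_monotone_inverse(REWARDS, hypothetical_direct))   # opposite ordering
print(sum(GROUP_SIZES))                                    # subjects in the reward groups
```

A check of this kind describes only the ordering of the means. Whether adjacent groups differ reliably is a separate question that requires significance tests.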

The subjects were Yale undergraduates. 
The issue concerned the actions of the New 
Haven police in a campus riot that had 
occurred a few weeks earlier. The experi-
menter, appearing at randomly chosen dormi-
tory rooms, introduced himself as a “member
of an Institute of Human Relations research 
team,” ascertained by verbal inquiry that 
the subject disapproved of the actions of 
the police and asked him to write an essay 
in support of the actions of the police.4 The
request for the counterattitudinal essay was 
explained in this way: 


It has been shown that one of the best ways 
to get relevant arguments, on both sides of an 
issue, is to ask people to write essays favoring only 
one side. . . . What we really need now are some 
essays favoring the police side. I understand that 
you have very different views on the matter, but 
as you can see it’s very interesting to know what 
kinds of arguments people bring up in their essays
if they have different opinions about it. 


4 It is not clear from the research report whether
the experimenter actually referred to himself as a 
psychologist. But belonging to an “Institute of 
Human Relations research team” would have been 
sufficient to establish that he was some sort of 
psychologist or advanced psychological trainee who 
would be reporting back to a more senior colleague.
This was because the “Institute of Human Relations”
was the name of the building that housed the Yale
Psychology department; no other department that
gave undergraduate courses was located there and 
the research organization for which it was named 
had long since ceased to exist. The experimenter 
also described himself as a “fellow student.” This 
may have worked to further heighten the arousal 
of evaluation apprehension, since the Yale under- 
graduate culture places great emphasis upon the 
competitive show of maturity, sophistication, and 
“all around balance.” It would probably be par-
ticularly important to the subject that the evalua-
tion of him formed by a psychologically trained
“fellow student” be a positive one.

The reward manipulation was then intro-
duced by telling the subject that he would
receive a particular sum for “writing the
essay against your own position.” Eight sub-
jects were offered $.50, 6 were offered $1,
10 others were offered $5, 6 others were
offered $10. The subject wrote his essay
on a blank sheet headed “Why the New 
Haven Police Actions Were Justified.” He 
was then told: 


Now that you have looked at some of the reasons 
for the actions of the New Haven police, we would 
like to get some of your reactions to the issue: 
you may possibly want to look at the situation in 
the light of this. So, would you please fill out this 
questionnaire. 


The questionnaire on which the subject was 
invited to indicate approval of the New 
Haven police, if so inclined, began with this 
query: “Considering the circumstances, how 
justified do you think the New Haven police 
actions were in the recent riot?” An a priori 
31-point scale was used with labels at every 
fifth point and ranging from “completely
justified” to “not at all justified.” Addi-
tional questionnaire items were used to
check that the subject correctly perceived 
the amount of payment that he had been
promised and that he had understood that 
he was to write a strong essay opposite to 
his own attitude. A control group was given 
the attitude questionnaire but received 
neither the manipulation nor the other 
measures.

It was found that the $5 and $10 groups 
did not differ significantly from the control 
group in expressed attitude toward the New 
Haven police. However, the subjects in the 
$.50 group were less negative toward the 
New Haven police than the $1 subjects 
(p< .05) who in turn were less negative 
than the $10 subjects (p < .05); and both 
the $.50 and $1 groups differed significantly 
from the control group. 

Thus in the main the data appeared to 
confirm the original prediction. However, the 
point of view outlined above would suggest 
that in this study, as in others of similar 
design, the low-dissonance (high-reward) sub- 
jects would be more likely to suspect that 
the experimenter had some unrevealed pur-
pose. The gross discrepancy between spend-
ing a few minutes writing an essay and the
large sum offered, the fact that this large sum
had not yet been delivered by the time the 
subject was handed the attitude question- 
naire, the fact that he was virtually invited 
to show that he had become more positive 
toward the New Haven police: all these
could have served to engender suspicion and 
thus to arouse evaluation apprehension and 
negative affect toward the experimenter. 
Either or both of these motivating states 
could probably be most efficiently reduced 
by the subject refusing to show anything 
but fairly strong disapproval of the New 
Haven police; for the subject who had come 
to believe that his autonomy in the face of a 
monetary lure was being assessed, remaining 
“antipolice” would demonstrate that he had 
autonomy; for the subject who perceived an 
indirect and disingenuous attempt to change 
his attitude and felt some reactive anger, 
holding fast to his original attitude could 
appear to be a relevant way of frustrating 


the experimenter.5 Furthermore, with each
step of increase in reward we could expect
an increase in the proportion of subjects who
had been brought to a motivating level of
evaluation apprehension or affect arousal.


5 Some other reasons (see Footnote 4 for the first) 
why the experimenter calling himself a “fellow 
student” might have increased the potency of these 
biasing processes are worth noting here. Given the 
fact that the antipolice attitude was highly norma-
tive among Yale undergraduates at this time many
subjects would have been likely to assume that the
experimenter was also antipolice. Thus among the
high-reward subjects who made this attribution to 
the experimenter any tendency to inhibit showing 
themselves capable of being “bought off” would be 
further strengthened by the expectation that the 
experimenter would personally disapprove of the 
subject's new attitude. On the other hand if the 
experimenter was perceived, as he might have been 
by some subjects, to actually be propolice, the fact 
that he was a “fellow student” would lead to his 
being seen as violating an important group standard; 
and this, particularly for high-reward subjects who 
might interpret the experimenter as trying to “buy 
them off” for the propolice side, would have 
engendered even more anger toward the experimenter 
than would otherwise be the case. In turn this
would have increased the likelihood that the high- 
reward subject would resist showing any change 
in the propolice direction. Thus both the evalua- 
tion apprehension and affect arousal patterns of 
contamination might well have been intensified by 
the experimenter being perceived as a fellow student. 
How can such an interpretation be tested?
If it is correct it points to the importance

both the dissonance arousal and the attitude
measurement. Evaluation apprehension and
negative affect, if they exist, have

experimenter (and possibly even from him-
self) any evidence of having been influenced
by the essay he has just completed.

be to separate the dissonance arousal phase
of the experiment from the attitude measure-
ment phase. The experiment should be or-
ganized so that it appears to the subject

a situation the evaluation apprehension and
negative affect that are focused upon the
dissonance-arousing experimenter would prob-
ably be lessened and, more important, they
would not govern the subject’s responses
to the attitude-measuring experimenter and
to the information that he seeks from the
subject.


a certain amount, but also the actual delivery
of that amount to him immediately after he
completed the essay. It was assumed that
this change too would work to reduce the
likelihood that the high-reward subjects
would develop suspicions concerning the
experimenter’s possible duplicity and unre-
vealed purposes.

in which some attempt was made to separate coun-
terattitudinal advocacy from subsequent attitude
measurement; this is the experiment by Festinger
and Carlsmith (1959). However, the degree of sepa-
ration may well have been insufficient. That experi-
ment did not involve, as did the present one, dis-
guising the two phases as two different studies
conducted in two different departments. Further-
more the dependent variable was not change in a
previously stable social attitude but rather a mo-
mentary rating of how much the subject liked or
disliked an experiment just completed.

Adherence to the dissonance view would 
suggest that under these altered conditions 
the results would still show an inverse rela- 
tionship between magnitude of reward and 
extent of attitude change. Indeed the sig- 
nificance of the dissonance-confirming rela- 
tionship might be expected to increase; for 
now with each subject having actually re- 
ceived a monetary reward the cognitions con- 
cerning reasons for undertaking the counter- 
attitudinal performance would be less variable 
within experimental groups than could have
been the case in the original experiment. 

However, the consistency theory developed 
by the present author (Rosenberg, 1956,
1960) suggests the opposite prediction. It 
holds that the most usual basis for attitude 
change is the establishment of new beliefs 
about the attitude object, beliefs that are
inconsistent with the original affective orien- 
tation toward that object. In this view the 
significance of a reward received for writing 
a counterattitudinal essay (that is, for im- 
provising or rehearsing inconsistency-gener- 
ating cognitions) would be different from 
that claimed in dissonance theory: such a 
reward would, in proportion to its magnitude, 
be likely to have a positive effect both upon
the development and the stabilization of the
new cognitions. From this it would be pre-
dicted that with the removal of the biasing
factors the degree of attitude change obtained 
after the subjects have written counter- 
attitudinal essays will vary directly, rather 
than inversely, with the amount of reward. 
Thus the altered design outlined here may 
afford something approximating a critical test 
between this approach and the dissonance 
approach as regards their applicability to
predicting the attitude-change effects of
counterattitudinal advocacy. 


METHOD 


Attitude Issue and Subjects 


To replicate as closely as possible, except for the 
major changes that distinguish the present study 
from its model, the author sought an issue com- 
parable to “the actions of the New Haven police.” 
Late in 1961 the Ohio State University football 
team, having won the Big Ten championship,
received an invitation to the Rose Bowl. Concerned
with the extent to which its reputation as the “foot- 
ball capital of the world” weakened OSU’s academic 
reputation and performance, the faculty council of 
the University voted to reject the invitation and
thereby engendered, both in the student body and 
the surrounding community, a sense of incredulous 
outrage. This, through the promptings of local news
media, was rapidly turned toward active protest. 
The immediate result was a riot in which a large 
crowd of undergraduates (estimates varied between 
one and three thousand) stormed through University
buildings shouting pro-Rose-Bowl and antifaculty 
slogans. The more longlasting result was the stabiliza-
tion among the undergraduates of an attitude of dis-
approval toward any limitation upon Rose Bowl
participation. This attitude remained salient during
the following year and even in the face of the 
fact that during that year the faculty council, by 
a close vote, reversed its original decision. In general 
interested students felt that future faculty inter- 
ference with participation in bowl games continued 
to be a real possibility. 

With a pilot questionnaire administered early in
1963 it was confirmed that opposition to a Rose 
Bowl ban remained a consensual position among 
the undergraduate body; more than 94% of the 
sample indicated strong disapproval toward any 
restoration of the ban in the future. Upon com-
pletion of this pilot study a new group of male
subjects was recruited from sections of introductory 
psychology for participation in the present study.
In all 51 subjects were finally used. Ten were
randomly assigned to each of three experimental
conditions and 21 to a control condition. 


Dissonance Arousal 


As each experimental subject arrived at the 
author’s office he found him busily engaged either 
in writing or in a conversation with another 
“student.” The experimenter then told the subject:


I’m sorry but I’m running late on my schedule
today and I’ll have to keep you waiting for about
15 or 20 minutes. Is that all right? 


Most subjects simply said it was though a few 
expressed concern about getting to their next class 
on time. All of the latter, when assured that the
work the experimenter wanted them to do would 
take no more than 20 minutes, accepted the situa- 
tion with equanimity. The experimenter then said: 


Oh, I’ve just thought of something; while you 
are waiting you could participate in another little 
experiment that some graduate student in educa- 
tion is doing. 


The experimenter explained that he had had a call 
the previous day from the “graduate student” who
needed volunteers in a hurry for 


some sort of study he’s doing—I don’t know what 
it’s about exactly except that it has to do with 
attitudes and that’s why he called me, because
my research is in a similar area as you'll see
later. [The experimenter went on to say] Of
course he can’t give you any credit [the usual
research credit point used to keep up experimental
participation rates in introductory psychology
courses] but I gather they have some research
funds and that they are paying people instead.
So if you care to go down there you can. 


All but three subjects indicated that they did 
want to participate in the other study. (The 
three who did not were eliminated from the experi- 
ment.) With some show of effort and uncertainty 
the experimenter then recalled the name of the 
education graduate student and the room, actually 
located in the education department, where he 
could be found. 

Upon reporting to the “education graduate 
student” the subject received an explanation modeled 
word-for-word upon that used in the earlier experi- 
ment reported by Brehm and Cohen. Also, as in 
that experiment, it was determined by verbal inquiry 
that the subject held an attitude position opposite 
to the one he was to argue for in the essay. 
Subjects were randomly assigned to one of three 
reward conditions ($.50, $1, $5), and the amount 
that each subject was to receive was made clear 
to him before he undertook to write an essay on 
why the OSU football team should not be allowed 
to participate in the Rose Bowl. After the subject 
had completed the essay he was paid the amount 
that he had been promised, then thanked for his 
participation and dismissed. He then returned to
the experimenter’s office and, under the guise of 
participating in another study, his attitudes toward 
the Rose Bowl ban and toward various other issues 
were ascertained. 


Attitude Measurement 


This phase of the study began by the experimenter
telling the subject that the study for which his 
participation had originally been solicited was a 
continuing survey on student attitudes “that I run 
every semester as a sort of Gallup poll to keep 
a check on opinion patterns on different University 
issues.” (The experimenter, of course, did not know 
at this point which of the three magnitudes of 
reward the subject had received for writing the 
essay.) The subject then filled out an attitude 
questionnaire dealing with eight different issues.
One of these read, “How would you feel if it 
were decided that from now on the OSU football 
team would not be allowed to participate in the 
Rose Bowl?” Following the procedure in the earlier 
study the subject responded on a 31-point graphic 
scale, marked at every fifth point by these labels: 
I think this decision would be not justified at all; 
very little justified; little justified; slightly justified;
rather justified (instead of “quite justified” as in
the earlier study); very justified; completely
justified. 


The same scale form was used with the other
seven issues. One of these dealt with the area of
varsity athletics and read, “How would you feel if
it were decided that the University would no longer
give any athletic scholarships?” This issue was in-
cluded to provide another and more indirect


anti-Rose-Bowl essay under varying conditions of
reward. The other six issues dealt with nonathletic
matters such as dormitory regulations, University
admission policies, library rules, etc.

When the subject had completed this question-
naire he was asked what he thought the experiment
was really about. His responses during a period of
subsequent inquiry were transcribed and these were
to be analyzed for the extent to which they reflected
any suspicion that the two experiments were actually
related to one another. The subject then filled
out a follow-up questionnaire. The first item asked,
“while you were filling out the opinion question-
naire did it occur to you that there might be some
connection between this experiment and the one
you worked on in the education department?”


that in fact there had been “a connection between
the two experiments” and that it would be
explained after he completed the questionnaire,


which asked how strong an essay he had agreed
to write, how strong an essay he did write, how
free he had felt in his decision to write the essay,
how getting paid for the essay had made him feel,
etc. Each of the questions was answered by choosing
one of a number of alternative positions.
The experimenter then told the subject about
the nature (but not the purpose) of the deception
that had been used and proceeded to engage him
in an interview designed to elicit further evidence
of any doubts or suspicions that the subject might
have felt during the experiment. The experimenter
then explained the actual purpose of the experiment,
commenting both upon its basic hypothesis and its
design, and then answered all of the subject’s ques-
tions. Before the subject was thanked and dismissed
he was urged not to speak of the experiment to
any fellow students during the remainder of the
academic semester. All subjects promised to comply
with this request.
In distinction to the experimental subjects each
of the control subjects, upon reporting for his ap-
pointment, was merely told that the experimenter
was conducting “a sort of Gallup poll on University
issues” and then filled out the attitude questionnaire.


RESULTS 


In all 62 subjects were originally run
through the experiment. Eleven were dis-
carded from the final analysis because on one
basis or another they failed to meet necessary
conditions that had been specified in advance.
Six subjects (two originally assigned to the
control condition and four to the experimental
conditions) were rejected because post-
experimental questioning revealed that they
were members of varsity athletic teams. It
had been decided that persons in this category
would not be used since their pro-Rose-Bowl
attitudes could be assumed to be considerably
stronger, more firmly anchored, than those of
other students. Two other subjects, originally
assigned to experimental groups, were dis-
carded from the analysis because they evi-
denced virtually complete and spontaneous
insight into the deception that had been
employed. One other subject was discarded
because he reported, on the postexperimental
questionnaire, that he had been asked to
write a “weak” rather than a “strong” essay.
Two additional experimental subjects were
discarded because they impressed both the
experimenter and his assistant as showing
psychotic tendencies. However, when the
analysis reported below is repeated with the
last three rejected subjects included the
findings are in no wise altered.

Except for the manipulated independent 
variable other factors that might influence 
attitudinal response appear to have remained 
constant across experimental groups. Thus 
on the postexperimental questionnaire the
subjects in the three experimental groups do
not differ in their perceptions as to how
strong an essay they were asked to write or
actually did write; nor do they differ in their
self-reports on how free they felt to refuse.
From the postexperimental interview data it
appears that though a few subjects were sur-
prised to find the Rose Bowl situation fea-
tured in the “two different experiments,” the
groups were equally lacking in insight both
as regards the deception that was used and
as regards the real purpose of the experiment.⁷


7 It has been already suggested that in interviews,
as in experiments, subjects’ responses may often be
influenced by their private interpretations of the
situation. Thus the postexperimental data collected
in this study cannot necessarily be taken at simple
face value. But there is at least one important
consideration (probably relevant whenever the credi-
bility of an experimental deception is being assessed)
that suggests that the subjects were not holding back
evidence of having discerned the true design of the
experiment or of having doubted the explanations
that were given them. Experienced experimenters
will probably agree that college student subjects
usually desire to represent themselves as sophisti-
cated and as not easily misled. Thus when the post-
experimental interview situation is a permissive one
the subjects are likely to disclose, rather than with-
hold, promptings toward insight. Yet none of the
present subjects revealed any such insights when,
after completion of the experiment, they were asked
“what do you think the experiment was really
about?” Later on when told by the experimenter
that the “two experiments” were really one or
still later when the full explanation was given, only
a few subjects (two or three per group) claimed
to have had suspicions suggestive of what had
now been revealed. However, in their attitudinal
responses on the two athletic issues these subjects
do not differ from others in their groups (that is,
they are not clustered in the low, middle, or high
portions of the within-group attitude score rank-
ings). Thus it seems likely that most of these par-
ticular subjects were exaggerating, and some per-
haps were even imagining, their earlier doubts and
in so doing were seeking positive evaluation from
the experimenter after they had been shown capable
of being “taken in.” As intended, then, the pro-
cedures of the present experiment seem to have
achieved their basic purpose which was to avoid,
or at least to minimize, the kind of suspicion and
disturbing confusion that tends to activate such
biasing processes as affect arousal and evaluation
apprehension.

It will be remembered that the measurement
phase of the present study consisted of a
questionnaire concerned with eight different
University issues. On the six issues concerning
matters unrelated to athletic policy, and thus
not subjected to manipulation through the
essay-writing procedure, statistical analysis
reveals no overall differences and no dif-
ferences between any specific groups taken
two at a time.

On the main matter of experimental inter-
est, whether attitude change on the Rose
Bowl and athletic-scholarship issues varies
directly or inversely with the magnitude of
monetary reward, the data reviewed below
reveal that the former is the case; that is,
the prediction drawn from a consistency-
theory interpretation appears to be confirmed
and the opposite prediction based upon dis-
sonance theory appears thereby to be dis-
confirmed.

TABLE 1

GROUP MEANS AND DIFFERENCES BETWEEN GROUPS
ON ATTITUDE TOWARD THE ROSE BOWL BAN

                                  Group differences (a)
Group          M      $.50        $1          $.50 and $1   $5
Control        1.45   z = 1.97,   z = 1.80,   z = 2.31,     z = 3.…,
                      p < .03     p < .04     p < .015      p < .0001
$.50           2.24               …                         …
$1             2.32                                         z = 1.…
$.50 and $1    2.28                                         z = 2.11,
                                                            p < .02
$5             3.24

Note.—Overall difference between groups as assessed by Kruskal-Wallis test: H = 17.89, p < .001.

(a) Tested by Mann-Whitney z, one-tailed.

Scoring the 31-point attitude scale from 1.0
(for the banning of Rose Bowl participation
would be “not justified at all”) through 1.2,
1.4 . . . to 6.8, 7.0 (banning Rose Bowl par-
ticipation would be “completely justified”)
we find the following mean scores: 1.45 for
the control group, 2.24 for the $.50 reward
group, 2.32 for the $1 reward group, and 3.24
for the $5 reward group. The attitude score
ranges are 1-3 for the control group, 1-4 for
the $.50 group, 1-5 for the $1 group, and
2-6 for the $5 group.
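
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration: it assumes the 31 scale points are numbered 1 through 31 from the "not justified at all" end, and that the seven verbal labels sit at points 1, 6, 11, ..., 31. The names `scale_score` and `label_at` are hypothetical and not part of the original procedure.

```python
# Verbal anchors as listed in the Attitude Measurement section.
LABELS = [
    "not justified at all",
    "very little justified",
    "little justified",
    "slightly justified",
    "rather justified",
    "very justified",
    "completely justified",
]


def scale_score(point: int) -> float:
    """Convert a mark at scale point 1..31 into a score from 1.0 to 7.0.

    Adjacent points differ by 0.2, so point 1 -> 1.0, point 2 -> 1.2,
    and point 31 -> 7.0.
    """
    if not 1 <= point <= 31:
        raise ValueError("point must lie between 1 and 31")
    return round(1.0 + 0.2 * (point - 1), 1)


def label_at(point: int):
    """Return the verbal label at this point, or None if it is unlabeled.

    Assumption: labels fall on every fifth point, starting at point 1.
    """
    if not 1 <= point <= 31:
        raise ValueError("point must lie between 1 and 31")
    if (point - 1) % 5 == 0:
        return LABELS[(point - 1) // 5]
    return None
```

Under these assumptions each labeled point falls on a whole-number score, so a mean such as 3.24 lies a little above "little justified" and well short of "slightly justified."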

The significance of the reward variable in 
its influence upon attitude toward a Rose 
Bowl ban was assessed by computing the 
Kruskal-Wallis one-way analysis of variance 
from the ranked scores of all groups. H, which 
is distributed as chi square, equals 17.89 and 
has a chance probability of less than .001 (see
Table 1). In addition to this overall confirma- 
tion of the original prediction it is desirable to 
test the significance of differences between the 
specific groups. 
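
For readers who want to retrace the overall test, the following is a minimal sketch of the Kruskal-Wallis computation in plain Python. Several things here are assumptions rather than details taken from the paper: tied scores receive midranks, the usual tie correction is applied, and H is referred to chi square with 3 degrees of freedom (four groups minus one). The function names are illustrative, and no raw scores from the study are reproduced.

```python
import math


def midranks(values):
    """Rank values 1..N, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H computed from pooled ranks, corrected for ties."""
    pooled = [x for g in groups for x in g]
    n = len(pooled)
    ranks = midranks(pooled)
    total, start = 0.0, 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        total += rank_sum ** 2 / len(g)
        start += len(g)
    h = 12.0 / (n * (n + 1)) * total - 3 * (n + 1)
    # Tie correction: divide by 1 - sum(t^3 - t) / (N^3 - N).
    counts = {}
    for x in pooled:
        counts[x] = counts.get(x, 0) + 1
    correction = 1 - sum(t ** 3 - t for t in counts.values()) / (n ** 3 - n)
    if correction == 0:
        raise ValueError("all scores are identical; H is undefined")
    return h / correction


def chi2_sf_df3(x):
    """Upper-tail probability of chi square with 3 degrees of freedom."""
    return math.erfc(math.sqrt(x / 2)) + math.sqrt(2 * x / math.pi) * math.exp(-x / 2)


# The reported H values can be checked against their stated bounds.
p_rose_bowl = chi2_sf_df3(17.89)      # reported as p < .001
p_scholarships = chi2_sf_df3(14.50)   # reported as p < .005
```

Because the paper's raw scores are not available here, the sketch can only confirm that the reported H values are consistent with the probability bounds given for them, on the assumption of 3 degrees of freedom.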

Analysis by the Mann-Whitney rank sum 
test (computing z; see Mosteller & Bush,
1954) reveals that there is no significant dif- 
ference between the $.50 and $1 groups. Ac- 
cordingly in some of the additional analyses 
these two groups were combined. As Table 1 
indicates the combined $.50-$1 group is sig- 
nificantly more favorable toward banning 
Rose Bowl participation than is the control 
group (p < .015) and significantly less favor- 
able than the $5 group (p < .02). When the 
$.50 and $1 groups are analyzed separately 
each is found to be significantly different
from both the control and $5 conditions (see
Table 1). As would be expected the difference
between the control and $5 groups is of very
large significance (p < .0001).
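
The pairwise comparisons can be illustrated with a short sketch of the Mann-Whitney test in its normal-approximation form. The paper does not say whether a continuity or tie correction was used, so this version, which applies neither, is an assumption; the function names are hypothetical. The z values in the last lines are the legible entries of Table 1.

```python
import math


def mann_whitney_z(lower, higher):
    """Normal-approximation z for the hypothesis that `higher` exceeds `lower`.

    U counts the pairs in which the `higher` score is larger, with half
    credit for ties. No continuity or tie correction is applied.
    """
    n1, n2 = len(lower), len(higher)
    u = 0.0
    for x in lower:
        for y in higher:
            if y > x:
                u += 1.0
            elif y == x:
                u += 0.5
    mean_u = n1 * n2 / 2
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return (u - mean_u) / sd_u


def one_tailed_p(z):
    """Upper-tail probability of a standard normal deviate."""
    return 0.5 * math.erfc(z / math.sqrt(2))


# Legible entries from Table 1: (z, reported upper bound on p).
reported = [(1.97, 0.03), (1.80, 0.04), (2.31, 0.015), (2.11, 0.02)]
consistent = all(one_tailed_p(z) < bound for z, bound in reported)
```

With group sizes of 10, 20, and 21 the normal approximation is commonly regarded as adequate, which is presumably why z rather than an exact U table is reported.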

Thus the only deviation from the original
prediction in this set of findings is the absence
of a significant difference between the $.50
and $1 groups. Since these groups do differ
as predicted from both the control and $5
groups, respectively, it might be conjectured
that the $.50 difference between them does
not generate a large enough subjective dif-
ference in the magnitude of payment. How-
ever, the alternative possibility that even this
small magnitude of difference in reward could
have some subtler influence upon attitude is
suggested by the additional data regarding
the issue of abandoning the policy of giving
athletic scholarships.

This issue was used as a second test of the
basic hypothesis. The expectation was that
attitude change on the Rose Bowl issue should
tend to generalize toward a similar issue; one
that suggests another way of deemphasizing
the role of varsity sports in university life. It
would of course be expected that the group
differences would be of lesser magnitude on
this issue than on the Rose Bowl issue since
the latter served as the actual topic for the
counterattitudinal essay.

Analysis of the subjects’ responses on the
athletic-scholarship issue reveals again a pat-
tern of findings that supports the original
hypothesis. Responding on a 31-point scale
from 1.0 to 7.0 (with higher scores indicating
greater approval for “abandoning athletic
scholarships”) the groups yield the following
mean scores: 2.28 for the control group, 2.26
for the $.50 group, 3.04 for the $1 group,
and 3.88 for the $5 group. The score ranges
are 1-7 for the control group, 1-4.8 for the
$.50 group, 1-6 for the $1 group, and 1.2-7
for the $5 group.

Application of the Kruskal-Wallis test in-
dicates a significant main effect (H = 14.50,
p < .005); thus the extent to which the
writing of the essay affects an attitude related
to the topic of the essay is shown to be a
positive monotonic function of the amount of
reward.

Analysis of the differences between the 
specific groups as reported in Table 2 clarifies 
certain interesting details. While the mean 
attitude scores of the control and $.50 groups 
are virtually identical there is a slight and 
insignificant trend (p < .20) toward a greater 
concentration of extreme negative scores in 
the control group. The difference between the 
control and $1 groups comes closer to an 
acceptable probability level (p < .10) reflect-
ing the greater differences in means (control
= 2.28, $1 = 3.04) reported above.


As predicted, the control and $.50 groups 
do show significantly less approval of aban- 
doning athletic scholarships than does the $5 
group; p < .01 in both cases. When the con-
trol and $.50 groups are combined the dif-
ference from the $5 group has a probability 
of less than .005 as compared to less than .08 
when the difference between the combined 
group and the $1 group is assessed. The $1 
group clearly stands in an intermediate posi-
tion. While its mean attitude score reflects
greater endorsement of the antiathletic-
scholarship view than does the $.50 group
and less endorsement than the $5 group, 
neither of these differences (p < .15 and p < 
.12, respectively) reaches significance. 

Thus in comparison to the $.50 group the 
$1 group is less clearly differentiated from the 
$5 group and more clearly differentiated from 
the control group. From this it is apparent 
that the difference in size of reward between 
the $.50 and $1 groups does have some in-
fluence upon the extent to which the writing
of the essay affected the subjects’ attitudes on
a related issue; and that influence too is
consistent with the prediction that attitude
change following counterattitudinal perform-
ance will be a positive function of the degree
of reward received for such performance.

TABLE 2

GROUP MEANS AND DIFFERENCES BETWEEN GROUPS
ON ATTITUDE TOWARD ENDING ATHLETIC SCHOLARSHIPS

                                 Group differences (a)
Group               M      $.50        $1          $5
Control             2.28   z = .95,    z = 1.33,   z = 2.45,
                           p < .20     p < .10     p < .01
$.50                2.26               z = 1.09,   z = 2.36,
                                       p < .15     p < .01
Control and $.50    2.27               z = 1.44,   z = 2.67,
                                       p < .08     p < .005
$1                  3.04                           z = 1.24,
                                                   p < .12
$5                  3.88

Note.—Overall difference between groups as assessed by
Kruskal-Wallis test: H = 14.50, p < .005.

(a) Tested by Mann-Whitney z, one-tailed.

A question of considerable interest is why 
the difference between the $.50 and $1 
groups shows up more clearly on a related 
issue rather than on the issue with which 
the essay was directly concerned. One possible 
interpretation emerges when we recall that 
the $.50 group does differ significantly from
the control group on the Rose Bowl issue 
but does not show such a difference on the 
athletic-scholarship issue. With this small 
amount of reward there may be a minimal 
likelihood that the induced attitude change 
will generalize to a similar issue; with the 
somewhat larger reward of $1 a somewhat 
stronger tendency toward generalization may 
be operative. 

On the basis of the findings that have so 
far been presented, the following conclusion 
seems warranted: when the design of the 
original study reported by Brehm and Cohen 
is altered so as to eliminate aspects that were 
likely to have generated evaluation apprehen- 
sion and unsuspected affect arousal, the pre- 
diction that guided the present study is 
confirmed and the original dissonance pre-
diction is disconfirmed.


DISCUSSION


This paper has combined two purposes: to 
present some propositions about how subjects’ 
perceptions of experimental situations may
affect their experimental performances; and, 
on this basis, to report an experimental re- 
examination of the dissonance-theory inter- 
pretation of attitude change due to counter-
attitudinal advocacy.

As regards the first purpose, the confirma-
tion of the predictions in the present study
lends support to the original propositions 
about evaluation apprehension and affect 
arousal; for it was in part on the basis of 
those propositions that the experimental pre- 
dictions were formulated. However, more 
direct investigation of these contaminating 
processes is possible and desirable. For ex- 
ample, in two recent studies the author has, 
by intention rather than by inadvertence, 
supplied cues to the subjects about types of 
responding that might connote maturity and
immaturity. In one of these studies some sub- 
jects were led to believe that mature persons 
like strangers more than immature people 
do while other subjects were led to believe 
the opposite. In a second study some sub- 
jects were led to believe that mature people 
perform well on dull arithmetic tasks while 
others were led to believe that immature 
people do better at such tasks. The results 
of these studies, to be reported elsewhere, 
strongly demonstrate the power of evaluation 
apprehension in controlling experimental 
responding. 

However, it is necessary that we go beyond 
such demonstration studies if these contami- 
nating processes are to be better understood 
and thus more effectively controlled. A 
number of questions remain to be investi- 
gated. Do such personality variables as pas- 
sivity, low self-esteem, and the need for social 
approval predict to the likelihood that evalu- 
ation apprehension will be aroused in the 
experimental situation? Does evaluation 
apprehension, once aroused, interact with 
experimenter bias (see Rosenthal, 1963) in 
a way that guides the subject in his hypothe- 
sizing about the kinds of responding that will 


win approval? Will exposure to psychological
perspectives, as in the introductory courses
from which so many subjects are drawn, tend
to heighten the likelihood of experiencing


seek negative evaluation for masochistic pur-
poses or as a way of asking for help, and
will this affect their experimental responding?
Can the presence of evaluation apprehension
be uncovered by postexperimental inquiry?
Comparable questions about the arousal of
aggressive and other contaminating affective
states could just as readily be formulated.
Indeed it would seem desirable in future
studies to attempt an operational separation
of the two types of contamination that have
been stressed in this paper.

In general the recently developed interest
in investigating the experimenter-subject in-
teraction as a source of bias in psychological
research is a long needed innovation. The
work of Orne (1962), Riecken (1962),
Rosenthal (1963) and others has provided a
most useful beginning. To the list of research-
contaminating processes that they have in-
vestigated, might well be added those that
have been discussed here.

As regards the second major focus of this
paper, do the present findings call the validity
of dissonance theory into question? Recently
there have been reported many challenging
studies testing that theory’s pertinence not
only to attitude change but also to perceptual
and motivational processes and even to learn-
ing phenomena.⁸ Thus the present study,
taken alone, cannot be interpreted as chal-
lenging the general theory as such. However,
it does seem to indicate that, at least in its
account of the attitude-change consequences
of counterattitudinal advocacy, dissonance
theory has been overextended.


8 For example, and despite the fact that the author
has found it possible to reinterpret one of the experi-
ments reported by them, the work of Brehm and
Cohen (1962) does seem to establish the relevance
of the dissonance approach to the study of certain
aspects of motivation and does so with considerable
inventiveness and concern for methodological rigor.
Similarly the work of Lawrence and Festinger (1962)
has opened a very interesting new line of inquiry
on some problems in the psychology of learning.
In the author’s view the kind of counter-
attitudinal performance that best fits the dis-
sonance paradigm is a simple overt act that
directly violates one’s private attitude (for 
example, eating or agreeing to eat a disliked 
food; expressing approval of a disliked pro- 
posal or candidate; merely committing one- 
self to develop counterattitudinal arguments; 
etc.). But when a person actually does elabo- 
rate a set of arguments opposite to his own 
attitude the dissonance he experiences is 
probably of much wider scope than disso-
nance analysis would have it; it encompasses
considerably more than merely realizing that 
he has argued against his own position. The 
broader pattern of inconsistency that he en- 
counters is that between the content and 
apparent plausibility, on the one hand, of
the new arguments that he has developed 
and, on the other hand, his original affective 
judgment of the attitude object. 

Thus the subject who opposes the Rose 
Bowl ban and then argues in favor of it 
may come up with some good arguments (for 
example, “If we ban going to the Rose Bowl 
we will improve our reputation as a serious 
University . . . we will draw better stu- 
dents,” etc.). In so doing he may become 
convinced of the validity of those arguments. 

This will produce intraattitudinal inconsist-
ency; that is, the newly established beliefs
relating the Rose Bowl ban to positive ends
and values will be inconsistent with the
original negative affect toward the ban. 

As was suggested earlier, the author’s theo- 
retical model (Rosenberg, 1956, 1960) takes 
this sort of inconsistency to be a basic condi- 
tion for the occurrence of attitude change. 
It will be useful to show how this alternative
model may be applied to interpreting the
process of counterattitudinal advocacy. From
this standpoint attitudes normally are stable,
affective-cognitive structures and feature con- 
siderable internal consistency. It is assumed 
that the production of inconsistency through 
change in either the affective or cognitive 
component (the latter being more usual and 
likely) will, if it transcends the individual’s 
tolerance limits, motivate further symbolic 
activity. This may lead either to the restoration
of the original attitude or, if this line of
defense is not available, to its reorganization
in the opposite direction. 

For the sequence that begins with cognitive 
alterations what is required is that the new 
cognitions be sufficiently internalized and difficult
to reverse; then the most likely outcome
will be for the affective disposition toward 
the attitude object to move in the direction 
consistent with the newly established cogni- 
tions. Thus attitude change in its conventional 
sense will have occurred. 

In this context a basic question is: what
will render the new, inconsistency-generating 
cognitions sufficiently internalized and dif- 
ficult to reverse? Many variables could have 
this influence; but in the present study the 
necessary suggestion would be that the most 
important is the amount of reward expected 
and received for developing such cognitions. 
Putting this another way it may be hypothe- 
sized that the demonstrated influence of the 
magnitude of payment upon ultimate attitude 
change is mediated through its effects upon 
the cognitive processes that are activated 
during the essay-writing task. 

Broadly speaking, two separate kinds of 
mediation are easily conceivable: the expecta- 
tion of payment for counterattitudinal advo- 
cacy may operate as an incentive and thus 
affect the quality of the arguments advanced 
in support of the new cognitions; the receipt
of payment may operate as a reinforcement 
that further fosters the internalization of the 
counterattitudinal cognitions; and of course 
the scope of these two processes would be
expected to vary as a function of the actual 
amount of payment. 

A subsidiary analysis of the essays them- 
selves tends to support and clarify this view. 
One unequivocal finding is that the $.50 and 
$1 groups differ in the actual number of 
words per essay, the latter group writing the 
longer ones (p < .05).° However, the $1 and 
$5 groups do not show any such difference. 
Considering that the $.50 and $1 groups do 
not differ on the Rose Bowl issue while the 


®All probability estimates reported in this dis- 
cussion are based on a one-tailed interpretation of 
the Mann-Whitney statistic; in each case it was pos- 
sible to make a unidirectional prediction about the 
attitude-change effects of the mediational variable 
under study. 




$1 and $5 groups do, sheer verbal productiv- 
ity does not seem to mediate the main effect. 
Furthermore, separate analyses within each 
of the three experimental groups reveal abso- 
lutely no relationship between essay length 
and the postessay attitude toward the Rose 
Bowl ban.
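
The one-tailed Mann-Whitney comparisons reported in this discussion can be illustrated with a minimal sketch. The function names and the scores below are hypothetical, and the exact permutation evaluation of the rank sum is only one common way of obtaining a one-tailed probability for small groups; the source does not say whether tables or an approximation were used.

```python
from itertools import combinations


def midranks(values):
    """Return 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_one_tailed(low, high):
    """Return (U for `high`, exact one-tailed p) for the prediction high > low."""
    ranks = midranks(list(low) + list(high))
    n_low, n_high = len(low), len(high)
    observed = sum(ranks[n_low:])
    # U counts the (low, high) pairs in which the `high` score is larger.
    u_high = observed - n_high * (n_high + 1) / 2
    total = 0
    extreme = 0
    # Enumerate every way the pooled ranks could be assigned to `high`.
    for combo in combinations(ranks, n_high):
        total += 1
        if sum(combo) >= observed - 1e-9:
            extreme += 1
    return u_high, extreme / total


# Hypothetical attitude scores for two small groups.
print(mann_whitney_one_tailed([1, 2, 3], [4, 5, 6]))  # (9.0, 0.05)
```

With two groups of 10 subjects the enumeration covers 184,756 assignments, which is still quick; for much larger groups a normal approximation would be the usual substitute.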

But do the essays vary in quality, in the 
actual persuasiveness with which they are 
written; and if so does this relate to the 
postessay attitude score? Two judges, work- 
ing without knowledge of the different reward 
conditions and using a 5-point scale, rated 
all the essays for their basic persuasiveness. 
As part of their instructions they were asked 
to ignore the length of essays “because a 
long one may often be less persuasive than 
a short one.” The interjudge reliability of 
these ratings proved quite adequate: for 80% 
of the essays the two ratings were either 
identical or within 1 point of each other. 
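
The agreement criterion used here (ratings identical or within 1 point) is simple enough to sketch. The ratings below are hypothetical, and treating the "pooled" rating as the mean of the two judges' ratings is an assumption, suggested only by the half-point scores mentioned in the text.

```python
def agreement_within(ratings_a, ratings_b, tolerance=1):
    """Share of essays whose two ratings differ by at most `tolerance` points."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two equally long, non-empty rating lists")
    close = sum(1 for a, b in zip(ratings_a, ratings_b) if abs(a - b) <= tolerance)
    return close / len(ratings_a)


def pooled_scores(ratings_a, ratings_b):
    """Mean of the two judges' ratings for each essay (assumed pooling rule)."""
    return [(a + b) / 2 for a, b in zip(ratings_a, ratings_b)]


# Hypothetical ratings of five essays on the 5-point scale.
judge_a = [1, 2, 4, 5, 3]
judge_b = [2, 2, 1, 4, 5]
print(agreement_within(judge_a, judge_b))  # 0.6
print(pooled_scores(judge_a, judge_b))     # [1.5, 2.0, 2.5, 4.5, 4.0]
```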

Six of the 20 subjects in the combined $.50 
and $1 group had persuasiveness scores that 
were lower (1 and 1.5, based on the pooled
ratings of the two judges) than any that 
occurred in the $5 group. Four of these 6 
subjects also had extreme negative attitudes. 
A comparable finding is obtained when we 
split the $.50-$1 group into approximately 
equal low persuasiveness and high persuasive- 
ness halves. Those who wrote comparatively 
unpersuasive essays show significantly more 
attitudinal negativism toward the Rose Bowl 
ban than those who wrote comparatively persuasive
essays (p < .03). When the same sort
of analysis is separately performed with the 
$.50 and $1 groups, respectively, similar find- 
ings are obtained with borderline significance 
(p < .10 in both instances). On the other
hand within the high-reward group a division 
of the subjects into those who got the five 
lowest (though not as low as the comparable 
subjects in the low-reward group) and the 
five highest persuasiveness ratings does not 
yield any corresponding difference in attitudes 
toward the Rose Bowl ban. 

An exactly similar finding is obtained when 
we use as the estimate of persuasiveness 
not the judges’ ratings but the subjects’ own 
postexperimental judgments of “how strong” 
their essays actually were. In the combined 
low-reward group those below the median in 


their self-ratings are less favorable to the
Rose Bowl ban than those above the median
(p < .05). Again no such effect is discovered
in the high-reward group.

From these findings it may be concluded
that one mediating source of the overall
difference between the low- and high-reward
groups is that some of the subjects in the
former group were insufficiently motivated
by the small reward that had been promised
them: in consequence they wrote insufficiently
developed essays, essays that were essentially
unpersuasive to themselves. Thus it would
seem appropriate to conclude that the overall
positive relationship between reward and
attitude change reflects the operation of an
incentive or effort variable.

However, our analysis need not stop at this
point. While some low-reward subjects wrote
essays that are rated as extremely low in
persuasiveness others did not. Thus it is possible
to match the low- and high-reward
groups on this factor and by so doing . . .
1.5) on the 5-point persuasiveness scale.
With persuasiveness thus equalized (actually
the mean persuasiveness score is then slightly
higher for the remaining low-reward subjects
than for the high-reward subjects) the
high-reward group still shows significantly
greater acceptance of the Rose Bowl ban
(p < .05) and also of the proposal that
athletic scholarships be abandoned (p < .05).
These last findings do thus seem to confirm
the expectation that, in addition to the
incentive effect of variations in promised
reward, there is yet another factor that contributes
to the positive relationship between
reward and attitude change. It would be
reasonable to interpret this other factor as
based not upon the promise of reward but
rather upon its receipt; thus our original
conjecture that a reinforcement dynamic
may be operative seems, on these grounds,
to be rendered more plausible.

The use here of the term reinforcement
should not, of course, be taken as referring
solely to the kinds of relationships emphasized
in conventional models of instrumental
learning. In the present study the $5 payment,
once received, could have increased
the habit strength of the improvised counter- 
attitudinal cognitions by directly increasing 
their attractiveness and credibility. Similarly, 
working for an expected large reward could 
have made the essay-writing a more ego- 
involving task and thus could have sensitized 
the subject to pay closer attention to the 
persuasive worth of his own arguments or
to find greater merit in them. Furthermore, 
the amount of payment may also have affected
the very clarity with which the new
counterattitudinal arguments were remembered
after the essay writing session.

In this discussion we have attempted to 
state, and also to present some additional
data in support of, a consistency theory view 
of how counterattitudinal advocacy produces 
attitude change. That view can be sum- 
marized in the following set of propositions: 
the counterattitudinal improvisation estab- 
lishes new cognitions that are inconsistent 
with the original attitudinal affect; the extent 
to which the affective judgment of the object 
will move toward the content of these new 
cognitions will depend upon the degree of
affective-cognitive inconsistency they generate;
this in turn will depend upon the
strength and stability of the new cognitions; 
the strength and stability of the new cognitions
are influenced, among other things,
by the degree of reward received for their 
improvisation—and this is probably due 
both to the promised reward operating as 
an incentive and the received reward as a 
reinforcement; in consequence when counter- 
attitudinal advocacy is investigated in a way 
that circumvents certain biasing factors it
will be found, as in the present study, that
it produces attitude change in proportion
to the magnitude of the reward for such 
advocacy. 

Turning again to dissonance theory and 
shifting from its approach to one type of 
attitude change to its approach toward at- 
titude change generally, the author would 
venture the judgment that dissonance re- 
search in this area has been complicated by 
certain difficult methodological and interpretive
issues. Thus, as Chapanis and Chapanis
(1964) have noted, it is common to many 
of these studies that they do not investigate 
the subject’s personal reactions to the 
dissonance-arousing situation, that the magnitudes
of attitude change are often quite
small and that often a rather large number
of subjects is, for one or another theory- 
based reason, eliminated from analysis. To 
this must be added the present demonstration 
that, in experiments on counterattitudinal 
advocacy, certain data-biasing processes may 
be invoked to account for reported findings. 
Indeed, since dissonance studies on other 
types of attitude change also place some, 
but not other, subjects in highly puzzling 
and unexpected situations it should be recog- 
nized that in these studies as well biased 
contamination may often affect the results 
obtained. In the light of all these points 
it would seem desirable to undertake an 
empirical reexamination of some of the major 
studies that have been offered as confirming 
the dissonance analysis of attitude change. 
In the opinion of the present author the
consequence of such reexamination would not 
be the disconfirmation of the dissonance view 
of attitude processes but the discovery that
its generality is of somewhat smaller scope 
than its advocates have estimated and that 
certain kinds of attitude change are better 
predicted and accounted for by other theories. 

INFLUENCE OF JUDGES’ ATTITUDES ON RATINGS OF
FAVORABLENESS OF STATEMENTS ABOUT
A SOCIAL GROUP 1


MARISA ZAVALLONI 2 AND STUART W. COOK 3
Research Center for Human Relations, New York University 


Replication of a study by Hovland and Sherif confirmed their central hy- 
pothesis, that scale values assigned to items are influenced by raters’ attitudes. 
However, the present findings require modification of their more specific 
hypotheses in 3 respects: (a) ratings by 5 criterion groups spaced along an 
attitude continuum show a regular order, whereas Hovland and Sherif hypothesized
that ratings are influenced by attitudes only in the case of raters
with extreme attitudes and high involvement; (b) ratings of unfavorable as
well as of neutral items are influenced by raters’ attitudes; (c) the prediction 
that extreme judges make fine discriminations at their own end of the scale 
and lump together items at the other end is supported only with respect to
unfavorable items.


The purpose of this study was to explore 
the possibility of using ratings of favorableness
of statements about a social group as an
indirect measure of raters’ attitude toward 
the group. This possibility was suggested by 
Hovland and Sherif (1952) in discussing their 
finding that, contrary to earlier beliefs, judges’ 
attitudes influence their assignment of scale 
values to items in the construction of a Thurstone-type
scale. In the years since the Hovland
and Sherif paper, there have been a number
of investigations of the relation between
judges’ attitudes toward a social object and 
their ratings of the favorableness of state- 
ments about the object. Almost without ex- 
ception, these studies have found some rela- 
tion between judges’ attitudes and their rat- 
ings of items, but findings about the nature 
of the relation have not been consistent. Most 
relevant to our problem are studies by Prothro 
(1955, 1957) and by Upshaw (1962), which, 
like the Hovland-Sherif study, dealt with attitudes
toward, and statements about, racial,
religious, or nationality groups. While the re- 
sults of all of these studies seem to us to be 


1 This study was part of a program of research on 
the measurement of social attitudes supported by 
grants from the National Science Foundation and 
the Air Force Office of Scientific Research. The gen- 
eral framework of the program is presented in Cook 
and Selltiz (1964). 

2 Now at New York State Division for Youth,
New York City.

3 Now at the University of Colorado.
consistent with Hovland and Sherif’s central
hypothesis that judges’ attitudes toward a so- 
cial group influence their ratings of statements 
about the group, they do not support Hovland
and Sherif’s more specific hypotheses:
that judges with extreme attitudes at either 
end of the scale tend to lump together items 
at the opposite end of the scale, piling them 
in the extreme categories; that only raters 
with extreme positions and strong involve- 
ment make such displacements; that ratings 
of neutral and ambiguous items are most in- 
fluenced by raters’ attitudes. 

Because of our interest in the possible usefulness
of ratings as an indirect measure of
attitude, we addressed ourselves, within the
general framework of a replication of the 
Hovland-Sherif study, to two major ques- 
tions: 

1. Do groups of subjects spaced at various 
points along the attitude continuum differ 
systematically in their ratings of statements? 
If, as Hovland and Sherif suggested, the effect 
is limited to subjects with extreme positions 
and strong involvement, such ratings hold 
little promise as an attitude measure. 

2. What kinds of items—in terms of con- 
tent, structure, or degree of favorableness— 
show greatest and most consistent differences 
in ratings by subjects with different attitudes? 
If one or more kinds of items that are espe- 
cially susceptible to displacement can be 
identified, it should be possible to increase 
the sensitivity of the technique as an attitude
measure by constructing additional items 
along these lines. 


METHOD 
Subjects 


Two substudies were carried out, differing in the 
basis on which subjects’ attitudes were inferred. In 
one, subjects were classified on the basis of their 
membership in organizations or their participation 
in activities such that it seemed reasonable to believe
that many, if not all, of their members would
hold certain attitudes toward race relations. In the 
other, a large number of heterogeneous subjects was 
tested, and subgroups were selected for comparison 
on the basis of their replies to a self-report inventory
of attitude toward Negroes. In both studies, all
subjects were college students. 

In the first study, five criterion groups were used: 




Group I: Subjects assumed to have strongly 
equalitarian views on race relations and to be ex- 
tremely involved in the issue. Negro students 
active in organizations working for integration 
(N=45). 

Group II: Subjects assumed to have strongly 
equalitarian attitudes and to be actively concerned 
with race relations. White students active in the 
same organizations as Group I (N = 69). 

Group III: Subjects assumed to have equalitarian
attitudes but not to be actively concerned.
We assumed that students taking elective courses 
in intergroup relations fit this description, since 
they voluntarily expose themselves to information 
about minority groups which is typically presented 
with an equalitarian view, but are not taking part 
in efforts to change race relations. Information
about organizational membership was obtained in
order to separate out from this group subjects
who belonged in Group I or II (N = 143).

Group IV: Subjects assumed to have somewhat 
anti-Negro attitudes but not to be actively con- 
cerned with questions of race relations. It is diffi- 
cult to identify organizations membership in which 
provides a satisfactory basis for this assumption. 
However, an earlier study in this program (Selltiz 
& Cook 4) had found a higher proportion of
moderately or extremely anti-Negro subjects in 
right-wing political organizations than in other 
student groups; thus subjects for Group IV were 
recruited from these organizations (N = 79). 

Group V: Subjects assumed to be anti-Negro 
and to be actively concerned with race relations. 
“Pledges” of fraternities known to have led a 
campaign against the admission of Negro students 
to a border-state university (N = 40). 


4 Unpublished study entitled “An Exploration of
Characteristics of the PDPQ (Paired Direct and . . . tudes.”

students with points of view on public issues.” Payment
of $1.50 was made for each such subject who
participated, the payment being made either to the
organization’s treasury or to the individual subject,
as the organization chose. Subjects in intergroup
relations classes were tested during class time.

Obviously, this is not an ideal design. The groups
are not evenly spaced along the continuum: Groups
I, II, and III are probably fairly close together
toward the equalitarian end; there is no “middle” or
“neutral” group; Group V, although clearly unfavorable,
does not represent the extreme possible position
at this end of the continuum; and the location
of Group IV is quite uncertain, except that it
can be presumed to be somewhere between Groups
III and V, or perhaps even between the hypothetical
midpoint and Group V. Nor do the groups all provide
equally good bases for inferences about attitudes
toward Negroes. Group IV is especially weak
in this respect; its predicted location between Groups
III and V is based on an assumed average position
summarizing a rather wide distribution of members’
attitudes rather than a probability that the attitudes
of most of its members are close to a specified position,
as in the case of the other groups. In addition,
other variables enter differentially. Subjects in
Group I were Negroes; subjects in all other groups
were white. Subjects in Group V were attending a
university in a border state; all other subjects were
at colleges or universities in the northeast, and predominantly
in the New York metropolitan area.

Nevertheless, it seems to us reasonable to consider
these groups as falling on an ordinal scale of attitude
toward Negroes. The deviations from the model
of groups equally spaced along a single continuum
do not seem likely to produce a consistent bias with
respect to the hypotheses being tested. On the one
hand, the fact that the two extreme groups differ in
other characteristics (race and geographical location)
as well as in assumed position on the attitude continuum
would probably work in favor of the hypothesis
that the extreme groups of raters will differ
in the scale values they assign to items. On the other
hand, the relatively small difference between Groups
I and II in assumed position on the attitude continuum,
plus the uncertainty about the location of
Group IV, would probably work against the hypothesis
that ratings of items will show a predicted
order among all five groups as well as between the
extreme groups.

Use of such criterion groups is costly in time and
money and limits the number of potential subjects.
Therefore we supplemented this approach with the
more usual one of selecting from a large group of
heterogeneous subjects subgroups who could be located
on the attitude continuum on the basis of their
responses to a self-report inventory of attitude toward
Negroes. These subjects were all students in an
introductory psychology course at a midwestern university.


JUDGES’ ATTITUDES AND SCALE VALUES 


Materials and Procedure 


For the sake of comparability we used the set of
114 items used by Hinckley (1932) in constructing
his scale of attitude toward the social position of
the Negro.5 These were the items used both by Hovland
and Sherif (1952) and by Upshaw (1962). The
instructions we gave for sorting the cards were essentially
those described by Thurstone (Thurstone &
Chave, 1929), including designation of Category 6
as representing a neutral position.6
After they had finished sorting the cards, the sub- 
jects were asked to indicate which category came 
closest to their own position; which was most ob- 
jectionable to them; and which, if any, categories 
other than the one closest to their own position were 
also acceptable to them. 
In the study in which the subjects’ attitudes were 
inferred on the basis of their replies to a self-report
measure, an inventory of attitude toward Negroes
developed by Collins (M. E. Collins & E. Haupt,
personal communication) was used. This inventory 
consisted of 60 statements with response alternatives 
of “Agree” and “Disagree,” balanced for numbers of 
Agree and Disagree responses scored as prejudiced 
and unprejudiced. It included approximately equal 
numbers of items considered as representing cognitive,
emotional, and policy orientation aspects of
attitude. This inventory was administered after the
cards had been sorted and after the subjects had 
indicated which of the categories they found accept- 
able and objectionable. 


Analysis


On the basis of findings reported by Hovland and 
Sherif, by Prothro, and by Upshaw (plus examina- 
tion of the complete data of the Upshaw study which 
he very kindly made available to us), it seemed ad- 
visable to treat separately the ratings of favorable, 
unfavorable, and intermediate items. Decisions as to 
which items are to be considered as belonging in each 
of these categories are to some extent arbitrary. We 
used as the basis for this classification the median 
scale values assigned by anti-Negro subjects in Up- 
shaw’s study. We treated as unfavorable the items 
to which this group assigned a median scale value 
of 4.99 or less (58 items); as intermediate, items assigned
a median scale value from 5.00 to 7.99 (23
items); as favorable, those given a median scale
value of 8.00 or more (33 items).7
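
The three-way classification just described amounts to two cut points on the median scale value. A minimal sketch follows; the function name is hypothetical, and it assumes scale values reported to two decimals, so that "4.99 or less" and "below 5.00" coincide.

```python
def classify_item(median_scale_value):
    """Place an item in a section of the 11-category scale by its median scale value."""
    if median_scale_value < 5.00:   # 4.99 or less
        return "unfavorable"
    if median_scale_value < 8.00:   # 5.00 to 7.99
        return "intermediate"
    return "favorable"              # 8.00 or more


# Hypothetical median scale values for four items.
for value in (4.99, 5.00, 7.99, 8.00):
    print(value, classify_item(value))
```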


5 We wish to thank E. D. Hinckley for making
available to us not only the entire set of statements
but the scale values assigned to each item by his
three criterion groups: Negroes, whites favorable to
Negroes, and whites unfavorable to Negroes.

6 The instructions given the subjects in the Hinckley,
Hovland-Sherif, and Upshaw studies do not
seem to have mentioned the neutral point. However,
in the Upshaw study, the card for Category 6 carried
the label “Neutral” (H. S. Upshaw, personal
communication).

7 Two items that were placed at the favorable end
of the scale on the basis of median ratings by Up-

In comparing ratings by groups of subjects with
different attitudes, three scores were computed for 
each group: the mean ratings of the unfavorable 
items, of the intermediate items, and of the favorable
items. We predicted that the mean ratings by
Groups I through V would follow a regular order, 
with Group I lowest, Group V highest. Since the 
chance probability that five objects will appear in
any specified order is 1 in 120, the hypothesis will 
be considered supported if the mean scores appear 
in the predicted order. 
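
The figure of 1 in 120 follows from there being 5! = 120 possible orderings of five groups, of which exactly one is the predicted order. The sketch below checks this by enumeration; the function names are hypothetical, and requiring strictly increasing means is an assumption, since the text does not discuss ties.

```python
from itertools import permutations
from math import factorial


def chance_of_specified_order(n_groups):
    """Probability that n distinct objects fall in one specified order by chance."""
    return 1 / factorial(n_groups)


def in_predicted_order(means):
    """True if the means rise strictly from the first group to the last."""
    return all(a < b for a, b in zip(means, means[1:]))


# Enumerate every ordering of five groups and count those in the predicted order.
orders = list(permutations(range(5)))
matching = sum(1 for order in orders if in_predicted_order(order))
print(len(orders), matching)  # 120 1
```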

In the study in which subjects were grouped on 
the basis of their scores on the self-report attitude 
inventory, three subgroups were compared: subjects 
in the most prejudiced quintile (scores 9-39, N = 38), 
those in the middle quintile (scores 48-52, N = 34), 
and those in the least prejudiced quintile (scores 
56-59, N = 41). The significance of the differences
among these groups in mean scale values assigned to 
the unfavorable, intermediate, and favorable items 
was tested by analysis of variance. 
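
A one-way analysis of variance of the kind referred to here compares the variation between group means with the variation within groups. The following is a minimal sketch with hypothetical scores; it is not a reconstruction of the authors' computation.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of lists of scores."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    # Between-groups sum of squares: group size times squared deviation of its mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Within-groups sum of squares: squared deviations from each group's own mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between = k - 1
    df_within = n_total - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# Hypothetical mean scale values for three subjects in each of three subgroups.
print(one_way_anova_f([[1, 2, 3], [2, 3, 4], [3, 4, 5]]))  # (3.0, 2, 6)
```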

In comparing the rating behavior of subjects with 
different attitudes, we also examined the number of 
statements placed by each subgroup in the two ex- 
treme categories, that is, the number of items given 
median scale values of less than 2.00 (extremely unfavorable)
and the number given median scale values
of more than 10.00 (extremely favorable). 
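
Counting the items that fall in the two extreme categories can be sketched as follows. The data structure and names are hypothetical, and the plain median of the category placements is used as a simplification; the text does not say how the median scale values were interpolated.

```python
from statistics import median


def count_extreme_items(placements_by_item, low=2.00, high=10.00):
    """Count items whose median placement is below `low` or above `high`.

    `placements_by_item` maps an item id to the list of categories (1-11)
    in which the members of one subgroup placed that item.
    """
    n_low = 0
    n_high = 0
    for placements in placements_by_item.values():
        scale_value = median(placements)
        if scale_value < low:
            n_low += 1
        elif scale_value > high:
            n_high += 1
    return n_low, n_high


# Hypothetical placements of three items by three raters.
sample = {"item_a": [1, 1, 2], "item_b": [11, 11, 10], "item_c": [5, 6, 7]}
print(count_extreme_items(sample))  # (1, 1)
```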

In examining the relative effectiveness of different 
types of items in discriminating among subjects with
different attitudes, the median scale values assigned 
to each item by different subgroups were compared. 
Since the purpose of this part of the analysis was 
not to test any specific hypothesis but simply to pro- 
vide guidance in constructing additional items, no 
statistical tests were applied. This item-by-item com- 
parison was carried out only in the study based on 
the five criterion groups. 


RESULTS 


Mean Ratings by Groups with Different Attitudes


Table 1 shows the mean values assigned by 
different groups of subjects to the unfavorable,
intermediate, and favorable items. Considering
first the upper part of the table,
showing the values assigned by the five cri- 
terion groups, it can be seen that, taking each 
set of items as a whole, the five groups agreed 
in their placement of items in the different 
sections of the scale; there is no overlap be- 
tween, for example, the mean rating of un- 
favorable items by Group V and the mean 
rating of intermediate items by Group I. However,
within the unfavorable and intermediate
sections, the ratings differ in the predicted
order; Group I gives the lowest (most unfavorable)
mean ratings for both unfavorable
and intermediate items, Group V the highest.
Mean ratings of the favorable items, however,
do not follow the predicted pattern.
Differences among the groups in the mean
ratings of these items are small and irregular.
To the extent that any trend can be distinguished,
it is in the direction opposite to that
predicted; that is, the more equalitarian subjects
show some tendency to rate these items
higher (more favorable) than the more anti-Negro
subjects.

The same pattern appears among the subgroups
selected from a pool of subjects on the
basis of their scores on the attitude inventory.
For the unfavorable and the intermediate
items, the least prejudiced subjects give
the lowest mean rating (that is, judge the
items as least favorable), the most prejudiced
give the highest mean rating. Differences on
both sets of items are significant at less than
the .01 level, by analysis of variance; F =
6.04 for the unfavorable items, 14.02 for the
intermediate. Again, differences in ratings
of the favorable items are irregular, with a
slight, nonsignificant tendency for the most
prejudiced subjects to rate them as least
favorable.


TABLE 1

MEAN RATINGS OF UNFAVORABLE, INTERMEDIATE,
AND FAVORABLE ITEMS BY DIFFERENT GROUPS

Group                       N a    Unfavorable   Intermediate   Favorable

Mean ratings by criterion groups

I                           . . .     . . .         . . .         . . .
II                          . . .     . . .         . . .         . . .
III                         138       2.14          4.97          8.87
IV                          . . .     . . .         . . .         . . .
V                           . . .     . . .         . . .         . . .

Mean ratings by subjects selected from heterogeneous
group on basis of attitude inventory

Least prejudiced quintile   38        2.14          4.78          8.82
Middle quintile             33        2.46          5.77          8.87
Most prejudiced quintile    22        2.64          5.81          8.46

a Ns shown here differ from those given in the text because the
subjects who made 12 or more “errors” were omitted. Displacement
of a favorable item to the unfavorable end of the scale, or
of an unfavorable item to the positive end, was considered an
error . . . on the basis of ratings by Upshaw’s subjects, were
considered as errors due to confusion as to which was the
favorable end of the scale.


shaw’s anti-Negro subjects but that were frequently
placed at the unfavorable end by subjects in our
pretests were omitted from our analysis; this left 31
items at the favorable end of the scale.


Number of Items Placed in Extreme Categories
by Groups with Different Attitudes

Table 2 shows the numbers of items placed
in the extreme categories by different groups
of subjects. Considering first the upper part
of the table, showing the numbers of items
placed in the extreme categories by the five
criterion groups, the results for the unfavorable
items are again as would be predicted on the
basis of Hovland and Sherif’s hypothesis that
extreme judges pile many items in the extreme
category opposite to their own position;
the three equalitarian groups give many more
items a median value of less than 2.00 than
do the two anti-Negro groups. But, as
with the mean scores, the results for the
favorable items are not as would be predicted
by the Hovland-Sherif hypothesis; in fact
they are in the opposite direction, with the
three equalitarian groups placing more items
in the most favorable category than the two
anti-Negro groups. For both sets of items,
although the five groups show a regular order
(except for the minor reversal between Groups
IV and V on the favorable items), the difference
between the three equalitarian groups
and the two anti-Negro groups is far more
striking than the order across the five groups.
Again, the subgroups selected from a pool
of subjects on the basis of the attitude inventory
show a pattern similar to that of the
criterion groups; the most prejudiced subjects
place many fewer items in either of the
extreme categories than do the less prejudiced
subjects.


TABLE 2

NUMBER OF ITEMS PLACED IN EXTREME CATEGORIES
BY DIFFERENT GROUPS

                            Number of items given scale values
Group                       1.00-1.99        Above 10.00

Criterion groups

I                           . . .            . . .
II                          . . .            . . .
III                         40               . . .
IV                          . . .            . . .
V                           . . .            . . .

Subjects selected from heterogeneous group on
basis of attitude inventory

Least prejudiced quintile   38               . . .
Middle quintile             . . .            . . .
Most prejudiced quintile    14               . . .


Group Differences in Ratings of Various
Types of Items 


In seeking to find whether some kinds of 
items are more subject to displacement than 
others, we first ordered items within each
subset (favorable, intermediate, unfavorable) 
on the basis of the size of the difference be- 
tween the median ratings by our extreme cri- 
terion groups—northern Negro members of 
organizations working actively for integration, 
and white members of border-state fraternities 
known to have campaigned against desegrega- 
tion. We then examined the ratings by all 
five groups to see whether items that showed 
large differences between the extreme groups 
also showed a regular order among the less 
extreme groups of subjects.8
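
The two steps just described, ordering items by the gap between the extreme groups and then checking for a regular order across all five groups, can be sketched as follows. All names and values are hypothetical; using the absolute difference and treating "regular order" as non-decreasing medians from Group I to Group V are assumptions.

```python
def rank_items_by_extreme_gap(group_i_medians, group_v_medians):
    """Order item ids by the size of the Group I versus Group V median gap, largest first."""
    gaps = {
        item: abs(group_v_medians[item] - group_i_medians[item])
        for item in group_i_medians
    }
    return sorted(gaps, key=gaps.get, reverse=True)


def shows_regular_order(medians_i_to_v):
    """True if the medians never decrease from Group I through Group V."""
    return all(a <= b for a, b in zip(medians_i_to_v, medians_i_to_v[1:]))


# Hypothetical median scale values for three unfavorable items.
group_i = {"item_a": 1.2, "item_b": 1.1, "item_c": 2.0}
group_v = {"item_a": 1.4, "item_b": 3.5, "item_c": 3.0}
print(rank_items_by_extreme_gap(group_i, group_v))   # ['item_b', 'item_c', 'item_a']
print(shows_regular_order([1.1, 1.6, 2.0, 2.9, 3.5]))  # True
```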

Unfavorable Items

The 58 items classified as being unfavor- 
able have three major themes: inferiority of 
Negroes, importance and rightness of segrega- 
tion, and denial that Negroes have rights that 
should be respected. All were placed well 
below the midpoint of the scale by all groups 
of subjects. All but one were rated lower 
(more unfavorable) by Group I than by 
Group V. For 15 of the items, however, the
median scale values assigned by Groups I 
and V differed by less than .50. These are 


8 A 13-page table, showing all the items and the
median scale value assigned to each item by each of 
the five criterion groups, has been deposited with the 
American Documentation Institute. Order Document 
No. 8105 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress, Wash- 
ington, D. C. 20540. Remit in advance $1.75 for 
microfilm or $2.50 for photocopies and make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


all extremely unfavorable, asserting either 
that Negroes are subhuman (for example, 
“The Negro will always remain as he is— 
a little higher than the brutes”) or that 
Negroes have no rights that must be recog- 
nized—not even the right to live (for ex- 
antple, “Race purity of the whites must be 
attained at all costs, even to the extermina- 
tion of the Negro”). All of these items were 
rated as extremely unfavorable by subjects 
in all groups; all but one had median ratings 
of Jess than 1.50 in Group V and less than 
1.25 in Group I. Three of these items are 
among those for which Hovland and Sherif 
give detailed findings, with the same results— 
median ratings of less than 1.50 by anti- 
Negro whites and less than 1.25 by actively 
involved Negroes. 

The other 42 unfavorable items show dif- 
ferences of .50 to 3.36 in median scale values 
assigned by Groups I and V. In general, the 
less extreme the statement, the greater the 
difference between the median ratings by 
the two extreme groups. For example, 
“Negroes must undergo many years of civi- 
lization before they may be said to reach the 
social level of the whites” and “Social segrega- 
tion of races is the only solution to the 
Negro problem” show differences of 2.00 or 
more in median ratings by Groups I and V, 
whereas “The Negro is by no means fit for 
social equality with the commonest white 
people” shows a difference of less than 1.00. 
On the whole, the size of the difference be- 
tween the median ratings by these two groups 
is a function of the rating by Group V, the 
actively anti-Negro subjects. This group made 
discriminations among the unfavorable state- 
ments; for example, they saw the statement, 
“A Negro at the front door is an impostor; 
a Negro at the back door may be acceptable,” 
as definitely less unfavorable than the state- 
ment, “The Negro will always remain as he 
is—a little higher than the brutes.” Group I, 
however—northern Negroes actively con- 
cerned with race relations—saw these two 
statements as equally unfavorable. Two of 
the items for which Hovland and Sherif give 
detailed findings are among these 42 items, 
and they show the same pattern in the two 
studies. 

Consideration of the unfavorable items in 
terms of the median ratings by all five cri- 
terion groups, rather than simply by the two 
extreme ones, gives a similar picture. Of the 
58 negative items, roughly half (28) showed 
the predicted regular pattern in ratings by 
Groups I through V. Moreover, the larger 
the discrepancy between the ratings by 
Groups I and V, the greater the probability 
of a regular order for the five groups. Thus, 
of the 15 items where the median ratings 
by Groups I and V differed by less than .50, 
only 3 showed a regular order of ratings by 
all five groups; at the other extreme, of the 
14 items with differences of 2.00 or more 
between the median ratings by Groups I and 
V, 11 showed a regular order of ratings by 
all five groups. 


Intermediate Items 


The 23 items classified as being in the 
middle section of the scale have a variety 
of content: expressions of lack of interest or 
knowledge (“I am willing to let the Negroes 
work out their own destiny”); statements 
whose implications as to favorableness or un- 
favorableness are not clear (“Newspapers 
greatly exaggerate the differences between the 
white and colored races”); statements of 
unspecified differences between the races (“In 
our efforts to help the Negro, we must not 
blind ourselves to the definite and marked 
differences which actually exist between the 
two races”); statements of differences among 
Negroes (“So great is the social range be- 
tween the highly educated Negro and the 
‘nigger’ that the race as a whole cannot be 
assigned to any one notch in the social 
scale”); endorsements of a “separate but 
equal” policy (“The Negro should have the 
advantage of all social benefits of the white 
man but be limited to his own race in the 
practice thereof”); statements that attribute 
both positive and negative characteristics to 
Negroes (“The rich spiritual life of the Negro 
compensates adequately for the defects in 
his nature”). 

All 23 of these items were rated as more 
favorable by Group V than by Group I, with 
only 3 showing a difference of less than .50 
between median ratings by these two groups. 
These 3 are unequivocal statements of lack 
of interest in or knowledge about Negroes 
or race relations, which actively involved 
northern Negroes and anti-Negro southern 
whites both place squarely at the midpoint 
of the scale (median values 5.77 to 6.05 in 
Group I, 5.90 to 6.15 in Group V). In this 
connection, it should be noted that instruc- 
tions to the subjects in this study specified 
that Category 6 represented a neutral position. 

On the whole, the largest differences be- 
tween the median ratings by Groups I and V 
appeared on the “separate but equal” state- 
ments and those attributing both positive and 
negative characteristics to Negroes. All of 
these items were seen as unfavorable by 
Group I (median ratings below 5.00), while 
all of them were placed in the middle range 
by Group V. Three of the four midscale 
items for which Hovland and Sherif give 
detailed data are of this type; the fourth is 
a statement of differences among Negroes. 
All of them show an extreme form of the 
pattern described here, being rated very 
slightly lower by Hovland and Sherif’s ac- 
tively involved Negroes than by our Group I 
and somewhat higher by their anti-Negro 
whites than by our Group V. 


As with the negative items, consideration 
of the midscale items in terms of the median 
ratings by all five criterion groups gives a 
picture similar to that based on the ratings 
by the two extreme groups. Of the midscale 
items, roughly half (11) showed the pre- 
dicted regular pattern in ratings by Groups I 
through V. The larger the discrepancy be- 
tween the ratings by Groups I and V, the 
greater the probability of regular order for 
the five groups. Thus, of the 7 items where 
the median ratings by Groups I and V dif- 
fered by less than 1.00, none showed a regular 
arrangement of ratings by all five groups; at 
the other extreme, of the 12 items with dif- 
ferences of 2.00 or more between the median 
ratings by Groups I and V, 9 showed a 
regular order of ratings by all five groups. 


Favorable Items 


The items classified at the favorable end 
of the scale show a variety of content: the 
implications of some with respect to the 
favorable-unfavorable dimension seem unclear 
(“If you give freedom, education, and the 
Christian religion to colored men, you cannot 
confine them to a future of permanent sub- 
ordination”); some, although ostensibly fa- 
vorable, have a derogatory tone (“A wide 
awake Negro is physically superior and in 
other respects equal to the white man”); 
some attribute positive characteristics—usu- 
ally “spiritual”—to Negroes (“The Negro 
must possess a much deeper moral nature than 
the white man, since he has progressed in the 
face of far greater obstacles”); some attribute 
difficulties in race relations to whites rather 
than Negroes (“Our refusal to accept the 
Negro is not based on any fact in nature 
but rather on prejudice, and should be over- 
come”); some state that Negroes and whites 
are equal in ability (“Inherently, the Negro 
and the white man are equal”); some advo- 
cate a social policy of equal treatment (“The 
Negro should be given the same educational 
advantages as the white man”). There are 
no clear, prointegration statements, and none 
expressing a positive view that a multiracial 
society has values not found in a society 
dominated by one race. 

The differences between Groups I and V 
in their median ratings of these items are 
smaller than for the unfavorable and inter- 
mediate items, and they are not consistent 
in direction. Of the 31 items classified as 
favorable, 11 showed differences of less than 
.50 between the median ratings by the two 
extreme groups. 

Of the items where the median ratings dif- 
fered by .50 or more, eight were rated as 
more favorable by Group V than by Group I 
(that is, the difference was in the same direc- 
tion as in the case of the unfavorable and 
intermediate items). Most of these are items 
with a derogatory or condescending tone. 
Although both groups placed all these items 
above the midpoint of the scale, the median 
ratings by Group I placed all of them in the 
intermediate range (median values of 6.23 to 
7.61), whereas Group V rated most of them 
as moderately favorable (median values 7.28 
to 9.54). 

Most of the 12 items on which the direction 
of difference was reversed (that is, that were 
rated more favorable by the northern Negroes 
than by the segregated southern fraternities) 
are clear statements that the Negro is equal 
to the white man and should be so treated. 




On the whole, items that stress a social policy 
of equal treatment show larger differences 
than those that simply state that the two races 
are equal in ability. Most of these items on 
which the direction of differences between 
the extreme groups is reversed were rated 
as very favorable by Group I (9 of the 12 
were given median scale values of 10.00 or 
more by Group I); although all were seen 
as favorable by Group V, none has a median 
rating above 9.50 in that group. Two of 
these items are among those reported in detail 
in the Hovland-Sherif study, where they 
show the same pattern found here, that is, 
they are rated higher by involved Negroes 
than by anti-Negro whites. 

The favorable items were even less con- 
sistent in ordering the five criterion groups 
than in distinguishing between the two ex- 
tremes. There was no favorable item on which 
the median ratings of Groups I through V 
followed a regular order, either in the direc- 
tion found in other sections of the scale or 
in the reverse direction. 


Discussion 


The results of this study strongly confirm 
Hovland and Sherif’s central hypothesis, that 
ratings of the favorableness of items are in- 
fluenced by raters’ attitudes. However, they 
require modification of three of the more 
specific hypotheses advanced by those investi- 
gators, having to do with the fineness of dis- 
criminations made by the technique, the 
nature of the items that are most susceptible 
to displacement, and the patterns of ratings 
of items at opposite extremes of the con- 
tinuum. 


Characteristics of Subjects Who Displace 
Items 


Hovland and Sherif suggested that ratings 
are influenced by raters’ attitudes only in the 
case of raters with extreme positions and 
strong ego involvement in the issue to which 
the statements refer. In the present study, 
however, mean ratings of both intermediate 
and negative items were in the predicted 
order for five criterion groups assumed to 
be in ordinal, but not equidistant, positions 
along the attitude continuum. Although none 
of the criterion groups was thought of as 
representing a neutral position, two of the 
groups (III and IV) were assumed to be not 
strongly involved in the issue of race rela- 
tions and probably moderate rather than ex- 
treme in their own positions. Upshaw (1962), 
selecting from within a pool of “average white 
students” three subgroups identified as Pro, 
Neutral, and Anti on the basis of an inde- 
pendent self-report questionnaire, found that 
the three groups showed a regular pattern 
and differed significantly in their mean 
ratings of extremely pro, moderately pro, 
moderately anti, and extremely anti items. 
Thus, it would seem that the technique has 
even greater potentiality as an indirect at- 
titude measure than Hovland and Sherif 
suggested. There seems to be no real issue 
here, either of fact or theory. Hovland and 
Sherif’s view that ratings are influenced only 
in the case of subjects with extreme positions 
and strong involvement seems to represent 
simply an assumption; nothing in their theo- 
retical discussion seems to require it. Since 
their study compared extreme judges with a 
large group of “average white subjects” 
within which they did not attempt any dif- 
ferentiation, the assumption was not put to 
the test in their study. 


Nature of Items Susceptible to Displacement 


A second respect in which our findings 
require modification of the Hovland-Sherif 
hypotheses has to do with the nature of the 
items that are most susceptible to displace- 
ment. Hovland and Sherif reported that 
“neutral” items are most subject to displace- 
ment, and thus most discriminating if the 
rating technique is used as an attitude meas- 
ure. They describe the “neutral” items as 
unstructured, or ambiguous. Although we too 
found that, on the whole, items in the middle 
range of the scale are most subject to dis- 
placement, examination of the specific items 
shows a variety of content in the midscale 
items in the Hinckley pool. Some are clear 
statements of lack of interest in or knowledge 
about the position of Negroes; some are un- 
clear in their implications of favorableness or 
unfavorableness; others state a “separate but 
equal” position; still others are double- 
barreled, containing both a negative and a 
positive aspect. In the present study, the 
statements of lack of interest and those whose 
implications are unclear show little difference 
among groups of subjects with different at- 
titudes; all groups place these statements at 
or near the midpoint of the scale.9 The 
“separate but equal” statements and the 
double-barreled ones, on the other hand, show 
the greatest differences between median 
ratings by extreme groups and a regular order 
in ratings by five groups assumed to be in 
ordinal positions on the attitude continuum. 
In the present study, negative as well as 
intermediate items showed differences be- 
tween the median ratings by extreme groups 
and a regular order of ratings by the five 
criterion groups. The more extreme the state- 
ment, however, the less the difference between 
the extreme groups and the less regular the 
order among the five groups. Statements 
asserting that Negroes are subhuman and 
denying that Negroes have any rights that 
should be respected show no differences 
among groups, all rating them as extremely 
unfavorable. 

Findings by both Prothro (1955, 1957) 
and Upshaw (1962) are even more at vari- 
ance with those of Hovland and Sherif 
(1952) with respect to what items are most 
subject to displacement. Upshaw, using the 
same materials as Hovland and Sherif and a 
similar design (that is, comparing ratings 
of the Hinckley items by subjects classified 
as pro, neutral, or anti on the basis of an 
independent measure), found that only on 
the midscale items did the mean values as- 
signed by these three groups fail to show a 
regular order. Prothro, using different ma- 
terials and a different design, comparing 
ratings by two samples of strongly anti- 
Jewish Arab subjects of statements worded 
to refer to Jews and the same statements 
worded to refer to Arabs, found no consistent 
pattern for the midscale items. 


These differences concerning the suscepti- 
bility of midscale items to displacement seem 
to involve issues both of empirical findings 
and of theory. It seems possible that Hovland 
and Sherif’s statement that the items most 
susceptible to displacement are those that are 
neutral, ambiguous, or unstructured, was 
meant simply as a description of the mid- 
scale items. Examination of the items for 
which they present detailed data shows that, 
although the ones on which the greatest dis- 
placement occurred are in the middle range, 
they are not, strictly speaking, neutral, am- 
biguous, or unstructured; rather, they are 
endorsements of a “separate but equal” policy 
or statements that include both positive and 
negative elements—precisely the kinds of 
items that we found to be most susceptible 
to displacement. 


9 The fact that our instructions defined Category 6 
as the neutral position may have contributed to this 
result. 

The expectation that neutral, unstructured, 
and ambiguous items would show the greatest 
displacements is based in part on the assumed 
correspondence of these items to the mid- 
range stimuli in studies of psychophysical 
judgments and the typical finding of psycho- 
physical studies using the method of single 
stimuli that judgments of midrange stimuli 
show the greatest variability and the greatest 
shift with changes in context or in experi- 
mental procedures. It is consistent both with 
adaptation-level theory (Helson, 1948, 1959) 
and with anchoring theory (Rogers, 1941; 
Volkmann, 1951). However, it has been found 
that the ratings of clearly neutral statements 
about social policy are not affected either 
by context (Fehrer, 1952) or by raters’ at- 
titudes (Weiss, 1959). These investigators 
have pointed out that statements such as 
“There is as much to be said on one side as 
the other” have a definite meaning, derived 
from general language usage, which is not 
easily affected either by experimental pro- 
cedure or by individual point of view. 
Campbell, Lewis, and Hunt (1958) have 
suggested that what is crucial for the sus- 
ceptibility of midrange stimuli to displace- 
ment is not whether the stimuli are physical 
objects or verbal statements but whether the 
dimension on which they are to be judged 
has a qualitative shift at the midpoint (for 
example, sweet-salty, favorable-unfavorable) 
or is simply a continuum of “more” or “less” 
(for example, light-heavy, short-long). Shift- 
ing of judgments of intermediate items across 
the midpoint in response to changes in con- 
text or other influences is to be expected in 
the latter case, not in the former. 


Ratings of Items at Opposite Ends of the 
Continuum 


The final point on which the results of this 
study fail to confirm the Hovland-Sherif 
hypotheses—the patterns of ratings of items 
at opposite ends of the scale—may have 
important theoretical implications. Hovland 
and Sherif predicted that extreme judges at 
both ends of the scale tend to make fine dis- 
criminations among items at the end of the 
scale corresponding to their own positions and 
to lump together items at the opposite end 
of the scale. Specifically, they predicted that 
raters who are very favorable toward the 
group in question will place many items in 
the most unfavorable category and few in the 
most favorable, while raters who are very 
unfavorable will place many items in the most 
favorable category and few in the most un- 
favorable. From this it would follow that the 
expected pattern of differences in ratings 
between attitudinal subgroups would be con- 
sistent across the entire scale, or at least at 
the two ends of the scale; since favorable 
subjects would “downgrade” unfavorable 
items and unfavorable subjects would 
“upgrade” favorable items, it follows that 
both kinds of items would be rated lower 
(less favorable) by pro subjects than by 
anti’s. Again, this is what would be predicted 
on the basis of findings and theory with 
respect to psychophysical judgments. How- 
ever, no study of social judgments seems to 
have yielded results completely consistent 
with this prediction. 

The prediction has been generally sup- 
ported with respect to ratings of unfavorable 
items: subjects favorable to the group in 
question tend to displace unfavorable items 
toward the unfavorable end of the scale, 
placing many of them in the lowest category, 
and unfavorable subjects place relatively few 
items in this category (Hovland & Sherif, 
1952; Prothro, 1955, 1957; Upshaw, 1962; 
the present study). However, the prediction 
that subjects unfavorable to the group will 
displace items toward the favorable end of 
the scale, placing many of them in the most 
favorable category, and that favorable sub- 
jects will place relatively few items in this 
category, has not been supported. Hovland 
and Sherif themselves found only a slight 
tendency for their anti-Negro judges to con- 
centrate items at the favorable end of the 
scale; and Negro as well as anti-Negro sub- 
jects placed slightly more items in the highest 
category than did “average” or pro-Negro 
whites. Upshaw found that mean ratings of 
both moderately favorable and extremely fa- 
vorable items by his anti, neutral, and pro sub- 
jects followed an order opposite to that found 
for the unfavorable items; that is, the 
favorable items were rated as most favorable 
by the pro’s, least favorable by the anti’s. 
Supplementary analysis of these data shows 
that his pro’s placed more items in the ex- 
treme favorable category, as well as in the 
extreme unfavorable category, than the anti’s. 
Although Prothro reported simply that his 
findings failed to confirm Hovland and 
Sherif’s prediction that unfavorable subjects 
would displace favorable items toward the 
extreme favorable end of the scale, examina- 
tion of his published data shows a marked 
trend in the opposite direction. All of his 
17 favorable items were rated as more favor- 
able when applied to Arabs (the group toward 
which his subjects were favorable) than when 
applied to Jews (toward whom his subjects 
had extremely hostile attitudes); and more 
items were placed in both the extreme cate- 
gories when the items referred to Arabs than 
when they referred to Jews. In other words, 
both Prothro’s findings and Upshaw’s are 
clearly contrary to the Hovland-Sherif pre- 
diction about patterns of ratings of favorable 
items. In the present study, mean ratings of 
favorable items by five criterion groups 
showed no clear pattern; however, subjects 
in the three groups favorable to Negroes 
placed more items in the extreme favorable 
category, as well as in the extreme unfavor- 
able category, than did subjects in the two 
anti-Negro groups. In short, the general find- 
ing has been that favorable subjects place 
more items in the extreme categories at 
both ends of the scale than do unfavorable 
subjects. 

Several suggestions have been advanced to 
account for these findings with respect to the 
ratings of favorable items. One such sug- 
gestion has been that the subjects who have 
been labeled “anti” have not in fact had 
extremely unfavorable attitudes but have 
been only moderately unfavorable or even 
neutral. If this has been the case, the 
Hovland-Sherif hypothesis has not been ade- 
quately tested. Moreover, if this description 
of the anti’s is correct, the obtained results 
are consistent with the assimilation-contrast 
model specified in later work by Hovland 
and Sherif and their associates as the theory 
of judgment on which their predictions are 
based (Hovland, Harvey, & Sherif, 1957; 
Sherif & Hovland, 1961). In general, the 
assimilation-contrast model predicts that 
items close to the subject’s own position are 
displaced toward his position (assimilation), 
while items farther away are displaced away 
from the position (contrast). If the favor- 
able subjects are strongly favorable, the 
assimilation-contrast model would lead to 
the prediction that these subjects would 
“downgrade” unfavorable items by contrast 
with their own position and that they would 
either rate the favorable items objectively or 
“upgrade” the moderately favorable ones by 
assimilation toward their own extremely 
favorable position. As for the supposedly un- 
favorable subjects, this model would predict 
that if they are in fact only moderately un- 
favorable or even neutral, they would tend 
to “upgrade” the unfavorable items by 
assimilation to their own moderate position, 
perhaps to “downgrade” the moderately 
favorable items by assimilation to their own 
position, and to show little upgrading even 
of the extremely favorable items since these 
items might not be far enough from their 
own position to be subject to contrast ef- 
fects. Thus, both favorable and unfavorable 
items would be rated as more extreme by 
the (strongly) favorable subjects than by 
the (moderately) unfavorable subjects.10 

10 In a study of ratings of statements about fra- 
ternities, Manis (1960) found that both strongly pro 
and strongly anti subjects tended to rate both fa- 
vorable and unfavorable statements as more ex- 
treme than did neutral subjects—a result which he 
interpreted as consistent with an assimilation-con- 
trast model. This study is not discussed in detail 
here because it differed considerably from the group 
of studies discussed here both in content and in 
method. 


This assumption about the relative posi- 
tions of the pro and anti subjects seems 
tenable with respect to the subjects in the 
studies by Hovland and Sherif, Upshaw, and 
the present study. Hovland and Sherif, in 
describing their subjects, make it clear that 
their Negro subjects and their pro-Negro 
whites were extremely favorable and strongly 
involved, and state that they had difficulty 
in locating strongly anti-Negro subjects. In 
the Upshaw study, where subjects’ attitudes 
were measured by their responses to a scale 
made up of items from the Hinckley pool, 
all of the pro subjects endorsed the most 
favorable statements, whereas only about 
one-fifth of the anti subjects endorsed the 
most unfavorable items. In the present study, 
unfortunately, so many subjects misunder- 
stood the instructions to indicate their en- 
dorsement or rejection of the various cate- 
gories of items that these data were not 
analyzed. It seems possible, however, that 
subjects in Groups I and II, and perhaps in 
III, had strongly favorable attitudes, while 
those in Groups IV and V were neutral or 
moderate, rather than extremely unfavorable. 
This point is being checked in a study now 
under way, using subjects from comparable 
criterion groups. In summary, the assumption 
that the unfavorable subjects were not ex- 
treme in their attitudes while the favorable 
subjects were extreme seems tenable in the 
case of the studies dealing with attitudes 
toward Negroes, and thus the findings of 
these studies may be thought of as consistent 
with an assimilation-contrast model. It is dif- 
ficult, however, to account for Prothro’s find- 
ing on such grounds; his anti-Jewish subjects, 
far from being neutral, are described as 
intensely hostile. 

A second suggestion has focused on the 
position of the favorable subjects in relation 
to the range of items being judged. This 
approach proposes that the observed patterns 
of ratings of favorable items are consistent 
with the assimilation-contrast model if the fa- 
vorable subjects have positions at or just be- 
yond the most favorable items in the series. In 
this case, too, the assimilation-contrast model 
would predict that favorable subjects will 
“upgrade” the favorable items by assimilation 
toward their own position. However, a simple 
assimilation-contrast model would predict 
that unfavorable subjects will “upgrade” the 
favorable items by contrast with their own 
position; thus the ratings of favorable items 
by pro and anti subjects would presumably 
be similar, since both groups would be dis- 
placing them upward. One or more additional 
assumptions are needed to account for the 
observed findings in assimilation-contrast 
terms. Two such assumptions are available. 
One is the possibility suggested above—that 
in fact the position of the unfavorable sub- 
jects is not sufficiently unfavorable to lead 
to contrast effects with respect to favorable 
items. Another is that judges’ attitudes affect 
their ratings of items only when the judges’ 
own attitudinal positions are outside the 
item range. This assumption, combined with 
the assimilation-contrast model, would lead 
to the prediction that ratings by (out-of- 
range) favorable subjects will be affected by 
their attitudes—contrast effects on unfavor- 
able items, assimilation effects on favorable 
ones—while ratings by (within-range) un- 
favorable and neutral subjects will not be 
affected by their attitudes. 

The assumption that the position of the 
pro subjects is beyond the range of the item 
series being judged seems tenable with respect 
to the studies by Hovland and Sherif, 
Upshaw, and the present study; thus the 
findings of these studies with respect to dif- 
ferences between pro and anti subjects in 
ratings of favorable items may be thought 
of as consistent with a qualified assimilation- 
contrast model. Even with the qualifying 
assumptions, however, it is difficult to see 
how this model can account for the observed 
differences between ratings by “anti” and 
“neutral” subjects in the Upshaw study and 
in the present study. And again, there seems 
no basis for believing that subjects in the 
Prothro studies met the assumptions needed 
to make the findings consistent with an 
assimilation-contrast model. Nevertheless, the 
possibility that results in the studies using 
the Hinckley items (Hovland and Sherif, 
Upshaw, the present study) may have been 
affected by the fact that some or all of the 
favorable subjects hold positions outside the 
item range suggests the desirability of repli- 
cating the present study with a set of state- 
ments about Negroes designed to include the 
probable positions of all judges; such a study 
is now under way. 

A third suggestion to account for the 
unexpected reversal between pro and anti 
subjects in ratings of favorable items posits 
a quite different dynamic—that subjects who 
agree with an item tend to rate it as more 
favorable than those who disagree with it, 
whatever its general location on the scale. 
Prothro (1957), in discussing his findings, 
proposed a limited version of this hypothesis; 
he suggested that 


shifting occurs only when there is strong personal 
involvement or identification, and that the shifts in 
such cases may demonstrate either a contrast effect 
or an assimilation effect depending on whether the 
item fits one’s perception of his own group [p. 196]. 


Examination of his data, however, as well as 
that of the other studies discussed here, 
suggests that the qualification as to personal 
involvement and the restriction to perception 
of one’s own group are not necessary. The 
simple assumption that subjects who agree 
with a statement tend to see it as more 
favorable than those who disagree with it 
leads to the prediction that unfavorable sub- 
jects, who agree with unfavorable statements, 
will rate these statements as more favorable 
than will pro subjects, who disagree with 
the statements; at the other end of the scale, 
this assumption leads to the prediction that 
favorable subjects, who agree with the favor- 
able items, will rate them as more favorable 
than the anti subjects, who disagree with 
them. Thus, this hypothesis seems adequate 
to account for the major findings of all four 
of the studies discussed here—those by 
Hovland and Sherif, Upshaw, Prothro, and 
the present study. However, in view of the 
fact that this is an ad hoc hypothesis, not 
derived from any existing theory of judgment, 
it seems desirable to test it in a study spe- 
cifically designed for that purpose. Such a 
study, in which the same subjects, or sub- 
jects matched in relevant attitudes, rate two 
sets of items, one referring to a group toward 
which they are favorable, another toward 
which they are unfavorable, is planned. 


MARISA ZAVALLONI AND STUART W. COOK


REFERENCES 


CAMPBELL, D. T., LEWIS, NAN A., & HUNT, W. A.
Context effects with judgmental language that is
absolute, extensive, and extra-experimentally
anchored. Journal of Experimental Psychology,
1958, 55, 220-228.

COOK, S. W., & SELLTIZ, CLAIRE. A multiple-indicator
approach to attitude measurement. Psychological
Bulletin, 1964, 62, 36-55.

FEHRER, ELIZABETH. Shifts in scale values of attitude
statements as a function of the composition of
the scale. Journal of Experimental Psychology,
1952, 44, 179-188.

HELSON, H. Adaptation-level as a basis for a quantitative
theory of frames of reference. Psychological
Review, 1948, 55, 297-313.

HELSON, H. Adaptation level theory. In S. Koch
(Ed.), Psychology: A study of a science. Study I.
Conceptual and systematic. Vol. 1. Sensory, perceptual,
and physiological formulations. New
York: McGraw-Hill, 1959. Pp. 565-621.

HINCKLEY, E. D. The influence of individual opinion
on construction of an attitude scale. Journal of
Social Psychology, 1932, 3, 283-296.

HOVLAND, C. I., HARVEY, O. J., & SHERIF, M. Assimilation
and contrast effects in reactions to
communication and attitude change. Journal of
Abnormal and Social Psychology, 1957, 55, 244-252.

HOVLAND, C. I., & SHERIF, M. Judgmental phenomena
and scales of measurement: Item displacement
in Thurstone scales. Journal of Abnormal
and Social Psychology, 1952, 47, 822-832.

MANIS, M. The interpretation of opinion statements
as a function of recipient attitude. Journal of Abnormal
and Social Psychology, 1960, 60, 340-344.

PROTHRO, E. T. The effect of strong negative attitudes
on the placement of items in a Thurstone
scale. Journal of Social Psychology, 1955, 41,
11-17.

PROTHRO, E. T. Personal involvement and item
displacement on Thurstone scales. Journal of Social
Psychology, 1957, 45, 191-196.

ROGERS, S. The anchoring of absolute judgments.
Archives of Psychology, New York, 1941, No. 261.

SHERIF, M., & HOVLAND, C. I. Social judgment:
Assimilation and contrast effects in communication
and attitude change. Vol. 4. (Yale Studies in
Attitude and Communication) New Haven: Yale
Univer. Press, 1961.

THURSTONE, L. L., & CHAVE, E. J. The measurement
of attitude. Chicago: Univer. Chicago Press,
1929.

UPSHAW, H. S. Own attitude as an anchor in equal-appearing
intervals. Journal of Abnormal and
Social Psychology, 1962, 64, 85-96.

VOLKMANN, J. Scales of judgment and their implications
for social psychology. In J. H. Rohrer & M.
Sherif (Eds.), Social psychology at the crossroads.
New York: Harper, 1951. Pp. 273-294.

WEISS, W. The effects on opinions of a change in
scale judgments. Journal of Abnormal and Social
Psychology, 1959, 58, 329-334.


(Received July 3, 1963) 


Journal of Personality and Social Psychology
1965, Vol. 1, No. 1, 55-62


RETENTION OF ANXIETY MATERIAL AS A FUNCTION 
OF COGNITIVE DIFFERENTIATION 1


FRANK W. UHLMANN 2 
Detroit Edison Company, Michigan 


AND 


ELI SALTZ 


Wayne State University 


56 high differentiators and 50 low differentiators were selected from a group 
of 164 tested men. (a) Immediate and delayed recall of connected discourse 
was superior for high differentiators. (b) Retention by high differentiators was 
not affected by 3-hr. delay. Retention of anxiety material by low differentiators 
deteriorated over the 3 hr.; retention of nonanxiety material did not. (c) The drop in
retention of low differentiators over time was not related to intelligence.


The purpose of the present study was to 
investigate the retention of meaningful, connected
discourse as a function of two variables:
anxiety properties of the material to
be remembered, and individual differences in
the cognitive control variable of “differentiation.”

Saltz (1961) has suggested that differentia- 
tion of S-R systems, rather than association 
between stimuli and responses, is probably 
the critical factor in most human learning 
and retention of meaningful material. From 
this point of view, learning involves the de- 
velopment of boundaries around S-R systems 
which prevent intrusions from other systems. 

Evidence also suggests that stress breaks down
differentiation. For example, Saltz and Riach
(1961) found that shock produced a decrement
in discrimination performance. The decrements
were related to degree of preshock
differentiation, with high differentiators exhibiting
almost no decrements where low
differentiators showed marked decrements.
Finally, Saltz and Asdourian (1963), in an
animal study, found evidence that the breakdown
in differentiation with stress increases as
a function of time following shock. Animals
exhibiting low preshock differentiation showed
the most severe breakdown in postshock differentiation
as a function of time.


1 This paper is part of a dissertation by the senior
author submitted to the Department of Psychology, 
Wayne State University, in partial fulfillment of the 
requirements for the degree of Doctor of Philosophy. 
The study was supported in part by National 
Science Foundation Grant G 21476, in part by the 
Detroit Edison Company. 

2 Currently at General Motors Institute, Flint,
Michigan.

The studies cited above suggest the potential
utility of general tests for individual differences
in cognitive differentiation. Since
differentiation is here conceptualized as resistance
to interference, a relationship to recent
work by Klein (1954) and Witkin, Dyk, 
Faterson, Goodenough, and Karp (1962) is 
suggested. Klein’s flexible-constricted cognitive 
control style is measured by the Stroop Color-Word
Interference Test. This test involves
color names printed in incongruent colors
(for example, the word RED printed in green
ink). The subjects’ task is to name the color
of the word, not read the word, hence ignoring
the interference produced by the word.
Similarly, one of Witkin’s measures of field
dependence-independence is the Concealed
Figures Test. This test requires that subjects
be able to resist interference from irrelevant
stimuli in order to detect the concealed figures.

There is evidence that the type of differentiation
involved in the Concealed Figures
Test may be a measure of individual differences
in the type of differentiation relevant
to learning and retention. Gollin and Baron 
(1954) used a concealed figures test to 
measure nonsense syllable retention. Results 
indicated that subjects who were faster on 
concealed figures tended to be less susceptible 
to the interference in the retention task. 
Gardner and Long (1961) found that Witkin’s 
version of the concealed figures test was 
related to a recall and recognition measure 
in an interference situation. A study by Long 
(1962) showed that performance on con- 
cealed figures was a factor in verbal learning 
and recall. 

The present study used both color interference
and concealed figures as a joint index
of differentiation, since they appear to measure
different aspects of differentiation. Color-word
interference appears to measure a
tendency to resist habit interference, whereas
concealed figures appear to measure a tendency
to resist detection. High differentiators
were defined as those subjects who were above
the mean in correct performance on both
tests; low differentiators were subjects scoring
below the mean on both tests.

The retention task included both anxiety
and nonanxiety verbal material in an immediate
and 3-hour delay recall situation.
Reasoning from the Saltz studies, it was expected
that individuals who scored high on
the differentiation tasks should recall and
retain more verbal material than individuals
who scored low on the differentiation tasks.
If stress breaks down differentiation in the
manner suggested by Saltz and Riach (1961),
high differentiators should show relatively
little forgetting of the anxiety material over
the 3-hour retention period, as compared with
low differentiators. On the other hand, high
and low differentiators should show relatively
similar forgetting of the nonanxiety material.
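
The grouping rule stated above (above the mean on both tests, or below the mean on both) can be illustrated with a minimal sketch. The function name, the label strings, the sample scores, and the treatment of scores exactly at the mean are assumptions made for illustration; the article does not report how ties were handled.

```python
from statistics import mean

def classify_differentiators(color_word, concealed):
    """Label each subject 'high', 'low', or 'unclassified'.

    Assumption: a score exactly equal to a test mean counts as neither
    above nor below it, so such a subject is left unclassified.
    """
    cw_mean = mean(color_word)
    cf_mean = mean(concealed)
    labels = []
    for cw, cf in zip(color_word, concealed):
        if cw > cw_mean and cf > cf_mean:
            labels.append("high")          # above the mean on both tests
        elif cw < cw_mean and cf < cf_mean:
            labels.append("low")           # below the mean on both tests
        else:
            labels.append("unclassified")  # mixed pattern, not in either extreme group
    return labels

# Hypothetical scores for four subjects (not data from the study).
print(classify_differentiators([1, 2, 3, 4], [1, 4, 2, 5]))
```

Subjects with a mixed pattern fall into neither extreme group, which is consistent with only 106 of the 164 tested men being retained for the group comparisons.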


METHOD 
Subjects 


The subjects were 164 male substation operators 
at the Detroit Edison Company who voluntarily 
served in this research. Substation operators are a 
highly select group of men, who must satisfactorily 
pass a rigorous screening process and physical examination.
Each man undergoes a 3-year training
program before qualifying for the job classification.
The age range of those tested was from 23 to 64
with a mean age of 46.7.


Tests 


The four tests relevant to this research are de- 
scribed below. 

Concealed Figures Test. This test is a revision of 
Gottschaldt’s (1929) Embedded Figures Test, pre- 
pared by Thurstone and Jeffery (1959). Subjects 
are requested to “find” simple geometric figures 
in more complex figures. The test consists of 49 
items, with 196 embedded figures, and has a 10- 
minute time limit. 

Color-Word Interference Test. A group adapta- 
tion of Stroop’s (1935) test was devised by Uhl- 
mann (1962). The test consists of four subtests, 
each printed on a separate page, and each subtest 
has a 1-minute time limit. The first two subtests 
are primarily for warm up, and are not scored. The
relevant two subtests include: (a) four color names
(red, yellow, blue, and green) which were printed in
color and presented in random order, each name
appearing 25 times (10 lines containing 10 words);
the color and word were always incompatible (i.e.,
the word BLUE was printed in red, yellow, or green);
the subject was requested to print the first initial of
the word, disregarding the color, on a line below
the word. (b) Subtest 4 was identical to Subtest 3,
except that the subject had to print the first initial
of the color, disregarding the word itself. Scoring
was based on an interference measure between the
two subtests.

Incident Recall Test. A taped account of a build- 
ing explosion was presented to the subjects who 
were requested to write down whatever details they 
could immediately after hearing the tape. A 
similar recall was requested after a delay of 3 hours. 
The score on this test was determined by the 
number of anxiety and nonanxiety words recalled.
Prior to scoring, the words had been independently
classified by each of the writers into anxiety (for
example, blamed, explosion, blast) and nonanxiety
(for example, air, building, Monday) words. There
was 87% agreement between the writers in these
independent classifications. The final decisions were
made by the senior author. To facilitate relating
anxiety performance to nonanxiety performance,
the immediate and delayed anxiety recall scores were
combined and converted to T score form (M = 50,
SD = 10), as were the immediate and delayed
nonanxiety recall scores. 
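
The T score conversion described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the function name and sample scores are hypothetical, and the use of the population standard deviation of the pooled scores is an assumption, since the article does not say which form was used.

```python
from statistics import mean, pstdev

def pooled_t_scores(immediate, delayed):
    """Convert immediate and delayed raw recall scores to T scores.

    The two sets of scores are pooled, so both are expressed on one
    common scale with mean 50 and standard deviation 10.
    Assumption: the population SD (pstdev) of the pooled scores is used.
    """
    pooled = list(immediate) + list(delayed)
    m = mean(pooled)
    sd = pstdev(pooled)
    if sd == 0:
        raise ValueError("all scores are identical; T scores are undefined")

    def convert(x):
        return 50 + 10 * (x - m) / sd

    return [convert(x) for x in immediate], [convert(x) for x in delayed]

# Hypothetical raw word counts for three subjects (not data from the study).
imm_t, del_t = pooled_t_scores([12, 9, 15], [11, 7, 12])
print([round(t, 1) for t in imm_t], [round(t, 1) for t in del_t])
```

Because immediate and delayed scores share one scale, a drop from immediate to delayed recall remains visible after conversion, which would not be the case if each occasion were standardized separately.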

Wonderlic Personnel Test. A 50-item, 12-minute 
mental abilities test used in business and industrial 
situations.


Procedure 


Subjects were tested in groups of 3-20 as part of 
a larger in-service testing program conducted by the 
Detroit Edison Company. The testing began with the 
Wonderlic followed by the Concealed Figures Test. 
The taped incident was then presented and subjects 
were given as much time as necessary to complete 
the immediate recall (they rarely required more
than 7 minutes). Testing was resumed with the
Color-Word Interference Test after a 10-minute
coffee break. This was followed by paper-and-pencil
tests relevant to the in-service testing program but
not relevant to the present research. These tests
measured mechanical comprehension, reasoning ability,
etc. Following a 40-minute lunch break, and
3 hours after the immediate recall test, subjects 
were again requested to recall the building explosion 
incident. 


RESULTS 


A gross view of the interrelationships between
the variables involved in the study can
be obtained from examination of Table 1,
which indicates the Pearson product-moment
correlations between these variables.

TABLE 1
PEARSON CORRELATIONAL ANALYSIS

Test                                2            3            4            5            6            7            8
1. Concealed Figures              .36**        .19*         .26**    [illegible]  [illegible]      .56**       -.43**
2. Color recognition                           .18*         .15          .19*         .19*         .34**       -.29**
3. Immediate recall-anxiety                                 .67**        .80**        .62**    [illegible]     -.09
4. Immediate recall-nonanxiety                                       [illegible]      .88**        .30**       -.15
5. Delay recall-anxiety                                                               .61**    [illegible]     -.09
6. Delay recall-nonanxiety                                                                     [illegible]     -.20*
7. Wonderlic Personnel                                                                                         -.45**
8. Age

* Correlation of .16 required for significance at .05 level.
** Correlation of .21 required for significance at .01 level.


As demonstrated in Table 1, there is a
moderate correlation (r = .36) between the
two indices of cognitive differentiation. An
examination of the scatterplot for the two
variables suggested a nonlinear relationship
might exist. An eta analysis was conducted
(eta = .46); while the hypothesis of nonlinearity
could not be rejected, the eta correlation
suggests that the relationship may
be higher than indicated by the Pearson r.
These correlations must be viewed with caution,
however, since both differentiation measures
correlate moderately with intelligence
and age, and these latter could conceivably
account for the relationship between the two
differentiation indices.
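
The contrast between the Pearson r and the eta coefficient can be made concrete with a small sketch. Everything here is illustrative: the function names and data are hypothetical, and grouping by exact values of x is an assumption (the article does not describe how the score range was grouped for the eta analysis).

```python
from collections import defaultdict
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def correlation_ratio(x, y):
    """Eta of y on x: sqrt(between-group SS / total SS).

    Assumption: each distinct x value defines one group.
    """
    groups = defaultdict(list)
    for a, b in zip(x, y):
        groups[a].append(b)
    grand = sum(y) / len(y)
    ss_between = sum(len(v) * (sum(v) / len(v) - grand) ** 2
                     for v in groups.values())
    ss_total = sum((b - grand) ** 2 for b in y)
    return sqrt(ss_between / ss_total)

# Hypothetical U-shaped data: no linear trend, but y depends strongly on x.
x = [1, 1, 2, 2, 3, 3]
y = [1, 1, 0, 0, 1, 1]
print(pearson_r(x, y), correlation_ratio(x, y))
```

With the same grouping, eta is never smaller than the absolute value of r, so an eta noticeably above r is what a curved relationship would be expected to produce.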

Table 1 indicates relatively high stability 
for the incident recall test between immediate 
and delayed testing. These stability coefficients 
are probably somewhat inflated and should 
best be thought of as reflecting the upper 
bounds of retest reliability. This is due to the
probability that the delayed retention score is
a function of the individuals’ memory for 
their initial written recall, as well as their 
memory for the original taped incident record- 
ing. Formulations of retest reliability make 
the assumption that subjects do not recall 
answers given on the first test administration 
during the second test administration. 

Table 1 also indicates that both intelligence 
and age are related to the two cognitive 
style tests. The four incident recall measures 
all had significant relationships with the intel- 
ligence factor, and were slightly negatively 
related to the age variable. These significant 
correlations present some problems in interpreting
the test results. Additional analyses
will be presented to clarify the effects of 
these variables. 


Retention and Differentiation 


The 56 subjects who were above the mean
on both the Color-Word Interference Test
(M = 60) and the Concealed Figures Test
(M = 64) were identified as high differentiators;
the 50 subjects who were below the
mean on both tests were identified as low
differentiators. Figure 1 indicates the recall
performance of these two groups.


TABLE 2

ANALYSIS OF VARIANCE OF RECALL SCORES ON ANXIETY AND NONANXIETY MATERIAL
AS A FUNCTION OF DIFFERENTIATION AND TIME

                             Anxiety material              Nonanxiety material
Source                     df        MS        F          df        MS        F
Between subjects          105                             105
  Differentiation (A)       1    1,364.56    9.62***        1    1,455.18    8.49***
  Error (b)               104      145.01                 104      171.47
Within subjects           106                             106
  Time (B)                  1       81.95    4.00**         1       70.42    9.30***
  A X B                     1       62.84    3.07*          1        2.92     .39
  Error (w)               104       20.48                 104        7.57


[Figure 1: anxiety recall and nonanxiety recall of high and low differentiators, plotted against retention interval (hours).]

FIG. 1. Immediate and delayed recall as a function of individual differences in differentiation.

It had been predicted that on the anxiety
material, the high differentiators would show
performance superior to the low differentiators.
As can be seen from Figure 1, the data
support this prediction. The high differentiators
recalled significantly more of the anxiety
material than the low differentiators. The low
differentiators exhibited a greater deterioration
in recall as a function of time than did
the high differentiators. An analysis of variance
for this material is shown in Table 2.
The main effect of differentiation was significant
at the .005 level. The interaction effect (deterioration
as a function of time and differentiation) was
significant at the .08 level. Since the latter
difference was predicted, utilization of a
single-tailed test would place the interaction
at the .04 level.


FIG. 2. Immediate and delayed recall as a function of individual differences in differentiation at
two age levels.


For the nonanxiety material, it was again
predicted that the high differentiators would
show better recall than the low differentiators.
On the other hand, it was predicted that
low differentiators would not exhibit the relative
deterioration in recall on nonanxious material.
As can be seen in Figure 1, this prediction
was supported. The anxiety and nonanxiety
deterioration as a function of time
for the low differentiators is similar; however,
the deterioration was greater on the
anxiety material, and was nowhere near an
acceptable level of significance for the nonanxiety
material.
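
As a worked check on the within-subject entries of Table 2, each F ratio is the effect mean square divided by the within-subject error mean square. The sketch below reproduces the four within-subject ratios from the tabled mean squares; the dictionary layout and function name are illustrative assumptions, and no p values are computed.

```python
# Mean squares copied from the within-subject rows of Table 2.
TABLE_2_WITHIN = {
    "anxiety": {"error_ms": 20.48,
                "effects": {"Time (B)": 81.95, "A x B": 62.84}},
    "nonanxiety": {"error_ms": 7.57,
                   "effects": {"Time (B)": 70.42, "A x B": 2.92}},
}

def f_ratios(table):
    """Return F = MS(effect) / MS(error), rounded to two decimals."""
    return {
        material: {effect: round(ms / entry["error_ms"], 2)
                   for effect, ms in entry["effects"].items()}
        for material, entry in table.items()
    }

for material, ratios in f_ratios(TABLE_2_WITHIN).items():
    print(material, ratios)
```

The interaction ratio is clearly larger for the anxiety material than for the nonanxiety material, which is the pattern the text describes.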


Retention and Age


To control for the age factor, subjects in 
the two groups were equated for age. This 
resulted in a loss of 14 subjects from the
high differentiation group, resulting in a total
N = 92. Figure 2 illustrates the results of the 
equated young (under 47) and equated old 
(over 47) subjects in the two extreme groups. 

On the anxiety material, the young high 
differentiators actually show an increment in 
recall after the delay, though this increment 
is not significant. However, the superiority of 
the young high differentiators over the young 
low differentiators in both immediate and 
delayed recall of anxiety material is significant 
beyond the .05 level (F = 5.65, df = 1/41).
Also, in recall of the anxiety material the 
interaction between retention interval and 
differentiation is significant beyond the .05 
level (F = 4.00, df = 1/41) for the young 
subjects, with the low differentiators showing 
the predicted deterioration in recall as com- 
pared to the high differentiators. On the non- 
anxious materials, there are no differences 
between young high and low differentiators, 
despite the tendency for the high differentia- 
tors to recall more than the low. 

On the other hand, the old high differ- 
entiators did not differ significantly from the 
old low differentiators in either immediate or 
delayed recall. In brief, older subjects behave 
like low differentiators in recall despite high 
scores on the differentiation indices. 


Witkin, Lewis, Hertzman, Machover, Meiss- 
ner, and Wapner (1954) have shown that ad- 
vanced age induces deterioration on cognitive 
tasks for older subjects. The present study 
indicates similar conclusions. It appears that 
retention deteriorates more with age than 
does performance on the cognitive control 
tasks. 


Retention and Intelligence 


It could be hypothesized that differences in 
differentiation and recall could be due to 
intelligence, rather than cognitive style. Table 
3 illustrates the results of individuals scoring 
above and below the median on the intel- 
ligence measure and mean scores on the four 
incident recall measures. As can be seen from 
Table 3, no interaction occurs between intel- 
ligence and retention over time, though the 
high intelligence group learned and retained 
more than the low intelligence group. It ap- 
pears reasonable to conclude that the deteri- 
oration exhibited by the low differentiators 
on anxiety and nonanxiety material was not 
caused by intelligence. 
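
The absence of an interaction can be read off the Table 3 means by comparing the change from immediate to delayed recall in the two intelligence groups. The sketch below is purely descriptive and is not a significance test; the data structure and function name are illustrative assumptions, while the means are those printed in Table 3.

```python
# (immediate, delayed) mean T scores copied from Table 3.
TABLE_3 = {
    "High intelligence": {"anxiety": (53.7, 52.6), "nonanxiety": (52.9, 52.5)},
    "Low intelligence": {"anxiety": (46.9, 46.6), "nonanxiety": (47.5, 47.2)},
}

def recall_change(table):
    """Delayed minus immediate mean recall for each group and material."""
    return {
        group: {material: round(delayed - immediate, 1)
                for material, (immediate, delayed) in materials.items()}
        for group, materials in table.items()
    }

for group, changes in recall_change(TABLE_3).items():
    print(group, changes)
```

The low intelligence group shows no larger drop than the high intelligence group on either kind of material, which is the descriptive basis for the conclusion stated above.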


Analysis of Joint Index 


Following the main analysis of the data,
subanalyses were conducted to determine the
individual effects of the Color-Word Interference
Test and the Concealed Figures Test
on recall. A summary of these analyses is
presented in Table 4. The general trend of
the data is quite similar for these two tests.
In both cases, high test scores are associated
with better recall. This relationship is significant
for both anxiety and nonanxiety material
beyond the .01 level for the Concealed
Figures Test, but does not reach an
acceptable level of significance for either
anxiety or nonanxiety material in the case
of the Color-Word Interference Test. For
both tests, low test scores are associated with
greater deterioration in recall of anxiety material
as a function of delay. This relationship
as measured by the interaction between test
scores and delay is significant beyond the
.05 level for the Color-Word Interference
Test, but not for the Concealed Figures Test.


TABLE 3

MEAN RECALL OF ANXIETY AND NONANXIETY MATERIAL FOR
INDIVIDUALS SCORING ABOVE AND BELOW THE MEDIAN ON
THE WONDERLIC TEST

                              Anxiety material         Nonanxiety material
Group                  n     Immediate   Delayed      Immediate   Delayed
                              recall      recall       recall      recall
High intelligence     82       53.7        52.6         52.9        52.5
Low intelligence      82       46.9        46.6         47.5        47.2


TABLE 4

MEAN RECALL OF ANXIETY AND NONANXIETY MATERIAL FOR SUBJECTS ABOVE
AND BELOW THE MEDIAN ON THE COLOR-WORD INTERFERENCE TEST AND FOR
THOSE ABOVE AND BELOW THE MEDIAN ON THE CONCEALED FIGURES TEST

                              Anxiety material         Nonanxiety material
Group                  n     Immediate   Delayed      Immediate   Delayed
                              recall      recall       recall      recall
Color-Word Interference Test
  Above median        86       50.80       51.08        50.05       50.36
  Below median        76       49.83       48.09        49.79       48.66
Concealed Figures Test
  Above median        85       52.10       51.91        53.02       52.09
  Below median        79       48.45       47.23        47.70       46.83


DISCUSSION


One of the purposes of the present study
was to see if some of the cognitive tests that
intuitively looked like differentiation would
yield some of the functional relationships that
one would expect on the basis of a conceptualization
of how differentiation operates. The
results obtained from this research appear
consistent with this rationale. The present
study tested the relationship of differentiation
to anxiety and nonanxiety recall. Results
indicated that subjects exhibiting low
differentiation were more likely to deteriorate
on the recall of anxiety material as a function
of time than were subjects exhibiting high
differentiation. These results are consistent
with those of Saltz (1961) and Saltz and
Asdourian (1963), which indicated that differentiation
tends to break down as a result
of anxiety.

It should be noted that the recall material
used in this research was meaningful, connected
discourse. This contrasts with much of
the cited research on differentiation which has
used nonsense syllables and closely controlled
retention tests such as serial anticipation.
Results of the present study suggest that the
variables under consideration are relevant to
performance even in the looser free recall
situations more characteristic of everyday life.

The relationship of differentiation and in- 
telligence is of import theoretically. Good- 
enough and Karp (1961), Gardner, Holz- 
man, Klein, Linton, and Spence (1959) and 
others have found positive relationships between
cognitive and intelligence measures. In
the present study, the relationship between 
these two variables was systematically con- 
sidered. The results suggest that intelligence 
was not the factor producing deterioration 
of anxiety material over time, despite a sig- 
nificant positive relationship between the 
cognitive and intelligence variables. These 
results suggest that intelligence cannot simply 
be considered an aspect of differentiation.


ATTITUDE TOWARD THE COMMUNICATOR, PERCEPTION
OF COMMUNICATION CONTENT, AND
ATTITUDE CHANGE 1


HERBERT C. KELMAN AND ALICE H. EAGLY
University of Michigan


The relationship between attitude toward a communicator and perception of
the content of his message was explored in 2 experiments. In the 1st, a
marked tendency to misperceive the message of a negative communicator by
displacing it away from one’s own position (contrast) was found. In the
2nd, this tendency was shown to be a direct function of the strength of
negative feeling toward the communicator. Similarly, tendency to displace the
message of a positive communicator toward one’s own position (assimilation)
was shown to be a direct function of the strength of positive feeling. Amount of
attitude change relates consistently to the perception data. The results are
conceptualized in terms of 2 approaches to the communication situation: source
orientation and content orientation.


1 This investigation is part of a research program
on social influence and behavior change, supported
by Public Health Service Research Grant MH-07280
from the National Institute of Mental Health, United
States Public Health Service. We are very grateful
to Fritz Heider for his stimulating and helpful comments
on the data and interpretations presented in
this paper. An earlier version of this paper was read
at the Hovland Memorial Session on Attitude
Change during the meetings of the New England
Psychological Association, Boston, November 9, 1962.


According to Heider’s (1958) theory of
cognitive balance and related theoretical
formulations, there are strong forces to maintain
similarity of affect toward another person
and toward the things that are associated
with him: his actions, his possessions, his
friends. Thus, if P likes O, he also tends to
like what O does, or believes, or owns; and
if P dislikes O, he also tends to dislike whatever
is integrally linked with O. If there
is a discrepancy in affect (for example, P
might like O, but disapprove of O’s action),
forces toward restoring a balanced configuration
will come into play. If the action is
sufficiently ambiguous P may tend to misperceive
it, thus removing the disturbing
incongruity. An early experimental demonstration
of this phenomenon comes from a
study by Zillig (see Krech & Crutchfield,
1948), who trained a group of popular children
to give a poor performance of calisthenics
exercises (by introducing deliberate
errors), and a group of unpopular children
to give a good performance. When their
classmates were then asked to judge these
performances, they attributed the poor performance
to the unpopular children and the
good performance to the popular ones.
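
The balance notion in the paragraph above can be illustrated with a minimal sketch. It assumes a common formalization that is not stated in this article: each relation among P, O, and the linked element X is coded +1 (positive) or -1 (negative), and the configuration counts as balanced when the product of the three signs is positive. The function name and the coding are assumptions made for illustration.

```python
def is_balanced(p_to_o, o_to_x, p_to_x):
    """Return True when the P-O-X configuration is balanced.

    Assumed coding: +1 for a positive relation (likes, owns, performed),
    -1 for a negative one. Balance is taken to mean a positive product
    of the three signs.
    """
    for sign in (p_to_o, o_to_x, p_to_x):
        if sign not in (1, -1):
            raise ValueError("each relation must be coded +1 or -1")
    return p_to_o * o_to_x * p_to_x > 0

# P likes O, O performed the action, P disapproves of the action: unbalanced.
print(is_balanced(+1, +1, -1))
# P dislikes O, O performed the action, P disapproves of the action: balanced.
print(is_balanced(-1, +1, -1))
```

Under this coding, misperceiving an ambiguous action amounts to flipping the sign of P's evaluation of X so that the product becomes positive.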

A similar phenomenon should occur when
the element linked to O is his position on a
particular issue toward which P himself holds
an attitude. An unbalanced configuration
should result if a liked O takes a position
that is too much at variance with P’s own
stand, or if a disliked O takes a position
that is too similar to P’s own. If there is
enough ambiguity in the situation, we would
expect that this state of affairs might result
in a misperception or misunderstanding of
O’s position. Misperception could be called
assimilation when P displaces O’s position
toward his stand, and contrast when P displaces
O’s position away from his stand
(Hovland, Harvey, & Sherif, 1957). In the
case of a positive O (one who is attractive,
liked, trustworthy) the misperception should
take the form of assimilation; that is, P will
perceive O’s position to be closer to his own
than it actually is. In the case of a negative
O (one who is unattractive, disliked, untrustworthy)
this misperception should take
the form of contrast; that is, P will perceive
O’s position as further removed from his own
than it actually is. In either case, balance
would be restored since the position attributed
to O would be congruent with P’s
attitude toward him. Some interesting illustrations
of this phenomenon come from the
voting studies of Berelson, Lazarsfeld, and
McPhee (1958). They found that, during
an election campaign, committed partisans
tended to be inaccurate in their perceptions
of candidates’ stands on a number of
issues. Thus, partisans, whatever their own
stand, tended to perceive the stand of their
own party’s candidates on various issues as
close to their own stand and the stand of
opposition candidates as removed from their
own stand. It should be noted that Berelson
et al. found a negative relationship between
the extent to which such distortions occur
and the amount of relevant information to
which the individual has been exposed.
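
The definitions of assimilation and contrast given above can be expressed as a short sketch. It assumes, for illustration only, that P's own position, O's actual position, and the position P attributes to O can all be placed on a single numeric scale; the function name, labels, and sample values are hypothetical and do not come from the article.

```python
def classify_displacement(own, actual, perceived):
    """Classify P's perception of O's position relative to P's own stand.

    'assimilation': O is perceived as closer to P than O actually is.
    'contrast':     O is perceived as further from P than O actually is.
    'neither':      the perceived and actual distances from P are equal.
    """
    actual_gap = abs(actual - own)
    perceived_gap = abs(perceived - own)
    if perceived_gap < actual_gap:
        return "assimilation"
    if perceived_gap > actual_gap:
        return "contrast"
    return "neither"

# Hypothetical positions on an arbitrary 1-11 attitude scale.
print(classify_displacement(own=5, actual=8, perceived=6))
print(classify_displacement(own=5, actual=8, perceived=10))
```

On the argument of the text, the first outcome would be expected for a liked source and the second for a disliked one.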

It is reasonable to propose that processes 
of this kind may play a role in the reaction 
to persuasive communications. There should 
be at least a tendency for the audience to 
perceive communications as congruent with 
their attitude toward the communicator. They 
should see the message of a negative com- 
municator as more negative than it actually 
is (that is, further removed from their own 
preferred position), and the message of a
positive communicator as more positive than
it actually is (that is, closer to their own
position). To the extent that such displacements
occur, they can be expected to have
significant effects on the amount of attitude
change that is produced. In the case of a
negative communicator, misperception of his
message is likely to reduce even further his 
persuasive impact: the further the listener 
displaces the communicator’s position from 
his own, the more likely he is to dismiss the 
communication as irrelevant and outside of 
his range of acceptance. Moreover, such 
misperception of a negative communicator’s 
message will limit the occurrence of a change 
of attitude toward the communicator himself. 
The discovery by P that O agrees with him 
more than he had anticipated may be the 
beginning of a beautiful new relationship 
(Osgood & Tannenbaum, 1955)—but if P 
misperceives O’s message in such a way as to 
enhance their disagreement, such a restruc- 
turing is ruled out. In the case of a positive 
communicator, misperception may neutralize
the persuasive impact that he might otherwise
have. If the listener perceives the communicator’s
position as identical to his own,
he has effectively eliminated the challenge 
that a discrepancy in attitude would present. 

In view of these considerations, it is im- 
portant to explore the precise conditions 
under which such misperceptions of com- 
munication content are most likely to occur, 
and the conditions under which perception is
veridical despite the presence of cognitive
imbalance. Such an exploration should have
more general implications for what is probably
the major unresolved question in balance
theory and its derivatives: the question of
what determines the particular mechanism 
that a person chooses, out of the range of 
possibilities available to him in a given 
situation, in order to restore balance or reduce 
dissonance. We would assume that these dif- 
ferent mechanisms cannot always be sub- 
stituted for one another, even though they 
produce the same formal outcome. The like- 
lihood that one or another mechanism will 
come into play depends, to a great extent, 
on the structure of the situation in which the 
person finds himself. 

Communication studies in which attitudes 
toward the communicator were varied have 
yielded different findings with respect to the 
perception of communication content. In two 
related studies, carried out by Hovland and 
Weiss (1952) and by Kelman and Hovland 
(1953), trustworthy, “high credibility” com- 
municators were found to be clearly more 
effective in producing attitude change than 
untrustworthy, “low credibility” communi- 
cators. There was no evidence, however, that 
attitudes toward the communicator had any 
effect on the perception and comprehension
of communication content. Subjects showed
no tendency to misperceive the messages in
a way that would make them congruent with 
their attitude toward the source. Kelley and 
Woodruff (1956), however, found such tend- 
encies in a study in which they presented 
subjects with communications designed to 
change group-anchored attitudes. The taped 
communication was interrupted periodically 
by applause which, for half the subjects, was 
attributed to members of a relevant reference
group, and, for the other half, to anonymous
outsiders. Subjects in the former group
clearly exhibited a greater tendency to distort 
the communication—that is, to perceive it as 
being closer to the group norm and their own 
opinion than it actually was. In another 
study, Harvey, Kelley, and Shapiro (1957)
presented subjects with unfavorable evaluations
of themselves which deviated to varying
degrees from their own self-evaluations.
These evaluations were attributed either to 
acquaintances or to strangers. The results 
have to be described separately for the
low-discrepancy conditions (in which the other’s
evaluation deviated only slightly from the
subject’s self-evaluation) and for the
high-discrepancy conditions. In the former case,
they found a greater tendency to distort the 
communication in a favorable direction when 
it came from an acquaintance as compared 
to a stranger. In the latter case, the greatest 
amount of distortion occurred when the 
source was an acquaintance who was highly 
regarded. In a recent study, Manis (1961) 
attributed various communications to either 
high-prestige or low-prestige sources. He
found a tendency for subjects to displace
the views of the high-prestige sources toward
their own position. There was no consistent
tendency, however, to displace the views of
the low-prestige sources away from one’s own
position.

These various findings suggest some of the
conditions under which attitudes toward the
communicator are likely to affect the percep- 
tion of communication content. There is a 
need now to establish more precisely just 
what the crucial conditions are. In this paper 
we will present data from two experiments
that may contribute to an understanding of
these conditions. Experiment I produced a
serendipitous finding that clearly illustrates
a contrast effect in the perception of the
message of a negative communicator. Experiment
II explores the effect of the intensity,
as well as the direction, of attitude toward
the communicator on the perception of his
message. Using these findings as our starting
point, we shall try to develop a general
hypothesis about the conditions that favor
misperception in a congruent direction.

EXPERIMENT I


Experiment I was carried out in 1954 as 
a pilot study in connection with a different 
project, but it yielded some interesting findings
relevant to our present concern. The
subjects were 72 freshmen in a Negro college 
in Maryland. The time was shortly before 
the Supreme Court decision on desegregation 
in public schools. 


Procedure 


The experimenter, a Negro man, introduced him- 
self to each of three history classes as a representa- 
tive of the “Research Institute for Radio Education.” 
After telling the students that his agency was inter- 
ested in evaluating educational radio broadcasts, 
he played a tape-recorded communication designed 
to change the students’ attitudes on an issue relating 
to the impending court decision. A different version 
of the communication was presented to each of 
the three classes, but each version included essentially 
the following message: If the Supreme Court rules 
that segregation is unconstitutional, it would still 
be desirable to maintain some of the private Negro 
colleges as all-Negro institutions in order to preserve 
Negro culture, history, and tradition. Preliminary
testing indicated that a large majority of the sub- 
jects would not approve this message. 

The communications took the form of an interview
between a moderator and a guest. The presentation
of the guest constituted our experimental
manipulation. While the three communicators deliv- 
ered exactly the same message, they demonstrated 
marked differences in personality, background, and 
general attitudes. Their distinguishing characteristics 
emerged in the moderator’s introductory comments 
and in their own comments in the course of the 
interview.

One communicator, Mr. Sprague, was introduced
as a philanthropist, the president of the National 
Foundation for Small Colleges. From his remarks 
during the interview, it became evident that his 
foundation had been supporting Negro colleges and 
that he kept almost complete control over the 
foundation’s funds even though he knew nothing 
about education. He was presented as a pompous, 
paternalistic, authoritarian individual who was ready 
to use his power as it suited his ends. He indicated 
that he would have to withdraw his foundation’s 
support from Negro colleges if he found that Negro 
college students disagreed with his position. 

The second communicator, Prof. Lancaster, was 
introduced as a professor of social history at a 
leading university and an expert on the problems 
of minority groups. It became apparent that he 
was a highly respected scholar who had a great 
concern for problems of the American Negro. He 
was presented as a modest, humble man, objective 
and scholarly, with a broad background of knowledge.
He favored the maintenance of some private
all-Negro colleges on the basis of historical evidence,
suggesting that minority groups can survive only if
they have some educational institutions of their own.

The third communicator, Rev. Bryant, was introduced
as a Negro minister in a large city. A modest
person, he did not consider himself an expert on
the problems under discussion, but spoke as a
member of the Negro community. He emphasized
that Negroes, in order to foster group unity, ought
to put the interests of the Negro community as
a whole ahead of their more personal and limited
interests. He brought up the issue of the private
Negro colleges as an example of the kind of problem
for which Negro group unity and support were
important.

Thus, the first speaker was presented in a “negative
light,” that is, as uninformed, untrustworthy, and
generally unattractive, while the second and third
speakers were presented positively. A questionnaire
administered immediately after the program contained
questions on attitude toward the communicator
and on the comprehension of and agreement
with the content of the message.

TABLE 1
REACTIONS TO THE THREE COMMUNICATORS

                                                  Communicator
Description                             Mr. Sprague  Prof. Lancaster  Rev. Bryant
                                          (N = 22)      (N = 24)       (N = 26)

1. A man whose judgment I can trust.         68            29              8
2. An expert on Negro problems.              64            29             42
3. “My type of person.”                      82            33             19
4. A person whom I would want to be like.    77            33             27
5. A man who speaks mostly for a group,
   rather than himself.                      73            12              8
6. A man who speaks for a group of which
   I am a member.                            59             8             12

Note.—Figures represent the percentage of subjects in each
group who disagreed with the descriptions of the communicator.


Results 


Table 1 shows quite clearly that the attitudes
toward the communicators were different
for the three conditions. The table lists
six descriptive statements to which subjects 
responded on a 5-point scale, with positions 
ranging from “agree strongly” to “disagree 
strongly.” Next to each statement, the per- 
centage of subjects in each group who checked 
the “disagree strongly” or “disagree some- 
what” positions on the scale is given. As can 
be seen, Mr. Sprague, our “negative” speaker,
was consistently judged lower in trustworthiness,
expertness, general attractiveness, and
representativeness than the other two speak- 
ers. (On 11 out of the 12 comparisons, the 
differences are significant at the .03 level or 
better.) There were no significant differences 
between Prof. Lancaster and Rev. Bryant. 

Responses to an open-ended question ask- 
ing for comments on the program or the 
speaker gave further evidence of the negative
reaction to Mr. Sprague and of the relatively
positive reactions to the other two speakers.
Several subjects described Mr. Sprague as
prejudiced; others accused him of being self-centered
or conceited and of using his influence
to get what he wants. In general, reactions
to the open-ended question indicated
disagreement with Mr. Sprague’s message.
Most of the comments about Prof. Lancaster
and Rev. Bryant were favorable. Despite
their positive attitude toward these two
speakers, however, a large proportion of
subjects expressed disagreement with the
content of the main message these men
presented.

Table 2 presents data on the subjects’
perceived agreement with the positions of the
three speakers. The percentage of subjects in
each group who selected the “agree strongly”
or “agree somewhat” responses on a 5-point
scale is given. As can be seen, the majority
of subjects perceived themselves and their
reference groups as agreeing with Prof.
Lancaster and Rev. Bryant, but not with Mr.
Sprague—although his central message was
identical to that of the other speakers. The
differences between Mr. Sprague and the
other speakers are significant at the .03 level
or less, while those between Prof. Lancaster
and Rev. Bryant are not significant.

TABLE 2
PERCEIVED AGREEMENT WITH THE THREE COMMUNICATORS

                                                  Communicator
Question                                Mr. Sprague  Prof. Lancaster  Rev. Bryant
                                          (N = 22)      (N = 24)       (N = 26)

1. In general did you agree with what
   was said?                                 23            58         [illegible]
2. Do you think most Negro college
   students would agree with the
   speaker’s position?                       18            54             13
3. Do you think most members of the
   Negro community would agree with
   his position?                             14            55             77

Note.—Figures represent the percentage of subjects in each
group who answered affirmatively.

In part, the subjects’ disagreement with
the negative speaker’s message probably reflects
their general disapproval of this speaker
and everything he stands for, rather than
disagreement with the specific content of what
he said. It may also be, however, that the
subjects disagreed with his message to a
greater extent because they misperceived his
position in such a way as to displace it away
from their own position. Table 3 indicates
that this is precisely what happened. Subjects
were asked which of three statements came
closest to the position of the speaker. Of
the subjects who had listened to the negative
speaker, 41% attributed to him a more segregationist
position than the one he had
actually taken. None of the subjects in the
other two groups checked this particular response.
This difference (41% versus 0%) is
significant at less than the .01 level. Thus,
we have a clear demonstration of a contrast
effect produced by a negative speaker. Subjects
perceived his position as further removed
from their own than it actually was.
It is interesting that the parallel
phenomenon—assimilation of the positive speakers’
messages—did not manifest itself. All of the subjects
who had listened to the professor and
almost all of the subjects who had listened
to the minister perceived the message
veridically—even though there are indications that
many of them did not accept it.

TABLE 3
COMPREHENSION OF THE MESSAGE

                                                  Communicator
Statement                               Mr. Sprague  Prof. Lancaster  Rev. Bryant
                                          (N = 22)      (N = 24)       (N = 26)

1. All Negro schools should become
   interracial.                               9             0              4
2. Some Negro schools should remain
   all-Negro.                                50           100             92
3. All Negro schools should remain
   all-Negro.                                41             0              0
No answer                                     0             0              4
Total                                       100           100            100

Note.—Figures represent the percentage of subjects in each
group who checked the statements as coming closest to the
position of the speaker.

We can only speculate about the reasons
why the message of the negative communicator—and
only of the negative communicator—was
misperceived so dramatically in this
particular situation. It would seem that the
negative speaker in this situation aroused
particularly strong feelings in the subjects—
presumably more negative, if it were possible
to measure this, than the feelings toward
the positive communicators were positive.
The strength of feeling here cannot be understood
simply in terms of the negative characteristics
of the communicator, but depends
to an important degree on the context within 
which these characteristics are revealed. Mr. 
Sprague aroused such strong feelings not 
simply because he was pompous, authori- 
tarian, and manipulative, but because he 
manifested these characteristics with respect 
to the issue of segregation—an issue that was 
highly central and ego involving for our
subjects. In this context, Mr. Sprague was 
not simply another unpleasant, objectionable 
individual. He was readily identified with that 
class of white men who, in the guise of 
philanthropy, perpetuate the segregation of 
the races, who will do things for the Negro 
provided he keeps his place and does not
insist on entering into white society. Any 
position that this man favored was immedi- 
ately suspect and the subjects would want 
to disassociate themselves from it. 

We would propose that, in a situation in 
which strong feelings toward the communi- 
cator are aroused, the tendency to perceive 
communication content in line with one’s 
attitude to the communicator should be par- 
ticularly strong. In the case of a negative 
communicator, such as Mr. Sprague, this
tendency should take the form of contrast. 
Positive communicators, of course, may also 
arouse strong feelings. In that case, the tend- 
ency to perceive communication content in 
line with one’s attitude to the source should 
take the form of assimilation. In the present
experiment, however, the positive communicators
apparently did not arouse such strong
feelings, and hence their messages were per- 
ceived veridically even though they were 
unacceptable to the subjects. We shall explore 
the possible reasons for this finding after 
presenting a second, more recent experiment 
on this problem. 


EXPERIMENT II 


In this experiment subjects were exposed 
to communications from a positive and a 
negative communicator. It was possible to
classify subjects in terms of the strength 
of feeling aroused by each of the communi- 
cators and to observe the effect of this vari- 
able on their perception of communication 
content, as well as on attitude change. 
Experiment II thus allows us to go beyond 
Experiment I by demonstrating the effects of 
variations in strength of feeling, toward both 
a positive and a negative communicator, on 
perception of communication content; and, 
furthermore, by examining the amount of 
attitude change under these same conditions. 

With respect to the perception of com- 
munication content, we would predict that, 
as the strength of feeling aroused by the 
communicator increases, the tendency to dis- 
place or misperceive his position will in- 
crease. In the case of the positive com- 
municator, this displacement will be toward 
the subject’s own position—that is, take the 
form of assimilation. In the case of the 
negative communicator, it should be away 
from the subject’s own position—that is, take 
the form of contrast. 

These effects should show a systematic 
relationship to the amount of attitude change. 
In predicting attitude change as a function of 
strength of feeling toward the communicator, 
we must consider both a direct and an in- 
direct effect of this variable. The direct effect 
refers to the forces toward change induced 
by the communicator. This effect should be 
directly related to the strength of positive 
feeling, and inversely related to the strength 
of negative feeling. The indirect effect of 
attitude toward the communicator on attitude 
change refers to the effect of this variable 
on perception of communication content, 
which in turn may affect attitude change.
This indirect effect of strength of feeling
should be inversely related to the amount of
attitude change, regardless of whether the
feeling is positive or negative. That is, the
stronger the feeling, the greater the tendency
to displace the message—toward one’s own
position if the feeling is positive, away from
one’s self if the feeling is negative. In either
case, the greater the displacement, the less
likely the occurrence of attitude change. If
the person sees the message as further away
from his own position than it actually is,
then he is in essence pushing it into a zone 
where it no longer requires his serious atten- 
tion: it is no longer within his own range of 
potential acceptability. If the person sees the 
message as closer to his own position than 
it actually is, then he has successfully avoided 
its challenge: since he is not aware that it 
differs from his own position, there is pre- 
sumably no call for attitude change. Thus, the 
greater the strength of feeling—positive or 
negative—the less the likelihood of attitude 
change. 

According to this reasoning, the direct 
effect and the indirect effect should be 
mutually reinforcing in the case of the nega- 
tive communicator. Both would tend to reduce 
the probability of attitude change, as strength 
of feeling increases. In the case of the positive 
communicator, however, the two effects push 
in opposite directions. The greater the strength 
of feeling, the stronger the forces toward 
change induced by the communicator. At the 
same time, however, the greater the strength 
of positive feeling, the greater the tendency 
to displace the position of the communicator 
toward one’s own and thus avoid the necessity 
of changing altogether. Thus, it becomes 
difficult to predict the effect of strength of 
feeling toward a positive communicator on 
attitude change. If the direct and indirect 
effects are more or less equal, then they 
should cancel each other out and yield a 
curve that is essentially flat. 


Procedure 


Subjects. The subjects in this experiment were 
151 high school students, comprising six sections of 
a history course. 

Communications. Each of the six classes listened 
to two tape-recorded communications on juvenile 
delinquency, involving interviews between a moderator
and a guest (the communicator).² Both speakers
took the same general position—that is, that there
is a need for more clear-cut standards for the
behavior of teen-agers. In one communication 
(the “laws message”) this general point was 
brought out with respect to the laws that govern 
the behavior of teen-agers. The laws were claimed 
to be confusing because they made “conflicting 
demands” and failed to set up definite guides for
behavior. In the other communication (the 
“parents message”) the central theme was that 
teen-agers are not provided with adequate standards
because of a lack of meaningful contact between 
parents and children. In a preliminary study, we 
found that most subjects were either neutral or 
mildly negative with respect to the positions represented
by these two communications. This was
again confirmed on the before-test of the main
experiment. The communications were straightforward,
but sufficiently ambiguous so that it was
possible to interpret the position presented as more
restrictive than it actually was—a position rejected 
by most subjects; or as more appreciative of the 
good judgment of teen-agers than it actually was—a 
position favored by most subjects. In other words, 
there was room for displacement in both directions. 
Each of the two communications was preceded 
by an introductory exchange between the moderator 
and the guest, in which the personality, background, 
and general attitudes of the communicator were 
revealed. There were two versions of this introduction:
one in which the communicator projected a
positive image, to use the modern jargon, and one
in which he projected a negative image. Combinations
of introduction and message were counterbalanced,
so that three classes heard the positive
communicator give the laws message and the nega- 
tive communicator give the parents message; and the 
other three classes heard the positive communicator 
give the parents message, and the negative communicator
give the laws message.

The positive communicator was presented as an
individual who would be personally attractive to a
teen-age group and at the same time a highly
credible source of information on delinquency. He
was director of a boys’ camp, member of a civic
committee investigating juvenile delinquency, himself
a father, but young enough to be able to identify
with teen-agers. It was apparent that he had a
sympathetic understanding of the problems of this
age group. He projected the image of a knowledgeable
friend to the teen-ager.


² Three variations in introducing the experiment
were employed in order to induce differences in set.
One variation was designed to create a source-oriented
set, a second to create a content-oriented
set, and a third to create a “neutral” set. Each
variation was administered to two classes. Analysis
of responses to a postexperimental questionnaire
indicated that these manipulations of set were unsuccessful
(probably because of the brevity and
timing of the set-inducing statements). They will
be disregarded in the present report.

By contrast, the negative communicator projected
the image of an ignorant enemy. He was an un- 
pleasant person and also completely incompetent 
to discuss juvenile delinquency. He was a retired 
accountant, and an active member of a historical 
society (a familiar type in Massachusetts, where 
this study was done). He was stuffy and extremely
conservative, if not outright reactionary. He also 
turned out to be a bachelor (presumably because no 
woman would have him) and his only acquaint- 
ance with teen-agers derived from seeing them 
play baseball on his way to the historical society. 

Reactions to communicators. After listening to 
the communications, the subjects responded to a
series of questions concerning their reactions to
the communicators, their perceptions of communication
content, and their attitudes toward the issues
discussed in the communications. The items assessing
reaction to the communicators consisted of 12
descriptive statements, referring either to the attractiveness
or to the credibility of each communicator.
Six of these statements were positive, and 6
were negative. Subjects were instructed to check the 
statements that described each communicator. There 
was no restriction on the number of items they 
could choose. 

In addition to providing information about the 
success of our manipulation of the communicators’ 
characteristics (see Results), responses to these items 
served as a basis for classifying subjects in terms of
strength of feeling toward each communicator. A
subject could check from 0 to 6 positive items and
from 0 to 6 negative items as descriptive of each
communicator. For each subject’s response to each 
communicator, the number of negative items checked 
was subtracted from the number of positive items 
checked. Thus, a resultant tendency to perceive 
the communicator favorably or unfavorably was 
obtained, with a range of possible scores from 
+6 to -6. For each message, then, we were able
to divide the subjects into a group that was strongly
positive toward the positive communicator (subjects
with a score of +5 or +6), another that was
moderately positive (subjects with a score of +3 
or +4), and a third that was mildly positive (sub- 
jects with a score of +1 or +2). Similarly, we 
obtained a group that was strongly negative 
toward the negative communicator (subjects with a 
score of -5 or -6), another that was moderately
negative (subjects with a score of -3 or -4), and
a third that was mildly negative (subjects with a
score of -1 or -2). For each of these groups, we
can examine misperception of communication con- 
tent, as well as attitude change. 
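
The classification rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names feeling_score and classify_strength are hypothetical, and the treatment of a score of zero, which the text does not discuss, is an assumption made here for completeness.

```python
from typing import Optional


def feeling_score(positive_checked: int, negative_checked: int) -> int:
    """Positive items checked minus negative items checked (range +6 to -6)."""
    for n in (positive_checked, negative_checked):
        if not 0 <= n <= 6:
            raise ValueError("each count must lie between 0 and 6")
    return positive_checked - negative_checked


def classify_strength(score: int) -> Optional[str]:
    """Map a resultant score onto the strength groups named in the text."""
    if not -6 <= score <= 6:
        raise ValueError("score must lie between -6 and +6")
    if score == 0:
        return None  # assumption: a zero score falls into no group
    magnitude = abs(score)
    if magnitude >= 5:
        strength = "strongly"
    elif magnitude >= 3:
        strength = "moderately"
    else:
        strength = "mildly"
    direction = "positive" if score > 0 else "negative"
    return f"{strength} {direction}"
```

For instance, a subject who checks five positive items and one negative item receives a score of +4 and falls in the moderately positive group.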

Measures of dependent variables. To measure 
perception of communication content, the following 
procedure was used. At the end of the communica- 
tion session, subjects were presented with six sets 
of statements—three for each message—and asked, 
in each case, to pick the statement that represented 
the communicator’s position. For example, one set of
statements read as follows:

According to the camp director, Mr. Hartley,
what should the lawmakers take for granted when
they write laws about teen-agers?³

(a) They should take for granted that 95% of
all teen-agers are potential delinquents.

(b) They should take for granted that 95% of
all teen-agers need to be shown what is right and
wrong.

(c) They should take for granted that 95% of
all teen-agers want to behave in the right way.


Statement b in the above set represents the correct 
answer. A statement very similar to b—“The law- 
makers should take for granted that 95% of all 
teen-agers need definite guidance”—was, in fact, 
embedded in the laws message. In addition to the 
correct alternative, each set of statements con- 
tained two kinds of incorrect alternatives: statements 
that were closer to the subjects’ own position than 
the correct statement, and statements that were 
further removed from their own position than the 
correct statement. Thus, in the example above, 
Statement a represents the alternative less acceptable 
than the correct one, and Statement c represents 
the more acceptable alternative. 

The incorrect alternatives were developed on the 
basis of a preliminary study with comparable 
subjects, who were asked to indicate their degree of 
agreement with a whole series of items relating to 
juvenile delinquency. Using the information ob- 
tained in the preliminary study, we were able 
to select items that varied in their degree of ac- 
ceptability to our subjects. These items were included 
in a before-questionnaire, administered to the 
subjects a month before the communication session. 
For each item, subjects were asked to check 1 of 6 
responses, ranging from “agree strongly” to “disagree 
strongly.” Scores on each item ranged from 7 to 1, 
omitting 4 since no neutral position was provided. 
Returning to the example given above, we find that
the mean agreement score for Statement a in the
before-questionnaire was 1.83; the mean score for
Statement b was 2.73, and the mean score for
Statement c was 5.80. Thus, as can be seen, the
correct statement (Statement b) was intermediate
in acceptability. Statement a was further removed
from the subjects’ own position than the correct
statement—thus allowing for the manifestation of
a contrast effect. Statement c was much closer to
the subjects’ own position than the correct state- 
ment—thus allowing for the manifestation of an 
assimilation effect. 

³ In the groups that heard the negative communicator
deliver the laws message, the question
began with “According to the accountant, Mr.
Morgan . . . .”

Each set of statements used in the measure of
perception of communication content included items
varying in acceptability in the same way as the
items of the above example. That is, each set included
one correct statement, of intermediate acceptability;
one or two incorrect statements low in
acceptability; and one or two incorrect statements
high in acceptability. Since there were three sets of
statements for each message, the subjects had three 
opportunities to displace towards or away from their 
own position in judging the communication con- 
tent. Thus, for each message, we obtained an as- 
similation score with a possible range from 0 to 3, 
and a contrast score with a possible range from 
0 to 3. 
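
The two displacement scores described above amount to simple counts over a subject's three choices for a message. The sketch below illustrates this; the coding of each choice as "correct", "high" (an incorrect statement high in acceptability), or "low" (an incorrect statement low in acceptability) and the function name displacement_scores are hypothetical conventions introduced for the example, not part of the original procedure.

```python
from collections import Counter

# Hypothetical codes for the statement a subject picked in each set.
VALID_CODES = {"correct", "high", "low"}


def displacement_scores(choices):
    """Return (assimilation, contrast) for one message.

    choices holds three codes, one per statement set. Picking an incorrect
    statement high in acceptability counts toward assimilation; picking an
    incorrect statement low in acceptability counts toward contrast.
    """
    if len(choices) != 3:
        raise ValueError("each message has exactly three statement sets")
    if not set(choices) <= VALID_CODES:
        raise ValueError("choices must be 'correct', 'high', or 'low'")
    counts = Counter(choices)
    return counts["high"], counts["low"]
```

A subject who chose one high-acceptability distractor, one correct statement, and one low-acceptability distractor would thus receive an assimilation score of 1 and a contrast score of 1.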


Repetition of some of the before-questionnaire
items on the questionnaire administered immediately
after the communications yielded a measure of
attitude change. The items chosen for repetition on
the after-questionnaire were identical to the correct
alternatives used in the measure of perception of
communication content. Thus, for example, one of
the items used to measure attitude change read as
follows: “The lawmakers should take for granted
that 95% of all teen-agers need to be shown what
is right and wrong.” As will be noted, this is
equivalent to Statement b in the example from the
perception measure given above. Five items were
repeated in this manner; two of these pertained to
the laws message, two to the parents message, and
one to both messages. Two attitude-change scores
were obtained for each subject: one score was based
on the two laws message items and the other on
the two parents message items. Thus, for each
message (which also means, of course, for each
communication—positive and negative) we obtained
a measure of attitude change based on the same
items that entered into the measure of perception
of communication content.
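
A small sketch may help make the item scoring concrete. The scale values (7 to 1, omitting 4) come from the description of the before-questionnaire; the rule for combining items into a change score is not spelled out in the text, so the sum of after-minus-before differences used here is an assumption, as are the names RESPONSE_SCORES, item_score, and change_score.

```python
# Scores attached to the six response positions, from "agree strongly" (7)
# to "disagree strongly" (1); 4 is skipped because no neutral option existed.
RESPONSE_SCORES = (7, 6, 5, 3, 2, 1)


def item_score(position: int) -> int:
    """Score for a response position: 0 = agree strongly, 5 = disagree strongly."""
    if not 0 <= position < len(RESPONSE_SCORES):
        raise ValueError("position must lie between 0 and 5")
    return RESPONSE_SCORES[position]


def change_score(before_positions, after_positions) -> int:
    """Assumed rule: sum of (after - before) item scores for one message."""
    if len(before_positions) != len(after_positions):
        raise ValueError("before and after responses must pair up")
    return sum(
        item_score(after) - item_score(before)
        for before, after in zip(before_positions, after_positions)
    )
```

Under this assumed rule, a positive change score means movement toward agreement with the communicator's position on the two items for that message.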


Results 


Responses to the items measuring reactions 
to the communicators confirmed the success of 
our manipulation of communicator character- 
istics. Thus, a much larger percentage of sub- 
jects attributed each of the positive qualities 
to the positive communicator than to the 
negative communicator. Similarly, a larger 
percentage of subjects attributed each of the
negative qualities to the negative communicator
than to the positive communicator. These
results can be found in Table 4.

The upper portion of Table 5 presents the
data for displacement of the positive speaker’s
message. The figures represent amount of
displacement toward the subjects’ own position—in
other words, the amount of assimilation
(mean number of incorrect items high
in acceptability chosen, with a possible range
from 0 to 3). For both messages, as anticipated,
there is a direct relationship between
strength of positive feeling and amount of
displacement toward one’s own position. In
each case, the ordering of the three means is
exactly according to hypothesis—that is, the
strongly positive subjects show the greatest
amount of displacement, and the mildly positive
subjects show the least. The joint probability
of this precise ordering in each of two
samples is 1/36 or <.03. The weighted mean
difference between the strongly positive and
the mildly positive subjects from both samples
combined (cf. Snedecor, 1946, p. 290) is of
borderline significance (t = 1.47, p < .08, one-tailed
test).
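
The figure of 1/36 can be reproduced with a short calculation. The sketch below assumes, as the text implicitly does, that under the null hypothesis all six orderings of three means are equally likely and that the two samples are independent; the variable and function names are illustrative only, and the means are those reported in Table 5.

```python
from fractions import Fraction
from itertools import permutations

# Assimilation means from Table 5, ordered strongly, moderately, mildly positive.
laws_means = [1.10, 1.00, 0.80]
parents_means = [1.27, 1.12, 0.93]


def matches_prediction(means):
    """True when the means decrease strictly from strong to mild feeling."""
    return all(a > b for a, b in zip(means, means[1:]))


# Exactly one of the 3! = 6 possible rank orders matches the prediction.
orders = list(permutations([1, 2, 3]))
p_single = Fraction(sum(1 for o in orders if o == (3, 2, 1)), len(orders))

# Joint probability of the predicted order in both samples, assuming independence.
p_joint = p_single * p_single
```

Here p_single is 1/6 and p_joint is 1/36, or roughly .028, which is the value reported as <.03.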

The lower portion of Table 5 presents the 
comparable data for the negative communicator.
The figures here represent amount of
displacement away from the subjects’ own 
position—in other words, the amount of con- 
trast. The general trend is again consistent 
with our expectation, although there is one 
slight reversal for the parents message. How- 
ever, the weighted mean difference between 
the extremes—the strongly negative and the 
mildly negative subjects—for both samples 
combined is clearly significant (¢ = 2.18, 
$ < .02, one-tailed test). 

In addition, tendencies to contrast the 
positive communicator’s message (displace it 


TABLE 4
REACTIONS TO THE COMMUNICATORS

                                                           Communicator
Description                                             Positive   Negative

Positive statements
  1. A person who represents my general point of view.     70         21
  2. A likable person.                                     83         33
  3. The kind of person I would like to be.                43          6
  4. A well-informed person.                               87         25
  5. An expert on juvenile delinquency.                    48          9
  6. A person whose statements can be trusted.             70         22

Negative statements
  1. An unpleasant person.                                  2         34
  2. A person I would not want to know.                     3         40
  3. A person who stands for things I cannot accept.        5         30
  4. A person who does not know what he is talking about.   3         45
  5. A person whose judgment I cannot trust.                3         38
  6. A person whose opinions are biased and one-sided.  [illegible] [illegible]

Note. Figures represent the percentage of subjects who
checked the descriptions for each communicator (N = 151).

TABLE 5
ASSIMILATION AND CONTRAST

                              Laws message     Parents message
Strength of attitude
toward communicator             N      M          N      M

Congruent displacement of the positive communicator’s message (assimilation)
  Strongly positive            40    1.10        26    1.27
  Moderately positive          23    1.00        24    1.12
  Mildly positive              10    0.80        15    0.93

Congruent displacement of the negative communicator’s message (contrast)
  Strongly negative             9    0.44        29    1.00
  Moderately negative          10    0.40         8    0.38
  Mildly negative              16    0.12        12    0.50


away from own position) and to assimilate 
the negative communicator’s message (dis- 
place it toward own position) were measured. 
The amount of contrast of the positive com- 
municator’s message was negligible and 
showed no relationship to strength of feeling. 
The amount of assimilation of the negative 
communicator’s message was appreciable, but 
also showed no consistent relationship to 
strength of feeling. (None of the differences 
in amount of assimilation between strength 
of feeling categories was significant.) The 
finding of a general tendency to assimilate 
even the negative communicator’s message 
suggests an underlying “autistic” factor in 
perception. That is, people may be pre- 
disposed to displace stimulus input toward 
their own attitudes, especially if the stimuli 
are relatively ambiguous. Tendencies to as- 
similate a positive communicator’s message 
and to contrast a negative communicator’s 
message may, then, be “superimposed” on this 
underlying autistic tendency.⁴

From the main findings on displacement, 
as presented in Table 5, it seems fair to 


4 Our measures allowed us to tap both directions 
of displacement in relation to each communicator. 
A unitary interpretation measure, such as that used 
by Manis (1961), would indicate the resultant of 
the two displacements. This difference in measure-
ment procedures may account for the absence of
contrast of the negative communicator’s message 
in the Manis study. 




TABLE 6
ATTITUDE CHANGE

                              Laws message     Parents message
Strength of attitude
toward communicator             N      M          N      M

Attitude change under the positive communicator
  Strongly positive            31    1.40        18    0.50
  Moderately positive          16    1.56        17   -0.14
  Mildly positive               8    1.44        11    0.73

Attitude change under the negative communicator
  Strongly negative             5   -0.60        22    0.30
  Moderately negative           9    0.72         5    0.40
  Mildly negative              10    1.00        10    1.70


conclude that—for both the positive and the 
negative communicator—there is a direct 
relationship between strength of feeling and 
the tendency to perceive communication con- 
tent in such a way as to yield a cognitively 
balanced configuration. The data on the 
relationship between strength of feeling and 
attitude change are presented in Table 6. 

As will be recalled, in the case of the nega-
tive communicator, we expected the direct
and indirect effects of strength of feeling to 
reinforce each other. That is, the greater the 
strength of feeling, the greater the tendency 
to reject the communicator’s message, as well 
as to misperceive it—both of which would 
militate against change. Combining the two 
effects should, therefore, yield an inverse
relationship between strength of feeling and 
attitude change. And, indeed, this is what we 
find very clearly in the lower portion of 
Table 6. The weighted mean difference be- 
tween the strongly negative and mildly nega- 
tive subjects for both samples combined is 
clearly significant (t = 2.84, p < .01). 

In the case of the positive communicator 
we expected the direct and indirect effects 
to cancel each other out. In the absence of 
any information about the relative strengths 
of the two effects, one would predict no 
consistent relationship between strength of 
feeling and attitude change. The relevant 
data are presented in the upper portion of 
Table 6. As far as the laws message is con-
cerned, the pattern is very clear: the three
means are almost identical. As for the parents
message, the slightly negative change in the 
moderately positive group is very difficult to 
explain. Fortunately, however, none of these 
differences is significant. The meaning of a 
lack of significant differences is, of course, al- 
ways ambiguous. But the data here are at least 
consistent with the proposition that an in- 
crease in strength of positive feeling toward 
the communicator may not be accompanied 
by the expected increase in attitude change 
because of the simultaneous increase in the
tendency to misperceive his message. 
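
The reasoning about direct and indirect effects in the two preceding paragraphs can be summarized as a combination of signs. The sketch below is only a qualitative illustration: it assumes each effect can be coded as +1 (stronger feeling raises attitude change) or -1 (stronger feeling lowers it), and, like the text, it assumes nothing about the relative strengths of the two effects. The function name and the coding are hypothetical.

```python
def predicted_relationship(direct_sign: int, indirect_sign: int) -> str:
    """Predicted relation between strength of feeling and attitude change.

    Each sign is +1 or -1. Opposite signs give no consistent prediction,
    because the relative strengths of the two effects are unknown.
    """
    if direct_sign not in (-1, 1) or indirect_sign not in (-1, 1):
        raise ValueError("signs must be +1 or -1")
    if direct_sign != indirect_sign:
        return "no consistent relationship"
    return "direct" if direct_sign > 0 else "inverse"


# Positive communicator: stronger liking favors change (+1), but more
# assimilation makes change less necessary (-1).
# Negative communicator: stronger dislike favors rejection (-1), and more
# contrast also works against change (-1).
EFFECT_SIGNS = {"positive": (+1, -1), "negative": (-1, -1)}

for communicator, (direct, indirect) in EFFECT_SIGNS.items():
    print(communicator, "->", predicted_relationship(direct, indirect))
```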

In discussing the “indirect effect” of the 
strength of feeling variable, we postulated 
an inverse relationship between amount of 
displacement—positive or negative—and 
amount of attitude change. To check on this 
assumption, we examined the relationship be- 
tween positive displacement and attitude 
change for the positive communicator, and 
between negative displacement and attitude 
change for the negative communicator. In 
each case, 2 × 2 tables, comparing high
versus low displacement and high versus low 
change, were formed. The data from both 
messages were combined for these analyses. 

The predicted inverse relationship between 
attitude change and negative displacement of 
the messages delivered by the negative com-
municator was found (χ² = 4.33, p < .05).
In this analysis, the sign of the change score 
was taken into account, so that a negative 
score represented the low end of the change 
continuum. It might be argued, however, that 
displacement reduces the probability of nega- 
tive change as well as positive change. Thus, 
if the subject perceives the message of a
negative communicator correctly, he may
change in the opposite direction in order to
differentiate himself from the communicator,
that is, to increase the distance between
his own position and the communicator’s position.
But, if he displaces the message in a
negative direction, he can achieve the same 
end without the necessity of negative change. 
By this logic, we would expect an inverse 
relationship between amount of negative dis- 
placement and absolute amount of attitude 
change (ignoring whether the change is in a 
negative or a positive direction). This expectation
was fully confirmed by the data
(χ² = 16.46, p < .001).
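
As a check on the significance levels reported for the 2 × 2 analyses, the sketch below converts a chi-square statistic with one degree of freedom into a p value, and shows the usual uncorrected formula for a 2 × 2 table. The cell counts in the example are hypothetical (the original cell frequencies are not reported), and the text does not say whether a continuity correction was applied, so the formula is an assumption.

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Uncorrected chi-square for a 2 x 2 table with rows (a, b) and (c, d)."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


def chi_square_p_value_df1(statistic: float) -> float:
    """Upper-tail probability of a chi-square variate with 1 degree of freedom.

    With 1 df the statistic is a squared standard normal deviate, so the
    tail area equals erfc(sqrt(statistic / 2)).
    """
    return math.erfc(math.sqrt(statistic / 2.0))


# Reported statistics from the text.
for reported in (4.33, 16.46):
    print(reported, chi_square_p_value_df1(reported))

# Hypothetical counts: high/low displacement by high/low change.
print(chi_square_2x2(10, 20, 20, 10))
```

Under these assumptions the first statistic falls between the .05 and .01 levels and the second falls beyond the .001 level, in agreement with the values given above.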

No significant inverse relationship between 
attitude change and positive displacement of 
the messages delivered by the positive com- 
municator was found, although there was a 
slight trend in the predicted direction. 


Discussion 


The main conclusion to be drawn from 
the two experiments we have described is that
the tendency to perceive communication 
content in line with one’s attitude toward 
the communicator is most likely to come into
play when the communicator arouses strong 
feelings. This finding is quite consistent 
with what one would expect from balance 
theory, if we make two fairly straightforward
assumptions. First, as the feeling toward O 
becomes stronger, the imbalance and the 
resulting forces to restore balance generated 
by any sign of incongruity also become
stronger. Since misperception is one of the 
mechanisms for restoring balance, the likeli- 
hood of its use—provided the situation per- 
mits it—will thus increase. Second, as the 
feeling toward O becomes stronger, one of 
the major mechanisms for restoring balance— 
namely, changing one’s attitude toward O— 
becomes less and less available: it is difficult
to devalue a strongly positive O just because 
one finds one’s self in disagreement with him 
(Pilisuk, 1962), and it is difficult to open 
one’s heart to a strongly negative O just 
because one finds one’s self in agreement with 
him. Thus, the likelihood increases that other 
mechanisms for restoring balance, such as 
misperception, will come into play. With these 
additional assumptions, balance theory is
perfectly capable of encompassing the present
findings. 

Even though the data do not require any
further explanation, we feel that there is some 
value in conceptualizing them in terms of 
certain qualitative extensions of balance 
theory. We are proposing that misperception 
of communication content is not simply a 
function of the intensity of the attitude to- 
ward the communicator, but that it depends 
on the nature of the concern that the situa- 
tion arouses. Specifically, we hypothesize 
that misperception is a function of the degree
to which the incongruous situation raises
questions of self-definition in the subject. 
According to this interpretation, the variation 
in strength of feeling toward the communica- 
tor in Experiment II represents not merely a 
variation in the intensity of attitude, but also
one in the degree to which the issue of self-definition
is brought into play. The experiment
does not enable us to test this interpretation, 
since the latter effect cannot be separated 
from the effect of differences in intensity 
per se. Nevertheless, we regard an inter- 
pretation that uses concern with self-defini- 
tion as the intervening motivation to be 
theoretically fruitful. It should contribute to 
needed extensions of balance theory by at- 
tempting to spell out the specific conditions 
under which a particular mode of restoring 
balance is likely to be used. Furthermore, it 
permits us to generalize the present findings 
by suggesting other variables—aside from 
strength of feeling—that would be likely to 
bring misperception into play. 

In order to clarify what we have in mind 
by concern with self-definition and its relation 
to misperception, let us return to Experiment I.
We would propose that Mr. Sprague’s
communication raised, for our subjects, a 
question of self-definition. On an issue central 
to their self-concepts, they were confronted 
with a man who symbolized at least some of 
the forces in society that they regard as 
blocking them and in opposition to them. 
He represented a negative reference person, 
intimately tied to their self-concepts in the
sense that they defined themselves in op- 
position to him. The person’s relationship to 
such negative reference points, from which 
he differentiates himself and which represent 
the side he is against, may be as important to 
his self-definition as is his relationship to 
the individuals and groups representing his 
own side. It is in part through this process of 
differentiation that he defines what he himself 
stands for. Given, then, a situation in which 
concern with self-definition is brought into 
play, the person is not particularly oriented 
to what the other says, to the details of his 
argument. Rather, he is oriented to what the 
other is, to the other’s position in relation to 
the self; is he on my side or is he against me? 
What the communicator says is likely to be
viewed as an aspect of what he is, as a part
of him that cannot easily be separated from
the rest. The tendency to perceive communication
content in line with one’s attitude to the
communicator should thus be particularly 
strong. 

This phenomenon, of course, is not at all 
restricted to negative communicators. A positive
communicator, dealing with a particular
issue, may similarly bring a concern with 
self-definition into play. Here a positive self- 
defining relationship would be at stake. The 
other would be important to the person’s self- 
concept in the sense that he sees the other as 
representing his own side and defines him- 
self as similar to him. It is generally assumed 
that it is in part through this process of com- 
parison with positive reference points that 
the person defines what he himself stands for. 
When a positive communicator, then, arouses 
a concern with self-definition, we would again 
expect the tendency to perceive communica- 
tion content in line with one’s attitude to the 
source to be especially strong. In this case, 
of course, this tendency should take the form 
of assimilation. In Experiment I, we feel that 
the positive communicators—while respected 
and highly regarded—did not arouse any 
particular concern with self-definition.⁵ Disagreement
with them may have been regarded
as a difference in intellectual judgment or in 
strategy, but not as a fundamental difference 
in one’s place in the world. Under these 
circumstances, it was presumably possible for 
the subjects to separate the content of the 
communication from its source and to perceive 
it veridically. 


5 Two items from Table 1, “my type of person” 
and “a person whom I would want to be like,” are 
particularly relevant to self-definition. While re- 
sponses from the subjects exposed to the negative 
communicator were highly polarized toward the 
disagree end of the scale, responses from the sub- 
jects exposed to the positive communicators were 
not highly polarized toward the agree end of the 
scale. For example, in response to the item “my 
type of person,” 55% of the subjects checked 
“disagree strongly” for Mr. Sprague, while only 8% 
of the subjects checked “agree strongly” for Prof. 
Lancaster, and 12% of the subjects checked “agree 
strongly” for Rev. Bryant. Thus, it would seem that
disassociating themselves from Mr. Sprague was 
more relevant to the subjects’ self-definition than 
associating themselves with the positive speakers. 


In Experiment II, both the positive and 
the negative communicator can be presumed 
to have some potential relevance to the sub- 
jects’ self-definition. It is reasonable to as- 
sume that, in those subjects in whom strong 
feelings were aroused, the issue of self-definition
did indeed come into play. The greater
tendency to misperceive the content of the 
communication on the part of subjects with 
strong positive or negative feelings toward 
the communicator would thus be ascribed to 
their greater concern with self-definition in 
relation to the communicator. 

It should be noted that, according to the 
hypothesis we are proposing, perception of 
communication content should not be affected 
by the strength of positive or negative attitudes
to the communicator if these are based
strictly on his credibility. We would expect 
more attitude change with a highly credible 
communicator, but no particular effect on 
comprehension of the message—which is what 
happened in the Hovland-Weiss (1952) and 
Kelman-Hovland (1953) experiments. It is 
also possible that a highly credible communi- 
cator will produce neither attitude change nor 
distortion, but that the person will accept 
the disagreement and tolerate the resulting 
incongruity. Furthermore, according to our 
hypothesis, perception of communication con- 
tent should not even depend on liking and 
disliking of the communicator per se. One can 
like a person, in the sense of enjoying association
with him, or dislike him, in the
sense of being uncomfortable in his presence, 
without the issue of self-definition arising. 
A liked person will probably produce more 
attitude change, but it is also possible to 
“agree to disagree” and continue a pleasant 
association with him. Similarly, it is possible 
to tolerate agreement with a disliked person. 
While the like-dislike dimension and the
dimension of relevance to one’s self-definition
are, thus, theoretically independent, they are
closely intertwined as we approach high in- 
tensities of like or dislike. A person whom we 
like intensely or hate profoundly will almost 
inevitably be linked to our self-definition. 


Source Orientation and Content Orientation 


We can speak of the tendency to view
one’s relationship to the communicator in
terms of its relevance to self-definition as
source orientation. Source orientation is defined
as an approach to the communication
situation in which the relation of the source
to the person’s self-definition has the greatest
motivational significance. It can be contrasted
with content orientation, an approach to the
communication situation in which the relation
of communication content to the person’s
value system has the greatest motivational
significance.

When source orientation prevails, the person
is primarily concerned with the implications
of any communication from the other
for his own relationship to him. Given this
set, he is unable to separate the content of
the communication from the source. All communications
are treated as emanations from
the other and as parts of him, and their contents
tend to be perceived, understood, and
interpreted in the context of the source. Under
these circumstances, when there is an incongruity
between message and source, we
would expect a tendency to displace the
message in a congruent direction, that is, to
assimilate a message from a positive source
and to contrast a message from a negative
source. Other mechanisms may be brought
into play when the message is so clear that
displacement at the perceptual level becomes
difficult. For example, the person may refuse
to believe that the source actually made the
statement ascribed to him (cf. Osgood &
Tannenbaum, 1955) or he may reinterpret
the message in such a way that it is congruent
with the source (cf. Asch, Block, & Hertzman,
1938; Lewis, 1941). All of these mechanisms
have the effect of permitting the person to
maintain message and source in a balanced
configuration. The stronger the source orientation,
the greater the likelihood that they will
be brought into play.

Source orientation should also be related, in 
a systematic way, to attitude change. The 
greater a person’s source orientation, the 
stronger should be the impact of the com- 
municator’s position on the person’s own 
attitudes. Thus, when the communicator is 
positive and source orientation is high, the 
person would be strongly motivated to agree 
with the communicator: agreement, in this 
case, is essential to the maintenance of a
relationship in which the person’s self-definition
is anchored. Strength of source orientation,
then, should be directly related to
amount of attitude change in response to a 
positive communicator. On the other hand, 
when the communicator is negative and 
source orientation is high, the person would 
be strongly motivated to differentiate himself 
from the communicator: disagreement is 
essential, in this case, to the maintenance of 
the person’s self-definition. The stronger the 
source orientation, then, the greater should
be the tendency to reject the message of a 
negative communicator, or even to change in 
a negative direction (thus enhancing the 
distance and degree of differentiation between 
the source and the self). 

Displacement and attitude change can be 
viewed as alternative ways of achieving the 
congruity between source and message that 
is so important when source orientation 
prevails. One can assume that displacement is 
the most likely mechanism to be used when 
the message is highly ambiguous and thus 
easily lends itself to different interpretations. 
When the message is very clearly structured, 
however, attitude change should occur. Typ- 
ically, the message is of intermediate am- 
biguity and thus both mechanisms come 
into play to some degree. When this happens, 
it becomes difficult to predict the relation- 
ship between strength of source orientation 
and amount of attitude change produced by a
positive communicator. On the one hand, the 
stronger the source orientation, the greater 
the tendency to change in the direction of 
agreement with the communicator. On the 
other hand, the stronger the source orienta- 
tion, the greater the tendency to assimilate 
the message, thus making attitude change 
unnecessary. In other words, as we indicated 
above, the direct and the indirect effects of 
source orientation on attitude change point 
in opposite directions and thus may cancel 
each other out. In the case of a negative 
communicator, the direct and indirect effects 
reinforce each other. That is, the stronger 
the source orientation, the less the tendency 
to change. Also, the stronger the source 
orientation, the greater the tendency to con- 
trast the negative communicator’s message, 
which makes attitude change less necessary.
Thus, both tendencies should lead to greater
rejection of the message under high negative 
source orientation. The picture becomes more 
complicated, however, if we think not of re- 
jection but of negative change. Here the 
direct and indirect effects of source orienta- 
tion should again move in opposite directions. 
The stronger the source orientation, the 
greater the tendency to change away from a 
negative source. On the other hand, the
stronger the source orientation, the greater 
the tendency to contrast the message, thus 
making negative change less necessary: dif- 
ferentiation has been accomplished through 
perceptual displacement. 

Content orientation implies an approach to 
the communication very different from source 
orientation. When content orientation pre- 
vails, the person is primarily concerned with 
the implications of the particular content of 
the communication for his efforts at goal 
achievement and value maximization. While
his attitude toward the source determines 
how seriously he takes the communication, he 
is perfectly capable of separating content 
from source. In fact, he may deliberately try 
to strip the message of its original context, 
so that he can evaluate it in the context of
his own value system with minimal extraneous 
interference. Under these circumstances, the
person is likely to perceive the content of 
the communication veridically, even if there 
is incongruity between content and source. 
Congruity is less important here than an ob- 
jective, accurate perception of the material 
he has to evaluate. The stronger the con- 
tent orientation, the greater the tendency to 
perceive objectively. 

It should be noted that content orientation,
while it counteracts perceptual distortions, 
may on occasion have dysfunctional con- 
sequences. If the person makes too sharp a 
separation between message and source, he 
may miss the central point by failing to 
see the message in its proper context. By the 
same token, source orientation, even though it 
often leads to perceptual distortions, is not 
necessarily dysfunctional under all circum- 
stances. By viewing the message in the con- 
text of the source, the person may become 
alerted to certain valid clues to the “real
meaning” and intent of the speaker’s statements,
and to certain relationships between
different statements that might otherwise be
seen as completely separate. Presumably, a
balance between these two orientations, such
that the person sees the content in relation 
to the source, yet at the same time is able 
to separate it out and judge it at its face 
value, would have the greatest functional 
utility in most situations. 

Content orientation, like source orientation, 
should be related in a systematic way to 
attitude change. In general, given a credible
communicator, we would expect a direct re- 
lationship between content orientation and 
attitude change. When content orientation is 
high, the person is more open to the implications
of communication content for his
own goals and values, and more ready to 
change. While this greater openness does not 
necessarily lead to change, it does at least 
increase the probability that change will 
occur. Whether, indeed, change does occur 
depends on the nature of the communication 
content—its relevance to the person’s value 
system. The fact that the communication 
comes from a positive source, in and of itself, 
would not lead to attitude change, since the 
person’s relationship to the source is not
particularly at issue. By the same token, when 
content orientation prevails, the fact that a 
communication comes from a negative source 
should not lead to negative change, although 
it should decrease the probability that the 
communication will be accepted. 

Source orientation and content orientation 
are essentially the motivational bases of two 
processes of attitude change that have been
distinguished in earlier work (Kelman, 1958, 
1961): identification and internalization. 
Identification refers to the adoption of behavior
derived from another because it is
associated with a satisfying self-defining relationship
to this other. Internalization refers
to the adoption of behavior derived from 
another because its content is congruent with 
the person’s value system. The concepts of 
source orientation and content orientation 
thus allow us to coordinate two types of per- 
ception of communication content with two 
types of attitude change. If, for whatever 
reason, a person’s primary concern in an 
influence situation takes the form of source
orientation, and the communicator is attractive
(Kelman, 1961, p. 68), then we
would predict that he will either change his
attitudes through identification or misperceive
the communication through assimilation. If,
on the other hand, the person’s primary concern
takes the form of content orientation,
and the communicator is high in credibility
(Kelman, 1961, p. 68), then we would predict
that he will either change his attitudes through
internalization or simply accept the fact that
he disagrees with the other.

Both under source orientation and content
orientation, the probability that the
person will change his attitudes rather than
misperceive or accept disagreement, as the
case may be, will depend on the prepotency
of the induced behavior, a key variable in
the earlier formulation of social influence
(Kelman, 1961, pp. 68-69). Prepotency refers
to the extent to which the induced behavior
comes to stand out as the most desirable path,
relative to alternative response possibilities.
There are various ways in which prepotency
may be achieved. The induced behavior may
become prepotent because it has been clearly
designated as a requirement of a particular
role. This type of prepotency is especially relevant
to source orientation. If source orientation
prevails and the communicator is attractive,
then we would expect attitude change
(via identification) to the extent to which role
requirements have been clearly delineated. In
the absence of this type of prepotency, assimilation
of the communicator’s message is
the more likely outcome. Another way in
which the induced behavior may become prepotent
is through the restructuring of means-ends
relationships, such that the implications
of the new behavior for certain of the person’s
values are clearly brought out. This type of
prepotency is especially relevant to content
orientation. If content orientation prevails
and the communicator is credible, then we
would expect attitude change (via internalization)
to the extent to which new value
implications have been clearly brought into
focus. In the absence of this type of prepotency,
the person, while perceiving the
message content veridically, may simply
accept the difference between the communicator’s
view and his own without changing his
attitude.
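
The four outcomes just described can be laid out as a small lookup table. The sketch below is only a restatement of the predictions in the text; it assumes an attractive communicator under source orientation and a credible communicator under content orientation, and it treats prepotency as simply high or low. The names used are illustrative.

```python
# Predicted outcome for each (orientation, prepotency is high) combination.
# Source orientation assumes an attractive communicator; content
# orientation assumes a credible one.
PREDICTIONS = {
    ("source", True): "attitude change via identification",
    ("source", False): "assimilation of the message (misperception)",
    ("content", True): "attitude change via internalization",
    ("content", False): "veridical perception, disagreement accepted",
}


def predicted_outcome(orientation: str, prepotent: bool) -> str:
    """Return the predicted response to an incongruent message."""
    try:
        return PREDICTIONS[(orientation, prepotent)]
    except KeyError:
        raise ValueError(f"unknown orientation: {orientation!r}") from None


for orientation in ("source", "content"):
    for prepotent in (True, False):
        print(orientation, prepotent, "->", predicted_outcome(orientation, prepotent))
```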

If we accept the conceptual utility of the 
notions of source orientation and content 
orientation, we must proceed to study the 
conditions under which these orientations are 
likely to prevail. We have, of course, already 
discussed the proposition that source orienta- 
tion, rather than content orientation, is likely 
to prevail when the communicator arouses 
strong feelings. Our data are consistent with 
this proposition, but the use of this variable 
is not likely to provide a conclusive test of 
the validity of the distinction because it is 
wide open to alternative explanations. Source 
orientation and content orientation should 
also be a function of personality disposi- 
tions. If we can distinguish between in- 
dividuals who generally tend to be source 
oriented and those who tend to be content 
oriented, we can obtain stronger tests of 
the hypotheses proposed in this paper. To this 
end a recent study (Weil, 1962) used the 
index of source orientation versus message 
orientation developed by McDavid (1959). 
The study showed that source-oriented sub- 
jects are more likely to assimilate the content 
of a communication ascribed to a positive 
reference group. As expected, this tendency is 
particularly pronounced when the prepotency 
of the induced position is low—that is, when 
there is some ambiguity about the extent 
to which the communication content represents
group norms.

The most conclusive test of our hypotheses
would be provided by a study in which 
source orientation and content orientation are 
experimentally induced—for example, through 
the manipulation of situational sets. Such a 
study remains to be done.

Differences in the performance of extraverts and introverts have been related
by Eysenck both to differences in the rate of growth of associative connections
(excitation) and to the rate of growth of reactive inhibition. Previous attempts
to test either or both versions of the theory have been indecisive. In the present
experiment, groups of extreme introverts and extraverts, matched for neuroticism
(drive), practiced the pursuit rotor under spaced conditions to
asymptotic level. They were then switched to massed practice. No differences
between the groups were found in rate of acquisition of the skill, in asymptotic
level reached, or in decline in performance under massed practice. In the
latter condition, both groups showed a decline of about 30% time on target
over a period of 7 min. The results do not support either the 1955 or the 1957
versions of Eysenck’s theory of extraversion-introversion.

In a number of studies Eysenck (1956, 1957, 1960, 1962) has attempted both
to confirm, and account theoretically for, the observed differences found in
earlier studies between extraverts and introverts (Eysenck, 1947). In doing so,
he has made use of the basic formula of Hull¹ (1943):

$$ {}_{s}E_{R} = ({}_{s}H_{R} \times D) - (I_{R} + {}_{s}I_{R}) $$
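
As a rough illustration of how the terms of Hull’s formula combine, the following sketch computes effective reaction potential from habit strength, drive, reactive inhibition, and conditioned inhibition. The function name, the argument names, and all numerical values are assumptions chosen for illustration only; they are not estimates from this or any other study.

```python
def reaction_potential(habit_strength, drive, reactive_inhibition, conditioned_inhibition):
    """Hull's basic formula: sE_R = (sH_R * D) - (I_R + sI_R)."""
    return habit_strength * drive - (reactive_inhibition + conditioned_inhibition)


# Hypothetical values: habit strength at asymptote, drive held constant.
# Under spaced practice both inhibition terms are assumed to be negligible.
spaced = reaction_potential(0.9, 1.0, 0.0, 0.0)

# Under massed practice some inhibition is assumed to accumulate,
# which lowers reaction potential while habit strength is unchanged.
massed = reaction_potential(0.9, 1.0, 0.2, 0.07)
```

With habit strength and drive equal in two groups, any difference in reaction potential in this sketch can arise only from the two inhibition terms, which is the logic of the design described in this paper.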


Eysenck’s theoretical position with respect to the explanation of individual
differences on the introversion-extraversion (I-E) dimension was first
formulated in 1955 and revised in 1957. In the original version (Eysenck,
1955), three postulates are invoked. The first postulate states that


Whenever any stimulus-response connection is made in an organism (excitation),
there also occurs simultaneously a reaction in the nervous structures mediating
this connection which opposes its recurrence [p. 96].


According to the second postulate 


Human beings differ with respect to the speed with which reactive inhibition
[I_R] is produced, the strength of [I_R], and the speed with which [I_R] is
dissipated. These differences themselves are properties of the physical
structures involved in the evocation of responses [p. 96].


¹ Subsequent modifications of Hull’s basic equation (see Jones, 1958) do not
affect the arguments of this paper, nor the interpretation of the experimental
The third (typological) postulate links the
behavioral concepts to the personality dimen- 
sion and states that 


Individuals in whom [I_R] is generated quickly, in whom strong [I_R] are
generated, and in whom [I_R] is dissipated slowly are thereby predisposed to
develop extraverted patterns of behavior and to develop hysterical disorders in
cases of neurotic breakdown; conversely, individuals in whom [I_R] is generated
slowly, in whom weak [I_R] are generated, and in whom [I_R] is dissipated
quickly, are thereby predisposed to develop introverted patterns of behavior
and to develop dysthymic disorders in cases of neurotic breakdown [p. 97].


It is clear that this formulation related personality differences on the I-E
dimension to Hull’s I_R construct and presumably assumed that the drive and
habit strength components remain comparable in introverts and extraverts during
training and performance. The possible relevance of conditioned inhibition
(sI_R) was not mentioned. However, the second (revised) version of the
postulate system (Eysenck, 1957) asserted that


individuals in whom excitatory potential is generated slowly and in whom
excitatory potentials so generated are relatively weak, are thereby predisposed
to develop extraverted patterns of behaviour and to develop
hysterical-psychopathic disorders in cases of neurotic breakdown [p. 114]


while the contrary is asserted to be the case for introverted subjects. The
postulate concerning rate of growth, strength, and rate of dissipation of I_R
is retained in the same form as in the 1955 system. Now, since Eysenck,
tentatively at least, equates neuroticism with high drive (e.g., Eysenck, 1962,
p. 127) and since he also regards I-E and neuroticism as constituting
orthogonal dimensions of personality in normal subjects, it seems clear that by
the term “excitatory potential” he is actually referring to the rate of growth,
etc., of sH_R, that is, to the actual strength of the habit structures
(associations) involved in learning and in performing a particular response.
Thus, he argued that


there is no indication in our system that difference 
in irrelevant drive should produce any differences in 
conditionability; consequently our prediction points 
to a correlation between conditionability and 
extraversion-introversion, but not to any correlation 
between neuroticism and conditionability [Eysenck, 
1957, p. 115] 


and proceeded to infer that normal and 
neurotic introverts should condition equally 
well.

Several attempts have been made to test the validity of the theory, though it
is not always clear which version is under scrutiny. The general line of
argument has been that under conditions of massed practice, extraverts will
generate more I_R than introverts, so that, if a rest pause is inserted after a
period of massed practice, the dissipation of I_R during the rest period will
result in extraverts showing a greater increase in level of performance when
practice is resumed (the reminiscence phenomenon). These empirical studies have
yielded apparently contradictory results, which need not be considered in
detail here, since they have recently been reviewed by Eysenck (1962), who has
pointed out that some of the contradictions may be resolved if the precise
conditions of practice and rest are taken into account. It seems clear,
however, that attempts to test either version of the theory via the
reminiscence phenomenon must lead to ambiguous results. Comparison of the
postrest performance of groups of introverts and extraverts in this way, while
the particular associations under investigation are still being acquired, must
of necessity confound the rate of growth and strength of sH_R with the rate of
growth and the strength of I_R, since sH_R by definition is not asymptotic.
Hence, at any particular point at which a rest pause is inserted following
massed practice, introverts and extraverts may not be comparable with respect
to sH_R; indeed, in terms of the 1957 version, they should not be equal in this
respect. It is concluded, therefore, that a more satisfactory way of
unequivocally testing either version of Eysenck’s theory is required. Such a
method is reported in this paper.

Consider a task such as pursuit-rotor performance. In unpracticed subjects the
initial level of performance (as indexed by time on target per 10-second trial)
is very low. If groups of introverts and extraverts (matched for drive level)
are given appropriately spaced practice on this test, then during acquisition
of the skill neither I_R nor sI_R will be generated in amounts sufficient to
affect performance differentially in the two groups. Hence, if sufficient
practice is given so that performance reaches asymptotic level, the performance
curve obtained will be a direct measure of sH_R, all other variables in the
basic equation having been controlled. In terms of Eysenck’s 1955 theory, then,
it would be predicted that under these conditions of acquisition the learning
curves of extraverts and introverts would be identical, both as regards rate of
approach to asymptotic level and the asymptotic levels reached. On the other
hand, if the 1957 version is correct (that is, that extraverts also build up
sH_R more slowly) then, with spaced acquisition trials, extraverts will reach
the same asymptotic levels as introverts, but will approach the asymptote more
slowly.

In either event, once the curve of acquisition reaches asymptotic level for
both groups it becomes possible to make a direct test of Eysenck’s inhibition
postulate by a switch to massed practice. With sH_R maximized and drive level
controlled, it would be predicted that extraverts will show a greater decrement
in response under massed practice than introverts, since the former will
accumulate I_R more rapidly than the latter.

In null form, the two hypotheses under test are: that under conditions of
spaced practice and with drive level controlled, extraverts and introverts will
not differ in rate or asymptotic level of acquisition in pursuit-rotor
learning; and will not differ in amount of decrement under massed practice
instituted after asymptotic level has been reached.


METHOD


Subjects


Scores for the Neuroticism (N) and Introversion-Extraversion (E) scales of the
Maudsley Personality Inventory (MPI) were available for 260 first-year
psychology students. The initial selection of extreme E groups was made by
abstracting the highest and lowest E scorers on the MPI whose N score was
within ±.5σ of the average N score for the total sample. There was no overlap
between these groups on E score, but they were closely comparable as to N
score. Since Eysenck and Claridge (1962) have conceded that the MPI is not an
entirely satisfactory criterion of extraversion, final selection of the extreme
groups was made from within this initial dichotomy by administration to all of
these subjects of the cumulative time error test described by Lynn (1961). The
final extreme groups were made up of subjects gaining extreme scores on both
the MPI and the time error test. There were 10 subjects in each group with the
sexes equally represented, and the groups did not differ on N score. The
characteristics of the groups are shown in Table 1.


TABLE 1
EXTREME GROUPS SELECTED ON THE BASIS OF MPI SCORES AND CUMULATIVE TIME ERROR TEST

                         MPI E scale            MPI N scale            Time error
Group                    M      SD    Range     M      SD    Range     M      SD     Range
Extraverted (N = 10)     39.90  2.70  37-46     23.60  2.11  20-28      4.00   2.07   2-9
Introverted (N = 10)     20.60  6.51   3-26     23.70  3.42  19-28     21.75  10.67  18-69


Apparatus


The Lafayette pursuit rotor was used. This instrument has a .75-inch target
rotating at 50 revolutions per minute on a circular disk of 3.25-inch radius.
The stylus consisted of a plastic handle attached to an articulated metal tip,
4 inches in length. For timing purposes, three Relion electric chronoscopes
were interconnected. One was used as a switch to alternate the others at
10-second intervals, with time-on-target recorded to an accuracy of
one-hundredth of a second.


Practice and Rest Periods 


In a further modification of the theory, Eysenck and Maxwell (1961) have argued
that the dissipation of I_R will be maximally reinforcing (that is, result in
increments of sI_R) only if I_R is allowed to reach a critical level which
results in involuntary rest pauses (IRPs). Thus, the spaced practice trials had
to be chosen so that IRPs would not occur during practice on the pursuit rotor
and any I_R which did accumulate would entirely dissipate during the subsequent
rest period. A search of the literature on distributed practice on the pursuit
rotor revealed considerable variation in the practice-rest combinations used in
previous studies comparable to the present one. The ideal condition would
clearly be that in which the highest asymptotic level is yielded by a short
practice time and a long rest interval (Ammons, 1950; Ammons & Willig, 1956).
Hence, it was decided to use work periods of 12 seconds and rest intervals of
58 seconds in this study. Under these conditions, no IRPs would be expected
(and therefore no substantial increments of sI_R) and if minimal amounts of I_R
were generated, these should all dissipate during the rest period.


Procedure 


Subjects were tested individually. The usual instructions were given,
explaining the nature of the task, and subjects were informed that a number of
short trials, broken by rest periods, would be followed by a longer unbroken
period of practice. The experimenter demonstrated the apparatus but no
pretesting trials were given. The subject stood while a trial was in progress,
but sat down between trials, when the rotor was at rest. Seven seconds’ warning
was given before a trial commenced to enable the subject to stand up and grasp
the stylus; a 2-second ready signal preceded the starting of the rotor. Forty
12-second trials were given, with a rest period of 58 seconds between trials.
After the fortieth trial, the subject was given a rest of 10 minutes and was
then given 7 minutes of continuous practice (that is, 42 10-second trials).
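
The timing of a session can be checked with a short calculation. The sketch below uses only the durations stated in the procedure; the constant and function names are illustrative, and it assumes that the 58-second rests fall only between successive spaced trials (39 rests for 40 trials) and that instructions and demonstration time are excluded.

```python
SPACED_TRIALS = 40          # acquisition trials
WORK_SECONDS = 12           # length of each spaced trial
REST_SECONDS = 58           # rest between successive spaced trials
LONG_REST_SECONDS = 10 * 60 # rest after the fortieth trial
MASSED_SECONDS = 7 * 60     # continuous practice
SCORED_SECONDS = 10         # scored portion of each trial


def session_summary():
    """Return trial counts and durations implied by the stated schedule."""
    spaced_work = SPACED_TRIALS * WORK_SECONDS
    spaced_rest = (SPACED_TRIALS - 1) * REST_SECONDS  # assumed: no rest before trial 1
    massed_trials = MASSED_SECONDS // SCORED_SECONDS
    total = spaced_work + spaced_rest + LONG_REST_SECONDS + MASSED_SECONDS
    return {
        "massed_trials": massed_trials,
        "scored_acquisition_seconds": SPACED_TRIALS * SCORED_SECONDS,
        "total_seconds": total,
    }


summary = session_summary()
```

The massed period divides into 42 scored 10-second trials, as stated in the procedure, and the whole timed session under these assumptions runs a little over an hour.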


Scoring Procedure 


The first 2 seconds of each practice period were ignored. Thus, each
acquisition trial was 10 seconds in length, and comparable with the 10-second
practice periods during massed practice. Scores were recorded immediately after
each trial, the time-on-target score being visible to the experimenter for
7 seconds before the chronoscope was automatically reset for the next trial.
The written score was immediately rechecked against the clock. Subjects were
tested in random order.


RESULTS

Acquisition Curves


Data for one extraverted subject had to be discarded because of apparatus
failure. Acquisition curves for the I and E groups are shown in Figure 1 for
40 10-second trials. It is clear that both the rate of approach to the
asymptote, and the asymptotic levels, are virtually identical for both groups
throughout training. Asymptotic level was attained by about Trial 22 and
maintained unchanged through Trials 23-40 at a rate of 85-90% time on target.
It seems unlikely that further practice would have resulted in any increase in
time on target. The acquisition data were analyzed by means of the
Eckhart-Young technique (Tucker, 1958) programed for an IBM 1620 computer. This
technique is designed to reveal whether the individual performance curves are
of the same nature and whether the group curves are significantly different.
The analysis² revealed that a single component accounted for 98.56% of the
variance in the individual curves; when individual learning scores were
weighted with respect to this component, a t test performed on the resultant
group means was not significant. A second component, which accounted for only
.62% of the variance, described trial-to-trial variance within subjects. Again,
the component failed to discriminate between the groups.
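
The idea of a single component accounting for most of the variance in a set of individual curves can be sketched as follows. This is not the program used in the study: it assumes that the analysis can be approximated by the leading singular component of the raw (uncentered) subjects-by-trials score matrix, found here by power iteration, and the function name and the demonstration curves are hypothetical.

```python
def first_component_share(scores, iterations=200):
    """Share of the total sum of squares captured by the leading component.

    `scores` is a list of per-subject lists of trial scores (not all zero).
    The share equals the largest eigenvalue of the subjects-by-subjects
    cross-product matrix divided by its trace.
    """
    n = len(scores)
    gram = [[sum(a * b for a, b in zip(scores[i], scores[j])) for j in range(n)]
            for i in range(n)]
    vec = [1.0] * n
    for _ in range(iterations):
        nxt = [sum(gram[i][j] * vec[j] for j in range(n)) for i in range(n)]
        norm = sum(v * v for v in nxt) ** 0.5
        vec = [v / norm for v in nxt]
    # Rayleigh quotient of the converged unit vector gives the top eigenvalue.
    gv = [sum(gram[i][j] * vec[j] for j in range(n)) for i in range(n)]
    top = sum(v * g for v, g in zip(vec, gv))
    total = sum(gram[i][i] for i in range(n))
    return top / total


# Hypothetical time-on-target curves (seconds) for three subjects.
demo_curves = [
    [2.0, 5.0, 7.0, 8.0, 8.5],
    [2.2, 5.1, 6.8, 8.1, 8.6],
    [1.8, 4.7, 7.1, 7.9, 8.4],
]
share = first_component_share(demo_curves)
```

When all subjects follow curves of the same shape, the share approaches 1, which corresponds to the pattern reported above for the acquisition data.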


² Acknowledgment is made to John Ross who
suggested and supervised this analysis.
FIG. 1. Pursuit-rotor learning (spaced practice) of extreme extraverts and
introverts.


FIG. 2. Decrement of pursuit-rotor performance from asymptotic level (massed
practice) in extreme extraverts and introverts. (Axes: time on target in
seconds against number of trials.)


Massed Practice Curves


Performance curves for the I and E groups are shown in Figure 2. For both
groups there is an average decrement in time on target of about 30% during the
7 minutes of practice. The Eckhart-Young procedure again revealed two
components similar to those found in analysis of the acquisition curves. The
first accounted for 97.51% of the variance; the second for .64%. In neither
case did the component discriminate between the I and E groups.


DISCUSSION


In the acquisition trials, both groups manifested a negatively accelerated rate
of approach to asymptotic level, which was reached by Trial 22 and thereafter
maintained. The rate of approach and the asymptotic level reached were
virtually identical for both groups. These results, taken alone, would appear
to refute the 1957 version of Eysenck’s theory which relates I-E differences in
part to rate of growth of sH_R. Where D, I_R, and sI_R are controlled, rate of
growth of excitation (sH_R) does not discriminate these groups.

However, the results under massed practice also appear to contradict the 1955
version of the theory which relates I-E differences in part to rate of growth
of I_R. At the commencement of massed practice, sH_R was asymptotic and equal
for both groups. The decrement of 30% during massed practice may reasonably be
attributed, within the theoretical framework utilized here, to the accumulation
of I_R and possibly also to increments of sI_R following upon the occurrence of
IRPs. However, the same rate and level of decrement were shown by both groups
and it seems to follow that inhibitory potential accumulated at an equal rate
in both introverts and extraverts, when habit strength was the same at the
beginning of massed practice.
Several points require further discussion. It was assumed that drive level was
held constant in this experiment by equating the groups on the N scale of the
MPI, a procedure fully in line with Eysenck’s own position (e.g., Eysenck,
1962, p. 127). It might be argued, however, that the identical acquisition
curves result from the operation of some other drive factor which operates to
raise the performance (sE_R) of the E group (that is, disguises a slower rate
of growth of sH_R or a faster rate of growth of I_R), while depressing (or not
affecting) the performance of the I group. Such a factor would have to be a
relevant task-specific or situation-specific drive to produce results of this
kind and the present authors are unable to conceive what this drive factor
might be. The most obvious possibility would be boredom with either the task or
the general situation. But this avoidance drive would be expected to affect
extraverts more than introverts and hence increase the effect of a slower rate
of growth of sH_R. Alternatively, it might be argued that conditioned avoidance
responses to the task or situation might be more rapidly learned by the
introverted group and hence depress its performance. These possibilities cannot
be entirely ruled out, but it should be noted that both groups reached an
asymptotic level of approximately 90% time on target which is probably about
the maximal level of skill that is possible on this task. Furthermore, it would
be expected that the operation of a conditioned avoidance drive and its
associated responses would be revealed during acquisition by an observable
decline in percentage of time on target between Trials 23 and 40. As has been
pointed out, however, no such decrement was manifested. It is therefore
concluded that the acquisition curves genuinely reflect the similarity of the
growth of excitation in the two groups. It is worth noting that Eysenck himself
has asserted that extraverts and introverts do not differ in drive if their N
scores are equated (Eysenck, 1962) and has also asserted that
TABLE 2
PERFORMANCE OF EXTRAVERTS AND INTROVERTS ON FIRST 10 POSTREST TRIALS

            Extraverts             Introverts
Trial       M       SD             M       SD
 1         .851    .144           .77     .095
 2         .755    .122           .785    .134
 3         .763    .132           .806    .114
 4         .754    .140           .787    [illegible]
 5         .756    .142           .752    .157
 6         .780    .142           .771    .134
 7         .765    .186           .791    .110
 8         .781    [illegible]    .809    .098
 9         .782    .119           .792    .116
10         .817    .135           .71     .136


in a task situation such as the pursuit rotor 
secondary achievement motivation would al- 
most certainly be higher in introverts than 
in extraverts and thus elevate their perform- 
ance (he does not mention avoidance drives). 

Second, it is worth noting that great care was used in selecting the extreme
groups in this experiment, a double criterion being adopted, namely, the
questionnaire measure and the objective test. It seems doubtful that a more
stringent method of selecting extreme groups could be used and hence conditions
were made as favorable as possible for obtaining results in line with
theoretical expectations.

Finally, a more serious objection to the interpretation of the results obtained
under conditions of massed practice may be derived from a recent discussion by
Eysenck (1962) of the complexity of his theory. Eysenck points out that while
it would clearly follow from his theory that under massed practice extraverts
would build up I_R more rapidly than introverts, the same argument would not
necessarily apply with respect to sI_R, since this is a (negative) habit which,
other factors equal, would increase at a more rapid rate in introverts. Hence,
the question whether the performance curves of extraverts and introverts would
diverge under massed practice (with sH_R asymptotic) would depend on the
precise conditions of practice and rest. Since introverts may have more IRPs
and dissipate I_R more rapidly than extraverts, sI_R would be more often
reinforced in the former, and therefore grow more rapidly. Thus, the influence
of the two factors might almost counterbalance each other in the two groups
under continuous massed practice, such as was used in this experiment, and both
groups would show the same performance decrement. Empirical data from the
present study throw some light on this question. Even if the argument is
correct, some divergence between the curves of performance during the first
10 trials of the massed practice period would be expected, since any decrement
in performance during this period must be mainly a function of I_R. Although
the Eckhart-Young analysis is sufficiently sensitive to have detected such
differences, had they been present, the data for the first 10 postrest trials
(excluding the two warm-up trials) are reproduced in Table 2. It is clear that
no divergent trends are apparent at any stage during these trials.

In light of the above discussion, it may be concluded that the results of the
experiment reported in this paper do not support either the 1955 or the 1957
version of Eysenck’s theory of introversion-extraversion. Subsequent
elaboration of the complexities involved in testing the theory does not
invalidate this conclusion.
RELATIONSHIP OF SELF AND IDEAL-SELF DESCRIPTIONS WITH
SEX, RACE, AND CLASS IN SOUTHERN ADOLESCENTS


This study evaluated the effects of sex, race, and social class on the self-
and ideal-self-concepts of adolescent Ss. Interpersonal Check List data were
obtained from 261 Negro and 211 white high school seniors from urban segregated
schools whose social class was determined on the basis of parental occupations
as reported by the students. Sex and race markedly influenced the results, but
class was not found to have any effect. Negro students obtained higher
dominance and love scores than the white students for self-ratings, but lower
scores on ideal descriptions. Males’ self- and ideal-self-ratings yielded
higher scores on dominance while females’ ratings yielded higher scores on the
love variable. There was less discrepancy between ideal and self-ratings of:
(a) Negroes compared with whites, (b) males compared with females on dominance,
and (c) females compared with males on love.


Psychologists have shown much interest in delineating various parameters of the
phenomenal self-concept. Recent studies have investigated the stability of the
self-concept through the period of adolescence (Engel, 1959); the relations
between self-concept, food aversions, and mother concept in emotionally
disturbed children (Davids & Lawton, 1961); as well as the relationship between
self-concept, sociometric rankings, and anxiety in elementary school children
of various ages (Horowitz, 1962; Lipsitt, 1958).

As yet, little clarification has been obtained concerning the effects of
specific demographic variables on phenomenal self or ideal-self. Klausner’s
(1953) early study of white male adolescents focused on the relations between
social class and self-concept. His results are quite interesting, but should be
considered as only suggestive as N was small (that is, 27). He found that
individuals in the higher socioeconomic levels are intropunitive and
psychosocially isolated, whereas individuals in the lower socioeconomic stratum
react to perceived insecurity and inferiority by aggression and self-assertion.
Wylie’s (1961) summary of the studies through 1958, including the one just
described, is


The four studies taken together do not permit us to conclude anything about the
relationship between socioeconomic class and the self concept [p. 139].


In a more recent experiment, Bieri and Lobeck (1961) assessed the effects of
parental identifications, religion, and social class on the self-concepts of
male members of an Army reserve unit. This study utilized the Interpersonal
Check List (ICL; LaForge & Suczek, 1955), an instrument which analyzes self-
and ideal-self-descriptions in terms of dominance (Dom) and love (Lov) scores.
The former is a measure of assertive, aggressive, leadership qualities while
the latter is a measure of friendly, warm, cooperative characteristics. These
investigators obtained significantly higher Lov scores from Catholics than from
Jewish subjects, and also found that upper-class subjects scored higher on Dom
than lower-class subjects.

Wylie (1961) has also reviewed the studies 
which deal with sex differences in self-concepts. 
She found marked inconsistencies in the findings
and concluded that


resolution of possibly contradictory results in this 
area awaits further research [p. 147]. 


Another important demographic variable whose 
relations with self-concept have not yet been 
explicated is race. Other data (McDonald & 
Gynther, 1963) suggest that Negro and white 
adolescent subjects’ reactions to statements about 
their emotional states, physical well being, and 
interests are remarkably influenced by these 
variables. The study referred to, however, found 
no differences ascribable to social class, a result 
which is in contrast with Bieri and Lobeck’s 
(1961) and Klausner’s (1953) tentative findings. 
The purpose of this study is to assess the effects 
of sex, race, and class on the self- and ideal-self- 
concepts of adolescent Negroes and whites and 
to determine if there are any interactions between
these variables.
METHOD
Subjects 


The Negro subjects consisted of 261 (151 female and 110 male) high school
seniors who comprised the entire group of graduates for 1961 and 1962 from the
only Negro high school in a southern urban area. This school is located on the
edge of the city and draws its students equally from the city and outlying
areas. The white subjects consisted of 211 (114 female and 97 male) high school
seniors who were the 1961 and 1962 graduates from three of the seven white high
schools in the community. These three schools were selected as representative
of different social classes and cultural settings in the area: an urban school
whose children were drawn largely from professional and managerial families, a
suburban school whose students came from laboring class and enlisted military
personnel families, and a county school which drew adolescents from lower
middle class and laboring class families. Ages ranged from 16 to 19 with no
mean age differences between sex or race.


Instrument 


The ICL (LaForge & Suczek, 1955) was used to obtain self- and
ideal-self-ratings. This instrument consists of 128 adjectives and phrases (for
example, forceful, usually gives in, considerate, sarcastic) with 16 items for
each of eight different kinds of interpersonal behavior. Dom and Lov raw scores
are derived for each rating by means of arithmetical formulae given in the
manual (Leary, 1956). These scores are then converted to standard scores with a
mean of 50 and a standard deviation of 10.

Separate tables are provided for the conversion of self- and
ideal-self-raw-scores into T scores. However, the present study utilized the
self-rating table for both sets of data as this provides a more accurate
indication of discrepancies between ideal and self-descriptions in individual
cases and as the derivation of T scores for the ideal ratings is based on
somewhat questionable assumptions and procedures.
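
The effect of using one conversion table for both ratings can be sketched with the usual linear T-score formula (mean 50, standard deviation 10). The manual’s actual tables are not reproduced here: the norm mean and standard deviation below are hypothetical, as are the function names and raw scores.

```python
def t_score(raw, norm_mean, norm_sd):
    """Convert a raw score to a T score (mean 50, SD 10) against given norms."""
    return 50 + 10 * (raw - norm_mean) / norm_sd


def ideal_self_discrepancy(self_raw, ideal_raw, norm_mean, norm_sd):
    """Ideal minus self, with both ratings converted against the same norms."""
    return t_score(ideal_raw, norm_mean, norm_sd) - t_score(self_raw, norm_mean, norm_sd)


# Hypothetical self-rating norms for a raw Dom score.
NORM_MEAN = 4.0
NORM_SD = 8.0

self_t = t_score(4.0, NORM_MEAN, NORM_SD)
ideal_t = t_score(12.0, NORM_MEAN, NORM_SD)
gap = ideal_self_discrepancy(4.0, 12.0, NORM_MEAN, NORM_SD)
```

Because both ratings pass through the same conversion, the discrepancy in T units is simply the raw difference rescaled, so it is not distorted by differences between two sets of norms.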


Social Class 


An occupational classification scheme by Schneider and Lysgaard (1953) was
applied to subjects’ parents in order to classify subjects with regard to the
social class factor. Four occupational classes are derived in terms of the
degree of supervisory control over “lower” occupations and independence of
supervisory control from “higher” occupations. The four classes are: (a)
independent occupations, (b) dependent occupations involving skill and
supervision or manipulation of others, (c) dependent occupations involving
skill but little supervision or manipulation of others, and (d) dependent
occupations involving little skill and little supervision or manipulation of
others. In addition, the present study employed an “unclassifiable” category
for those parents listed as deceased, living in another city with job
unspecified, or unemployed. The groupings were made, when possible, on the
basis of paternal occupation. In that minority of cases where the father’s
occupation was unclassifiable, the maternal occupational status was used for
classification purposes.


Procedure 


ICLs were administered by school counselors dur- 
ing special group testing periods in connection with 
another study (McDonald & Gynther, 1963). Each 
student was asked to rate self and ideal. In addition, 
each student was also instructed to list the occupa- 
tional status and job title, if known, of both parents 
on a form provided for this purpose. Two judges, the 
junior author and a second clinical psychologist, in- 
dependently rated the maternal and paternal occupa- 
tions in terms of the various classifications with a 
resulting 88% agreement. In cases of disagreement 
between these raters, the final rating was made by 
the senior author. 
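
The agreement figure and the tie-breaking rule described above can be illustrated with a small sketch. The ratings shown are invented for the example and are not the study’s data; the function names are likewise hypothetical.

```python
def percent_agreement(ratings_a, ratings_b):
    """Percentage of cases on which two judges gave the same classification."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("ratings must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(ratings_a, ratings_b) if a == b)
    return 100.0 * matches / len(ratings_a)


def final_rating(rating_a, rating_b, tiebreak_rating):
    """Keep an agreed rating; otherwise defer to a third rater."""
    return rating_a if rating_a == rating_b else tiebreak_rating


# Invented occupational class ratings (a-d) from two judges for eight parents.
judge_one = ["a", "b", "c", "d", "c", "b", "d", "a"]
judge_two = ["a", "b", "c", "d", "d", "b", "d", "a"]
agreement = percent_agreement(judge_one, judge_two)
resolved = final_rating(judge_one[4], judge_two[4], "c")
```

Simple percentage agreement does not correct for agreement expected by chance, so it tends to look more favorable when one category dominates the ratings.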

To provide sufficient numbers in each cell for sta- 
tistical analysis, it was necessary to combine Classes 
a and b which resulted in the use of only three 
socioeconomic categories. The number of subjects 
varied from cell to cell, ranging from 64 Class d Negro
females to 22 Class d white males. Due to the
unequal Ns in each cell, formulae provided by
Walker and Lev (1953) were used for the analyses
of variance.


RESULTS 


The major findings of this study are presented 
in two tables. Table 1 gives the F values from 
the analyses of variance for the four ICL vari- 
ables. Inspection of this table reveals that race 
and sex have a marked influence on one’s self- 
and ideal-self-concepts, whereas socioeconomic 
status does not affect these concepts. 


1 Seven tables presenting means, standard devia- 
tions, and analyses of variance for each ICL vari 
able have also been deposited with the American 
Documentation Institute. Order Document No. 8131 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
ton, D. C. 20540. Remit in advance $1.25 for micro- 


TABLE 1 ‘ 


F VALUES From ANALYSES or VARIANCE oF ICL ` 
VARIABLES BY RACE, SEX, AND SOCIAL CLASS 


x i Ideal 
Soutcess. Vaominaate A A io aes love 
Race (A) | 34.21*** | 17.41%" | 5.10" | 6.32" 
Sex (B) | 14.54" | 4.96" | 2413+ | 0.49 
Class (C) | 0.63 0.83 0.94 | 0.37 
AXB 8.71** | 2.48 1.60 1.00 
BXC 0.92 0.23 2.40 | 0.23 
AXC 1.90 0.93 0.34 | 0.67 
AXBXC} 1.05 1.27 0.32 1.58 
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TABLE 2 
MEANS AND STANDARD Deviations or ICL VARIABLES BY Sex AND Race 
Self-dominance Ideal dominance Self-love Ideal love 
M SD M SD M SD M SDE 
Negroes 57.94 7.38 61.32 6.09 53.84 7.19 55.00 7.57 
Whites 54.31 6.38 63.71 4.63 52.28 7.38 55.48 7.04 
Males 
Negro 60.36 6.17 62.37 6.35 52.39 7.24 53.94 7.88 
White 54.72 6.56 63.86 4.51 50.04 7.18 55.56 6.76 
Females 
Negro 56.18 7.65 60.56 5.72 54.91 6.88 55.78 7.17 
White 53.96 6.22 63.59 4.64.0 54.18 7.04 56.93 6.84 


Note.—Sgcioeconomic level results have been omitted as this variable did not approach significance on any analysis (cf. Table 1). 


Negro students’ self-descriptions yielded higher
scores on Dom (t = 19.10, p < .001) and Lov
(t = 13.53, p < .001) than did those of white
students. The white subjects’ ideal-self-descriptions,
on the other hand, yielded higher scores
on Dom (t = 11.90, p < .001) and Lov (t = 2.53,
p < .01) than those of the Negro subjects.

Male adolescents described themselves as
higher on Dom than their female counterparts
(t = 4.10, p < .01) while female subjects rated
themselves higher on the Lov variable than the
male subjects did (t = 5.52, p < .01). The same
pattern was found for the ideal descriptions
(t = 1.92, .05 < p < .10 for Dom; t = 4.00,
p < .01 for Lov).


The only significant interaction pertains to the
finding that Negro males obtained much higher
Dom scores than white males (t = 8.06, p < .001)
while Negro females’ Dom scores, although
higher than those of white females (t = 3.22,
p < .01), showed considerably less difference
between the means.

Table 2 presents the means and standard
deviations of the ICL variables by sex and race.
Inspection of these data shows that there is
much less discrepancy between self- and ideal-self-descriptions
of the Negro students than the
white students (t = 32.90, p < .001 for Dom;
t = 21.61, p < .001 for Lov). Comparisons of
self-ideal-self discrepancies by sex reveal less
discrepancy for the males on Dom (t = 7.82,
p < .01) while the reverse was found for Lov
(t = 9.08, p < .01).
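
The size of the self-ideal discrepancy for each race group can be read off the group means in Table 2. The Python sketch below is illustrative only: the dictionary layout and the function name `discrepancy` are assumptions introduced here, and it reproduces only the differences between means, not the t tests reported above.

```python
# Group means from Table 2 (both sexes combined).
# The data layout and all names are illustrative assumptions.
MEANS = {
    "Negro": {"self_dom": 57.94, "ideal_dom": 61.32,
              "self_lov": 53.84, "ideal_lov": 55.00},
    "White": {"self_dom": 54.31, "ideal_dom": 63.71,
              "self_lov": 52.28, "ideal_lov": 55.48},
}


def discrepancy(group: str, scale: str) -> float:
    """Ideal-self mean minus self mean for one group on one ICL scale."""
    row = MEANS[group]
    return round(row[f"ideal_{scale}"] - row[f"self_{scale}"], 2)


if __name__ == "__main__":
    for scale in ("dom", "lov"):
        for group in MEANS:
            print(group, scale, discrepancy(group, scale))
```

On both scales the difference between ideal and self means is smaller for the Negro group than for the white group, which is the direction of the result stated in the text.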


DISCUSSION 
Our negative findings for class are not consistent
with those of previous investigators who


film or $1.25 for photocopies and make checks payable
to: Chief, Photoduplication Service, Library of
Congress.


have demonstrated that social status affects self-descriptions.
This is not to say that previous
results have themselves been consistent; for
example, Klausner’s (1953) and Bieri and
Lobeck’s (1961) results regarding the relationship
between dominance and class membership
are quite contradictory. A possible explanation
of these discrepancies is that each study used a
different measure of social class. Two suggestions
for future work follow: first, that the relationships
between different classification methods be clarified
by, for example, using several different indices
on one sample; and second, that consensus be reached
on the procedure with the most satisfactory
reliability and validity coefficients, so
that later findings might be more comparable.

The results concerning the sex variable (i.e.,
boys higher on Dom, girls higher on Lov) conform
with the role expectations traditionally
associated with being a boy or a girl, at least 
in the white culture, although the studies cited 
by Wylie (1961) gave no clear-cut evidence 
related to this point. The similar findings for 
Negro adolescents might not have been as con- 
fidently predicted, considering the characteristic 
description of Negro culture as matriarchal. Of 
course, such conclusions have been principally 
based on observations of adult, not teen-age, 
interactions. 

If the present results concerning the sex
variable seem fairly obvious, the results involving
race would seem contrary to lay expectations,
as well as to Grossack’s (1957) findings
that Negro males and females obtain significantly
lower Edwards Personal Preference Schedule
Dominance scores than white normative samples.
That is, it would probably be anticipated that
whites would describe themselves as more dominant
than Negroes, whereas in fact the exact
opposite was found. Although one could give
sociological, psychodynamic, or cultural “explanations”
of this finding, the most important
next step would appear to be replication to
confirm or disconfirm the results.
This experiment used a special form of discrimination reaction time as a means
of comparing the ability of schizophrenic patients and normal Ss to maintain a
general set. 15 male schizophrenic patients and 10 normal male Ss were
presented with various series of groupings of red and yellow lights, so patterned
as to encourage the establishment of a subsidiary set that would interfere with
the general set given by the task instructions (responding to the yellow light).
It was predicted that this disturbance would be manifested in retarded reaction
times at certain points in the experiment, and that the schizophrenic
group would be subject to the greater disturbance. It was found that all Ss
showed significant retardations in reaction time as a result of the presumably
induced subsidiary set. The retardations of the schizophrenic Ss were greater
than those of the normal Ss in every series. A dichotomization between
hebephrenic and paranoid performance was also noted, the paranoid Ss more
closely approaching the normal.


Observations made during a series of experi- 
ments comparing the simple and the discrimina- 
tion reaction times of normal and schizophrenic 
subjects (Huston, Shakow, & Riggs, 1937, pp. 
40-57) suggested that the distinctly longer 
schizophrenic reaction times might in considerable
part be attributed to what we interpreted
as a difficulty in maintaining a major set. The
present discrimination reaction experiment ¹ was


1 Another aspect of the set problem was investigated
in a reaction-time experiment that dealt
with the preparatory interval (Huston, Shakow, &
Riggs, 1937, pp. 57-79).
designed to investigate one aspect of this question
more fully.²

The major interest of the present study was
the ability of schizophrenic subjects to maintain
a generalized set under conditions favoring the
establishment of other, minor and interfering,
sets. The general pattern of the experiment was
that of presenting various recurring sequences
of red and yellow lights to subjects instructed
to react only to the yellow light. The sequence
patterns were designed to encourage the expectation
that the given pattern would continue. These
minor sets were presumably to be revealed by
the appearance of lengthened reaction times at
certain crucial points in the series: when a
yellow light appeared unpreceded by a red one.


An example will make the experimental situation
clearer. The subject is given the general instruction
to respond as quickly as possible to a
yellow light. A series of stimuli are presented:
warning buzz, red light, yellow light; warning,
red, yellow; warning, red, yellow. If the three
consecutive red-yellow combinations are followed
by another warning and then directly by a yellow
light, a longer reaction time to this last yellow
light than to the immediately preceding yellow
(of the third red-yellow pair) might be expected.
The hypothesis was that the repeated pattern
would lead the subject to “expect” a red stimulus
before the last yellow. As a result of this interfering
set, the subject would find it difficult to
maintain the instructed general set to respond
as quickly as possible to the yellow light. Being
unprepared for the appearance of the yellow
light, he would be delayed in his response. If,
however, the general set was maintained effectively,
it would be manifested in no appreciable
effect on the last reaction. On the basis of our
earlier experiment and on general theoretical

2 This study was carried out at the Worcester
State Hospital in 1936 and reported briefly at the
APA meetings at Dartmouth, September 1936. Because
it deals with a quite different phase of the
general problem of “set” in schizophrenia on which
much has since been written, it is now published in
full.

3 The actual instructions were:
This is an experiment to see how quickly you
can react. Press with your forefinger on the key
and direct your attention to a yellow light which 
will appear in the box in front of you. When it 
appears, lift your finger as rapidly as possible. 
Remember you are to watch closely for the light 
and lift your finger from the key as quickly as 
you can when the light appears. When you hear 
this warning signal press down on the key with 
your forefinger and when the light appears lift 
your finger. 
grounds, it was predicted that the differential
would be greater in the schizophrenic than in the 
normal subjects. It was further hypothesized that 
as the red-yellow pairs increased in number, the 
retardation effect on the reaction to the im- 
mediately following yellow would be greater. 


METHOD AND PROCEDURE 
Apparatus 


The apparatus consisted of a device for producing 
the stimuli, a chronoscope, a viewing box, and a 
response key. The device for producing the stimuli, 
in addition to permitting their being presented auto- 
matically, also permitted the control of the exposure 
time and the length of the intervals between the 
successive stimuli. It consisted of a punched roll 
of paper, similar in principle to a player piano roll, 
carried on a kymograph by a constant-speed motor
at the rate of .25 inch per second. The holes punched
in the paper allowed for the completion of circuits
by roller brushes to give a warning signal, a red
light, and a yellow light. The pattern of arrangement
of the holes, and therefore of the stimuli, will be
discussed shortly. Reaction time was recorded in
milliseconds by a magnetic-clutch type of chronoscope
driven by a synchronous motor. Previous
calibration of a similar chronoscope with an oscillograph
indicated an average error of ±1 millisecond.
In addition to the automatic device, keys
were available which permitted the manual manipulation
of the stimulation apparatus. (The manual
devices were used only at the beginning of the ex- 
periment for a series of practice reactions—five re- 
actions to a yellow light—and on those relatively 
rare occasions during the course of the experiment 
when the subject for some reason did not carry out 
the instructions. In the latter case the apparatus 
was stopped, the trial carried out manually, and 
the experiment resumed with the automatic device.) 

The only apparatus visible to the subject was a 
single telegraph key partially enclosed in a black
box and a second black box directly above it with 
a 2-inch circular opening through which the stimulus 
lights were exposed. The warning signal was provided 
by a magnetic buzzer, the intensity of which was 
maintained at a constant, moderate level. 

The main advantage of this apparatus is that it 
provides constant time relationships for all subjects 
in respect to lengths of presentation, intervals
between stimuli, and periods between warning
intervals.⁴


Definitions 


The following definitions will facilitate understand- 
ing the experimental procedure. By stimulus is meant 


4 The apparatus does not, however, control for
the length of time between response and the next
stimulus, since this is in part determined by the
subject’s reaction time. However, these were proportionately
far enough apart to allow the subject to
prepare for the next stimulus.
a red (R) or yellow (Y) light. A pair (of stimuli)
was a red stimulus followed by a yellow stimulus 
(RY). A trial was a single stimulus or group of 
stimuli of varying complexity; these constituted the 
fundamental unit of the experiment. Examples of 
the latter are: Y; RY, RY, RY, Y. A series was a 
definite number of trials (usually 10) using either 
stimulus groupings of the same complexity (regular 
method), or of mixed and varied degrees of com- 
plexity (random method). 

Six series, in three kinds of sequences, of gradually 
increasing complexity were used in this experiment. 


Control Sequence 


Y series. Ten trials using only a yellow light,
serving for the determination of the simple reaction-time
level of the subjects. The stimuli were so spaced
that the warning signals came at intervals of
10.5-11.5 seconds. The trial sequence was then:
$W_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$.⁵

RY series. Ten trials of pairs of red and yellow
lights. The warning signal was followed by a red
light after an interval of 1.5-2.5 seconds. After a
similar interval a yellow light was presented. The
time between warning intervals in this series was
13-15 seconds. The order here was:
$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$.
This control series was given to
determine the speed of response in what might be
considered the simplest discrimination situation.

Both of the above “controls” also served to
acclimate the subject to the experimental sequences 
which followed. 


Experimental Sequence—Regular Method 


RY-Y (1RY-Y) series. A series of 10 trials, each
trial consisting of one red-yellow pair followed by a
yellow stimulus. The warning signal was followed
by a red light after the usual interval of 1.5-2.5
seconds, and then by a yellow light after a similar
interval. Five seconds later a warning signal was
again given, followed, after the warning interval,
by a yellow light. Between the warning signals of
successive trials there was a period of 23.5-26.5
seconds. The sequence in this series was:


$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$


In this series 10 R and 20 Y stimuli were presented. 


5 Read as follows: Warning 1 second, interval of 
1.5-2.5 seconds, yellow stimulus on for a maximum 
of 3 seconds, and 5 seconds devoted to the interval 
before the next repetition of the series. (The yellow 
light is on, of course, only for the length of the 
reaction, but never longer than 3 seconds. Any time
less than the 3 seconds merely increases the length 
of the succeeding interval by that much.) 

A preparatory interval of the range 1.5-2.5 seconds 
was used throughout the experiment since this 
length of interval was the one in which normal and 
schizophrenic subjects came closest to each other in 
our earlier experiments. 
RY-RY-RY-Y (3RY-Y) series. A 10-trial series
in which a trial consisted of three red-yellow pairs
followed by a single yellow stimulus. A trial sequence
in the 3RY-Y was:


$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$


The intervals between trials thus ran from 49.5 to
56.5 seconds. Thirty red stimuli and 40 yellow stimuli
were presented in this series.

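RY-RY-RY-RY-RY-Y (5RY-Y) series. Again a
series of 10 trials. Five red-yellow pairs succeeded
by a single yellow stimulus constituted a trial. The
order in this series was:


$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : R_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$
$W_{1} : I_{1.5-2.5} : Y_{3} : I_{5} ::$


The intertrial interval here was from 75.5 to 86.5
seconds. Fifty red and 60 yellow stimuli were given
in this series.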
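
The stated intertrial intervals follow from the component durations given in the sequence notation. The sketch below is a rough check, not part of the original procedure; the constant and function names are assumptions, and the yellow stimulus is counted at its full 3-second window because any unused time is added to the succeeding interval.

```python
# Component durations in seconds, taken from the sequence notation:
# warning 1, preparatory interval 1.5-2.5, red 1, yellow window 3,
# and 5 before the next warning. Names are illustrative assumptions.
WARNING, RED, YELLOW_WINDOW, GAP = 1.0, 1.0, 3.0, 5.0
PREP_MIN, PREP_MAX = 1.5, 2.5


def unit_duration(has_red: bool, prep: float) -> float:
    """Warning-to-warning length of one RY pair or one single Y."""
    total = WARNING + prep + YELLOW_WINDOW + GAP
    if has_red:
        total += RED + prep  # red light plus a second preparatory interval
    return total


def trial_range(n_pairs: int) -> tuple[float, float]:
    """Shortest and longest length of a trial of n_pairs RY pairs plus one Y."""
    low = n_pairs * unit_duration(True, PREP_MIN) + unit_duration(False, PREP_MIN)
    high = n_pairs * unit_duration(True, PREP_MAX) + unit_duration(False, PREP_MAX)
    return low, high


if __name__ == "__main__":
    for n in (1, 3, 5):
        print(f"{n}RY-Y:", trial_range(n))
```

With these durations the three regular series come out at 23.5-26.5, 49.5-56.5, and 75.5-86.5 seconds, in agreement with the figures given for each series.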


Experimental Sequence—Random Method 


Mixed series. A series containing five of each of 
the different kinds of trials used in the three last 
described series (1RY-Y, 3RY-Y, and 5RY-Y).
The trials were arranged in a “random” order in 
which each type of trial was preceded at least once 
and at most twice by every other type of trial. The 
same time relationships were kept within trials and 
between trials as in the other series. Altogether this 
series consisted of 45 red and 60 yellow stimuli. 

Rest periods of approximately 4 minutes were 
provided after each series and halfway through each 
of the 3RY-Y, 5RY-Y, and mixed series. The whole
experimental session lasted approximately 75 minutes. 
During this time, each subject was presented with 
145 R and 200 Y stimuli. 
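
The session totals can be checked by tallying stimuli series by series. This is a minimal sketch under the trial definitions given above; the `SERIES` table and the `counts` helper are names assumed here for illustration.

```python
# Number of RY pairs in each trial of each series. Every trial except those
# of the RY control series also contains one unpaired yellow stimulus.
# The table layout and names are illustrative assumptions.
SERIES = {
    "Y": [0] * 10,
    "RY": [1] * 10,
    "1RY-Y": [1] * 10,
    "3RY-Y": [3] * 10,
    "5RY-Y": [5] * 10,
    "mixed": [1] * 5 + [3] * 5 + [5] * 5,
}


def counts(name: str) -> tuple[int, int]:
    """Return (red, yellow) stimulus counts for one series."""
    pairs = sum(SERIES[name])
    unpaired_yellow = 0 if name == "RY" else len(SERIES[name])
    return pairs, pairs + unpaired_yellow


if __name__ == "__main__":
    total_red = sum(counts(name)[0] for name in SERIES)
    total_yellow = sum(counts(name)[1] for name in SERIES)
    for name in SERIES:
        print(name, counts(name))
    print("session totals:", total_red, total_yellow)
```

The per-series counts match those given in the series descriptions, and the totals come to 145 red and 200 yellow stimuli.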


Population 


Fifteen male schizophrenic patients, all but 3 of
whom fell into the paranoid and hebephrenic subtypes,
and 10 normal male subjects participated in
the study. The normal subjects were WPA workers
having a mean age of 31.3 and a mean educational
level of 11.5 years. The patients had a mean age
of 36.7 and a mean educational level of 8.9 years.
The mean hospitalization age for the patient group
was 9.9 years. Only those patients giving good
cooperation and evincing an interest in the experiment
were used.


RESULTS 
Table 1 gives the mean reaction times for the
RY pairs preceding the single Y stimulus, those
for the single Y trials, the differences between
TABLE 1

MEAN REACTION TIME IN MILLISECONDS FOR RY TRIALS PRECEDING SINGLE Y, FOR THE SINGLE Y TRIALS AND
THE DIFFERENCES BETWEEN THEM, AND SIGNIFICANCE OF DIFFERENCES (ANALYSIS OF VARIANCE)
FOR SCHIZOPHRENIC (N = 15) AND NORMAL (N = 10) SUBJECTS

                                     Regular method                        Random method
                              All     1RY-Y    3RY-Y    5RY-Y      All     1RY-Y    3RY-Y    5RY-Y
Mean RT RY preceding Y
  S                          632.0    584.9    609.4    701.7     634.2    651.8    661.7    589.2
  N                          313.6    333.4    315.8    291.5     297.6    276.4    285.6    330.8
Mean RT Y following RY
  S                          866.9    881.1    860.7    859.0     857.5    850.5    807.1    915.0
  N                          389.2    410.9    382.9    373.7     406.8    428.0    417.8    374.6
Mean increase in RT
  S                          234.9**  296.2**  251.3**  157.3**   223.3**  198.7**  145.4**  325.7**
  N                           75.6**   77.5**   67.1**   82.2**   109.2**  151.6**  132.2**   43.8*
Significance of differences
between groups (patients
versus normals)                *        *        *                  *                          *

* .05 > p > .01.
** p < .01.


these means within each group, and the differences
between the two groups. Figure 1 presents
these data graphically, and in addition gives the
means for the two “control” sessions.

The first point to be made is that the differences
between all reaction-time means of schizophrenic
and normal subjects were significant, corroborating
previous studies (Huston et al., 1937,
pp. 40-57). As predicted, all reaction times to
the single Y stimulus following the RY pairs
were significantly longer than the reaction times
to the yellow stimulus of the last RY pairs. This
effect (designated as the “retardation” effect)
held for each of the series taken separately, as
well as for the combined means of each of the
regular and random series. The hypothesis of
FIG. 2. Increments of reaction time in Y trials over RY trials.


increasing retardation with longer succession of
RY preceding the final Y was not borne out.
Retardation did result, but the effects did not
vary systematically. Apparently the variation
occurred for more complicated reasons than
originally postulated. Further, all of the retardation
effects were greater in every single series
for the schizophrenic subjects. These latter differences
were significant in the combined regular
series, and the combined random series, in the
1RY-Y and 3RY-Y of the regular sequence, and
in the 5RY-Y of the random sequence.
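
The retardation effect is simply the mean reaction time to the single Y minus the mean reaction time to the Y of the preceding RY pair. The sketch below recomputes it from the means in Table 1; the dictionary and function names are assumptions, and because it works from the rounded means it can differ from the tabled increases by 0.1 millisecond.

```python
# Mean reaction times (msec) from Table 1, ordered as
# regular 1RY-Y, 3RY-Y, 5RY-Y, then random 1RY-Y, 3RY-Y, 5RY-Y.
# "S" = schizophrenic group, "N" = normal group. Names are assumptions.
RY_BEFORE = {
    "S": [584.9, 609.4, 701.7, 651.8, 661.7, 589.2],
    "N": [333.4, 315.8, 291.5, 276.4, 285.6, 330.8],
}
Y_AFTER = {
    "S": [881.1, 860.7, 859.0, 850.5, 807.1, 915.0],
    "N": [410.9, 382.9, 373.7, 428.0, 417.8, 374.6],
}


def retardation(group: str) -> list[float]:
    """Increase in mean RT on the single Y over the preceding RY, per series."""
    return [round(y - ry, 1) for ry, y in zip(RY_BEFORE[group], Y_AFTER[group])]


if __name__ == "__main__":
    print("S:", retardation("S"))
    print("N:", retardation("N"))
```

In every one of the six series the recomputed increase is larger for the schizophrenic group than for the normal group, and no steady rise from 1RY-Y to 5RY-Y appears in either group.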
The retardation effects can be seen in Figure 1,
indicated by the bracketed sections above the
horizontal lines on each bar. They are perhaps
shown most clearly in Figure 2. It will be noted
that there was a marked difference between the
regular and random means of schizophrenic retardation
for all series. The regular sequence
showed greater retardation than the random for
the 1RY-Ys and the 3RY-Ys, with the opposite
effect for the 5RY-Ys. For the normal subjects
the differences were in the opposite direction,
the random being greater than the regular in
the first two series, this difference being reversed
in the 5RY-Ys. (The intergroup differences between
these random and control series were not
as pronounced for the normal as for the schizophrenic
subjects.) A comparison of the two
subject groups consequently reveals a particularly
marked difference between the regular series
in the 1RY-Y and 3RY-Y, and a lesser difference
in the 5RY-Y trials. In the random series
the differences were quite minor in the 1RY-Y
and 3RY-Y but very marked in the 5RY-Y trials;
in fact here lies the greatest difference in any
of the retardation comparisons.
Figure 3 gives the distribution of the mean
reaction times of the individual subjects of the
experiment to the Y stimuli following on the
last RY of the various series. It will be seen
that the schizophrenic and normal distributions
are quite different in both scale position and
range. Indeed, in two series, 3RY-Y regular and
5RY-Y random, there is an overlap of only one
case. This patient, M, who fell at the level of
the normal, was a very well-preserved paranoid
patient who was carrying a responsible job in
the hospital. We shall discuss the distribution
within the schizophrenic group below.


DISCUSSION


Whereas a previous experiment on preparatory
intervals (Huston et al., 1937, pp. 57-79),
designed to test the maintenance-of-set
hypothesis, depended entirely on “internal” preoccupations
or chance distractors as sources of
interference with the maintenance of set, the
present experiment was designed deliberately to
introduce an external “distractor”: the RY
coupling. The repeated coupling of a yellow
stimulus to an immediately preceding red stimulus
was intended to induce an expectation that
would interfere with the major set called for
by the instructions, that of responding to the
yellow stimulus. (This does not of course mean
that preoccupations and chance distractions did
not also have some opportunity to influence the
results.)

This technique turned out to be a powerful
tool in affecting set. In every series the procedure
resulted in a significant retardation in the
mean reaction times to the single Y for both
the schizophrenic and normal subjects. But the
important result for us lies in the great difference
in the degree to which this effect occurred
in the two subject groups. The schizophrenic,
even at his already slow level, proved
to be considerably more affected by this procedure
than the normal group.

Although the predicted schizophrenic-normal
differences occurred throughout, the most marked
and significant retardation differences were found
in the first two series of the regular, and in the
5RY-Y trials of the random. The divergence
between the schizophrenic and the normal subjects
was particularly marked in the last.

Why did this latter striking effect appear? In
the random sequence, the subject had to prepare
himself to react to a 1, 3, or 5RY-Y trial each
time the warning buzzer sounded. The subject
could never anticipate when he would be dealing
with the 1RY-Y situation. However, given two
RY pairs in a trial, either 1 or 3 more RY pairs
were to be expected, and he could prepare himself
for either a 3 or a 5RY-Y situation. If 4
successive RY pairs occurred it was inevitable
that the trial was to be 5RY-Y. The normal
subject was able to take advantage of the knowledge
he had gained in the course of the experiment,
and prepare for the single Y that followed
the next RY pair. In this situation, on the contrary,
the schizophrenic subject seemed to “go
to pieces,” even at his own poor level of performance,
and therefore gave his longest reaction.
The random series appeared to prove too
complicated for him. He was not able to figure
out the situation as the normal subject had done,
as was reflected in his rather poor performance
in this series.
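
The information available to a subject in the random series can be stated compactly: after a given number of RY pairs, only some trial types remain possible. The sketch below illustrates that reasoning only; the function names are assumptions and nothing here models reaction time.

```python
# Trial types in the random series, named by the number of RY pairs that
# precede the final single Y. All names are illustrative assumptions.
TRIAL_TYPES = (1, 3, 5)


def possible_types(pairs_seen: int) -> list[int]:
    """Trial types still consistent with having seen pairs_seen RY pairs."""
    return [n for n in TRIAL_TYPES if n >= pairs_seen]


def next_pair_is_last(pairs_seen: int) -> bool:
    """True when the coming RY pair must be the final one before the single Y."""
    remaining = possible_types(pairs_seen + 1)
    return remaining == [pairs_seen + 1]


if __name__ == "__main__":
    for seen in range(1, 6):
        print(seen, possible_types(seen))
```

After one pair all three types remain possible, after two pairs only the 3RY-Y and 5RY-Y types remain, and after four pairs the trial can only be 5RY-Y, so the single Y after the fifth pair is fully predictable.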

An interesting phenomenon corroborating our 
analysis occurred with the normal subject in 
this series. Although the normal subject showed 
some disturbance of the major set as a result 
of the experimental procedure, he seemed to 
introduce a subsidiary set that was helpful in 
minimizing the disturbance. It appeared that the 
subject decided for himself that the experi- 
menter intended the single Y stimulus to be 
the most crucial part of a trial. He consequently 
came to regard the Y paired with the R as 
relatively less important. He evidently instructed 
himself here to “beat the game” and achieve a 
low score for the final yellow, even at the cost
of some slowing of reaction time to the previous 
RY pair. 

This seems confirmed by more detailed analysis
of the ranked results of all the items in the
random series. The rankings in the 3RY-Y and
5RY-Y involved arranging reaction times to
the RY pairs in each trial according to length
and giving the highest ranking to the longest
reaction. In the 3RY-Y and 5RY-Y trials for
the normal the longest mean reaction time for
the RY pairs occurred without exception on the
third pair (Rank 3) of the 3RY-Y random trials,
and on the fifth pair (Rank 5) of the 5RY-Ys.
The patients, on the other hand, had, in the
3RY-Y trials: 1 in Rank 1, 2 in Rank 2, and
2 in Rank 3. In the 5RY-Y trials their longest
reaction times fell: 1 in Rank 1, 2 in Rank 2,
1 in Rank 3, 0 in Rank 4, and 1 in Rank 5.
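
One reading of this tally is that, for each of the five trials of a given type, the position of the RY pair with the longest mean reaction time is recorded and the positions are then counted. The sketch below follows that reading, which is an interpretation rather than the authors' stated computation; the reaction times in it are invented purely for illustration and are not data from the study.

```python
from collections import Counter


def longest_position(pair_rts: list[float]) -> int:
    """1-based position of the longest reaction time among a trial's RY pairs."""
    return max(range(len(pair_rts)), key=lambda i: pair_rts[i]) + 1


def tally(trials: list[list[float]]) -> list[int]:
    """Count how many trials had their longest RY reaction at each position."""
    found = Counter(longest_position(t) for t in trials)
    return [found.get(pos, 0) for pos in range(1, len(trials[0]) + 1)]


# Hypothetical group-mean RTs (msec) for five 3RY-Y trials: invented values.
GRADIENT_LIKE = [[300, 305, 340], [310, 300, 335], [295, 310, 350],
                 [305, 315, 345], [300, 298, 330]]
SCATTERED = [[640, 600, 610], [590, 650, 600], [600, 660, 620],
             [610, 600, 655], [605, 615, 670]]

if __name__ == "__main__":
    print(tally(GRADIENT_LIKE))
    print(tally(SCATTERED))
```

The first invented set gives the pattern described for the normal subjects, with every longest reaction on the last pair; the second gives a scattered tally of the kind described for the patients.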

The performance of the normal subjects on
the 1RY-Y random trials, where no prediction
was possible, failed to show the pattern that
appeared on their 3 and 5RY-Y trials. Here
the normal means ranked: 1 in Rank 1, 0 in
Rank 2, 2 in Rank 3, 1 in Rank 4, and 1 in
Rank 5. The distribution of the schizophrenic
reaction times, on the other hand, appeared to
follow their previous pattern, this time being
relatively similar to that of the normals: 0 in
Rank 1, 1 in Rank 2, 1 in Rank 3, 1 in Rank 4,
and 2 in Rank 5. (For the 1RY-Y trials the
rank order was determined somewhat differently,
by distance of the five RY pairs from their
respective Y reaction times. For the normals the
distance between the shortest and longest of
these reaction times was divided into five equal
intervals. The distances from the preceding RY
pairs for each Y were then placed (ranked) in
the appropriate interval, the shortest being given
the lowest rank. For the schizophrenic subjects
the above intervals were doubled, then the distances
between the Ys and their RY pairs were
ranked in the already established intervals.)

Thus we found in the normal subject a phe- 
nomenon resembling a goal gradient. When he 
came close to a predictable, or at least a par- 
tially predictable, goal his reaction time to the 
RY increased, apparently in order to be prepared 
to react more effectively to the goal stimulus 
of that trial—the single Y. This phenomenon 
did not appear in the patients. Their reactions 
appeared, on the contrary, to distribute them- 
selves at random. 

As previously indicated, the distribution of the
two groups differed in range as well as in scale
position. From Figure 3 it will be noted that
the normal group was quite compact, whereas
the schizophrenic distribution had a wide spread.
Analysis of this range in terms of schizophrenic
subtype proved interesting. The paranoid patients,
B, C, E, H, I, K, and M, fell toward
the normal end of the distribution, whereas the
hebephrenic patients, A, G, J, N, and O, fell, with
one exception, toward the higher end of the
scale,⁶ showing the greatest retardation in their
reaction times to the single yellow. The single
exception, O, was one of the unusual hebephrenic
patients who had at various times been diagnosed
simple and catatonic in type as well; but the
diagnosis of “hebephrenic” seemed justified on
the basis of his symptomatology. In general
demeanor, however, he resembled the paranoid
type of patient.

A question might arise about the inclusion
of Subject N who appeared so far out of the
distribution of the group on practically all
measures. He was an easy-going, pleasant,
friendly person who was quite “willing,” if not
too understanding, in carrying out the task.
Since his diagnosis was not in doubt and his
cooperation was at least “fair,” there was little
justification for excluding him.⁷

The general results of the present study, then,
confirm the hypothesis that the discrepancy between
the reaction time of schizophrenic patients
and normal subjects is significantly related to
their respective ability to maintain a general set.
The special form of discrimination reaction time
used in this experiment has proved to be an
effective tool for dealing with the problem of
set. The results obtained here have confirmed
the findings of a previous experiment dealing
with set through the preparatory interval
(Huston et al., 1937, pp. 57-79), findings subsequently
corroborated by a series of similar
studies (Rodnick & Shakow, 1940; Rosenthal,
Lawlor, Zahn, & Shakow, 1960; Zahn, Rosenthal,
& Shakow, 1961, 1963; Zahn, Shakow, &
Rosenthal, 1961).


6 For a more detailed account of other facets of
the hebephrenic-paranoid dichotomy, see Shakow
(1963).

7 See Shakow (1963) and Shakow and Huston
(1936), respectively, for a description of the precautions
taken in relation to the schizophrenia and
schizophrenia subtype diagnosis, and the cooperation
(attitude) criterion.
ABILITY AND AFFILIATION


JEROME E. SINGER AND VERNON L. SHOCKLEY
Pennsylvania State University


In a test of the hypothesis that uncertainty in the evaluation of an ability leads
to affiliation for social comparative purposes, 39 female students were randomly
assigned to 2 conditions: those who had information concerning an ability
and those who did not. Ss in both groups were given the choice of either
waiting alone or affiliating with others. Those Ss who had no information
affiliated to a significantly greater extent. 2 subsidiary considerations also were
tested. One concerned the relationship between the nature of the information
and affiliation; the other was with respect to ordinal position of birth and
affiliation.


Festinger (1950), Schachter (1951), and
others investigated the concept of social reality:
the use of other people for validation and comparison
of those opinions, attitudes, and beliefs
for which there exist neither actual nor feasible
physical referents. The idea of social reality was
expanded into the notion of social comparison
processes (Festinger, 1954) which considered
not only the existence of such social reality,
but, in addition, postulated the occurrence of a
drive for self-evaluation. That is, within the
human organism there exists a need for the
measurement and the comparison of abilities and
opinions with some standard. Using this social
evaluation formulation, Festinger reinterpreted
the experimental literature dealing with pressures
toward uniformity as providing evidence
for the occurrence of opinion change via the
social comparison processes. Much of the experimental
work specifically generated by the social
comparison processes theory, however, was
directed toward demonstrating the operation of
these processes in the evaluation of abilities
(Dreyer, 1954; Festinger, Torrey, & Willerman,
1954; Hoffman, Festinger, & Lawrence, 1954).


Schachter (1959) dealing with the psychology 
of affiliation further extended the concepts of 
social evaluation and comparison. In a co- 
ordinated series of experiments he has shown 
that subjects, faced with an anxiety-producing 
situation which was both novel, and by their 
usual standards, unevaluable, affiliated with other 
subjects in similar plights, in order for both to 
verify the presence of and to evaluate the extent 
of their anxiety. Wrightsman (1960) has shown
that such affiliation, when it follows
the arousal of anxiety under ambiguous circumstances,
does in fact lead to an evaluation of
that anxiety. In further related work, Radloff
(1961) has shown that subjects will affiliate to
evaluate an ambiguous opinion, and Schachter
and Singer (1962) have shown that evaluation
of emotions is not restricted to anxiety, but is
applicable to a wide range of emotional phenomena.
Although Festinger’s (1954) social comparison
processes formulation was concerned in
good part with abilities, the studies of affiliation
have focused only on emotions and
opinions. There is not, as yet, any direct evidence
that people, displaying an ability for which they
have no physical referent, will, in fact, affiliate,
let alone do so for evaluation.

The straightforward derivation can be made 
that people who have an ability which they are 
not able to evaluate, that is, the ability is one 
for which they do not possess any normative 
information, will utilize social comparison
(comparison with others having a similar ability),
and will seek affiliation in order to activate such
comparison processes. Others who demonstrate
a similar ability, but who do possess normative
information, will feel no need for social comparison,
and, hence, will display less tendency to
affiliate.

Not all information, however, is of equal 
utility. The social comparison processes can only 
function when a true comparison is possible. If a 
person is comparing himself with people whose 
abilities are greatly divergent from his, he will 
not be able to make an accurate evaluation of 
his abilities with respect to them. In a study 
by Dreyer (1954), boys whose level of ability 
was much superior or inferior to that of their 
peers were not able to make as stable a self- 
evaluation as boys whose performance was aver- 
age. This suggests that if people receive informa- 
tion concerning their abilities, they may still 
desire affiliation to secure evaluation if the in- 
formation is too discrepant for this purpose. 
This need for evaluative affiliation as the information
gets more discrepant should not
increase symmetrically, for Festinger’s (1954)
formulation of the social comparison processes 
states that people evaluating abilities exhibit a 
culturally established unidirectional upward push. 
They desire to be somewhat better with respect 
to the abilities under comparison than their 
peers. Therefore, information indicating superior 
status should provide a more stable evaluation 
than information pointing to an equally dis- 
crepant inferior status. If the information value 
of the “superior” information is large relative 
to the “inferior,” superior informed people may 
have less pressure to affiliate in order to obtain 
self-evaluation. 

It has been shown that the ordinal position 
of birth is a factor in the affiliation processes. 
First- and only-born subjects in contrast to later- 
born ones were more susceptible to anxiety when 
placed in a fear-arousing situation and also ex- 
hibited greater affiliative tendencies (Schachter, 
1959). This result, greater affiliative tendencies 
on the part of first and only borns, has been con- 
firmed by several investigators (Capra & Dittes, 
1962; Radloff, 1961; Sarnoff & Zimbardo, 1961; 
Wrightsman, 1960). This leads to the prediction 
that under similarly appropriate circumstances,
first and only borns will affiliate more than later
borns.

METHOD 


Overview of the Experiment 


Two groups of subjects were used; each subject 
performed an experimental task. Those in one group 
were given no indication of the quality of their 
performances; those in the other group were given
reference data. Subjects in the second group were
further subdivided according to whether the
information presented to them indicated that their
performances had been superior, average, or inferior;
both groups contained first- and later-born subjects.
Each subject was given the option of affiliating.


Subjects 


Thirty-nine female students, mostly freshmen and
sophomores, from the introductory psychology
classes at the Pennsylvania State University served
as subjects in the experiment; all students in the
introductory classes participate in experiments in
satisfaction of course requirements. The students
were randomly assigned to the experimental
conditions. Fourteen subjects were in the no-information
condition and 25 in the information condition: 8 in the
superior subcondition, 8 in the average, and 9 in
the inferior. One subject in the average subcondition
filled out the forms incorrectly and, consequently,
her data are not included.


General Instructions 


The study was conducted in the testing rooms
of the Pennsylvania State University Psychological
Clinic. Each subject was individually scheduled, was
met by the experimenter in the waiting room, and
was taken to an individual testing room where
she was given an explanation and rationale for the
experimental task. Each subject was told that she
was to participate in a validating study of the
“Video-Somatic Background Test.” The test was
described as one which the Armed Forces had
found useful in selecting bombardiers and camouflage
experts. The subjects were then informed that
recently many civilian uses, for example, rescue work,
had been found for the test and it was necessary
to secure a representative sample of female norms.
The experimenter expanded on the task description,
stressing the peacetime uses of the task and the
need for knowledge about female performances.

After the introductory explanation, subjects were
told of the need for biographical information, and
an ordinal position questionnaire was administered.
Then the experimenter continued with the
instructions.

All subjects were given a form of the Witkin
Embedded Figures Test. The experimenter kept a
timed record of their performances, and, when the
task was completed, under the guise of adjusting
for unequal difficulty of the problems, gave the
subject, irrespective of her actual performance, an
“adjusted” score of 64. The appropriate information
manipulations were then performed, followed by the
administration of a second questionnaire. The
experimenter then told the subject that there would be a
15-20 minute delay before the start of the second
half of the experiment. The subject was informed
that she could either wait alone or together with
several other subjects who had completed half of
the experiment and, like her, were waiting for the
second half.

The subjects were given the affiliation choice
forms, and as soon as they had completed them,
the experiment was ended and the deception and the
true purpose of the experiment were explained.


Independent Variable 


Immediately after the completion of the task,
subjects were given the appropriate information
instructions. Those in the no-information condition
were informed that their score was uninterpretable
as no normative data were available.

Subjects in the information condition were given
their “scores” and were shown a bar graph with a
distribution of scores. Subjects in the superior,
average, and inferior subconditions were shown
different graphs accompanied by an explanation
from the experimenter of their
scores in comparison with the “norms.” Superior
subjects were told they were in the top 5% of the
distribution; average subjects, that they were
between the 40 and 66 percentiles; and inferior
subjects, that they were in the bottom 8%.


Measures 


The ordinal position questionnaire was a single 
page form containing questions which elicited the 
number and ages of the subjects’ sibs. 

The measure of the efficacy of the manipulations
was achieved by asking the subjects to rate their
performance on a 6-point rating scale which asked,
“Compared with female students at Penn State,
how well do you think you did on the test?” The
scale positions ranged from “very much worse than
average” to “very much better than average.” On
6-point scales of similar format, subjects answered
the following questions:


How confident are you that you have actually
estimated your performance?
How interested are you in your exact results
on the test?
How interested are you in the results of this
study?
How anxious or uneasy do you feel right now?


The scales were scored so that the higher the 
number, the greater was the performance estimate, 
the more confident was the judgment, the more 
anxious was the report, etc. 

TABLE 1
SUBJECTS’ AFFILIATION CHOICES

                        Preference check                     Affiliation
                                                             intensity scale
                   Number of       Number of
Condition          subjects        subjects choosing
                   choosing        alone or                  M        SD
                   with others     no preference
No information         6               8                     60.43    18.00
Information            2              22                     50.13    11.70
                   Fisher exact test, p = .017               t = 2.15, p < .05

Note.—All t tests are two-tailed.


Affiliation was measured with a two-question form.
The first was a preference check:


Please indicate below whether you prefer waiting
your turn to be tested, alone or in the
company of others.

1. I prefer waiting alone.

2. I have no preference.

3. I prefer waiting with others.

This was followed by an intensity of affiliation rating 
scale, a form headed by the question, “How strongly 
do you feel about your choice? Indicate this by 
placing a check at the appropriate spot on the scale 
below.” The intensity scale was a 100-millimeter 
line anchored at one end with the phrase, “I very 
much prefer to wait alone,” and at the opposite, 
“I very much prefer to wait with others.” It was 
scored by measuring how far from the end the 
subject placed her check mark. The greater the 
preference to wait with others, the higher the score. 
In addition, subjects were asked to give, in free 
response form, the reasons for their preferences. 


RESULTS 


The instructions which attempted to manipu- 
late the subjects’ cognitions about their Video- 
Somatic ability were successful. While subjects 
in the no-information condition did not differ 
from their counterparts in the information con- 
dition in the estimates of their performances 
(t = .38), they had considerably less confidence
in their estimates (t = -1.94), a result indicating
that the no-information subjects had not
firmly evaluated their performances.

Given that the no-information subjects were
more unsure of their estimates of their Video-Somatic
ability, they clearly affiliated to a
greater extent than the information subjects.
On both the preference check and the intensity
scale, they evidenced significantly greater affiliative
choices as reported in Table 1. It is
interesting to note that the nonaffiliators did not
express a preference for solitude. Only 1 out
of 30 nonaffiliators checked the alone choice,
and the mean affiliation intensity score of subjects
in the information condition is almost
exactly 50, the point midway between alone and
with others. In addition to the objective question,
subjects also gave their reasons for waiting
with others, waiting alone, or having no preference.
These reasons were independently sorted
by two judges into two categories: those reasons
which were indicative of evaluation processes
and those reasons which were not. Only one
subject’s reason was deemed evaluative. This
subject was an affiliator in the no-information
condition, but the evidence is obviously too
meager for any inference.


TABLE 2
SUBJECTS’ INTEREST AND ANXIETY REPORTS

                      “How interested are      “How interested are    “How anxious or
                      you in your exact        you in the results     uneasy do you
                      results on this test?”   of this study?”        feel right now?”
Condition        N      M        SD              M        SD            M        SD
No information   14     5.36     0.66            5.36     0.66          1.93     0.83
Information      24     4.79     1.06            4.96     0.96          1.92     0.72
                      t = 1.54,                t = 2.06,              t = .04,
                      .20 > p > .10            .05 > p > .02          p = ns
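
The two tests summarized in Table 1 can be approximately recomputed from the published counts and summary statistics. The sketch below is illustrative only and rests on assumptions the article does not state: that the Fisher exact probability is one-sided, and that the t test is the pooled-variance form with standard deviations computed on n - 1. The function names are hypothetical.

```python
from math import comb, sqrt


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher exact probability for the 2 x 2 table [[a, b], [c, d]].

    Returns P(X >= a) under the hypergeometric distribution with all
    margins fixed, where X is the count in the upper-left cell.
    """
    row1 = a + b          # first-row total (no-information subjects)
    col1 = a + c          # first-column total (subjects choosing "with others")
    n = a + b + c + d     # total number of subjects
    upper = min(row1, col1)
    favorable = sum(
        comb(row1, k) * comb(n - row1, col1 - k) for k in range(a, upper + 1)
    )
    return favorable / comb(n, col1)


def pooled_t(m1, s1, n1, m2, s2, n2):
    """Independent-groups t from means, SDs, and ns (pooled variance)."""
    pooled_var = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / (n1 + n2 - 2)
    return (m1 - m2) / sqrt(pooled_var * (1 / n1 + 1 / n2))


# Counts and summary statistics as printed in Table 1.
p_value = fisher_one_sided(6, 8, 2, 22)
t_value = pooled_t(60.43, 18.00, 14, 50.13, 11.70, 24)
print(f"Fisher one-sided p = {p_value:.4f}, pooled t = {t_value:.2f}")
```

Under these assumptions the recomputed values fall close to, though not exactly on, the printed p = .017 and t = 2.15; small differences of this size are what one would expect from rounding in the published summary statistics or from a slightly different computing convention.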

The experimental instructions had been designed
to arouse the subjects’ interest and
motivation in the task, and yet not arouse the
subjects’ anxiety. If the task had been one of
just military utility, subjects might have been
bored, resistant, or random in their responding.
On the other hand, had the task been described
as, for example, a new IQ test, the subjects might
have become anxious. Since Schachter found
support for an anxiety-affiliation link, if the
no-information subjects were more anxious than
the information ones, they may have affiliated
not only to evaluate but also to reduce their
anxiety.

Both of these possibilities provide simple 
alternative explanations; both can be discounted. 


TABLE 3
EFFECT OF DIFFERENTIAL INFORMATION UPON
ABILITY ESTIMATE AND AFFILIATION

                                    Number of subjects    Mean affiliation
Information          Mean ability   choosing to wait      intensity
subgroup       N     estimate       with others           scale
Superior       8     4.50           0 out of 8            47.50
Average        7     3.71           1 out of 7            51.00
Inferior       9     2.56           1 out of 9            50.67
                     F = 47.94,                           F = .03,
                     df = 2/21,                           df = 2/21,
                     p < .001                             p = ns


The subjects’ answers to questions concerning
their interest in and anxiety over the test are
reported in Table 2. Clearly, the subjects were
motivated and interested. Out of a possible 6
points, the mean for all subjects on the “exact
results” question was 5.01, on the “study results”
question, 5.11. The significantly heightened interest
of the no-information group provides additional
evidence for their evaluative concern. In
addition, not only were there no anxiety differences
between the information and no-information
conditions, but also those subjects
who did affiliate had no greater scores on the
anxiety scale than those who did not.
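
The overall means quoted for the two interest questions can be roughly recovered as n-weighted averages of the group means in Table 2. This is a minimal sketch assuming that the all-subjects means were formed that way; `weighted_mean` is a hypothetical helper, and the group means are taken as printed.

```python
def weighted_mean(means, ns):
    """Average of group means, weighted by the number of subjects per group."""
    return sum(m * n for m, n in zip(means, ns)) / sum(ns)


GROUP_SIZES = [14, 24]  # no-information, information

# Group means as printed in Table 2.
exact_results = weighted_mean([5.36, 4.79], GROUP_SIZES)
study_results = weighted_mean([5.36, 4.96], GROUP_SIZES)
print(f"exact results: {exact_results:.2f}, study results: {study_results:.2f}")
```

Because the printed group means are themselves rounded to two decimals, a recomputed value may differ from the figure in the text by about .01.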

Although the differential information manipulation
produced the desired effect, as shown in
Table 3, the amount of affiliation in all three
subgroups was so minimal that no differences
could be manifested. The subgroups did differ
in a way, however, which tends to replicate
Dreyer’s (1954) ability and self-evaluation study.
The average subgroup was more confident of
their self-rating of ability than either the superior
or inferior. The confidence difference was not
significant, but the variance of the average
subgroup’s confidence ratings, that is, “How
confident are you . . . ?” was somewhat less than
that of the other subgroups. The lower variability
of the average subjects’ scores may be indicative
of a more stable rating: for average versus
superior, F = 2.53, p < .25; for average
versus inferior, F = 3.02, p < .10. The variances
of the two discrepant groups, superior and inferior,
did not differ from each other, but the
relative magnitudes of the variance differences
are in the direction required by the unidirectional
upward push hypothesis: more variance in the
inferior group.

Also under test was the prediction that first
and only borns would affiliate more than later
borns. This prediction was not confirmed. The
first and only borns were not more anxious nor
did they display more affiliation. Indeed, as
shown in Table 4, they verged on having significantly
less anxiety in the no-information
condition.

TABLE 4
EFFECTS OF BIRTH ORDER UPON ANXIETY AND AFFILIATION

                                 Anxiety     Number of subjects   Affiliation
Condition                  N     scale       choosing             intensity scale
                                 M           with others          M
No information
  First and only born       7    1.57        3                    60.71
  Later born                7    2.22        3                    60.14
                                 t = -1.53, p < .20               t = .20, p = ns
Information
  First and only born      10    1.80        2                    51.30
  Later born               14    1.86        0                    49.29
                                 t = -.18, p = ns                 t = .41, p = ns

DISCUSSION


The primary hypothesis of the study was that
people who did not possess an adequate objective
reference for evaluating an ability would affiliate
with others to utilize the social comparison
processes. As was shown, subjects who were
uncertain concerning an ability preferred the
presence of others more than those who were
certain, establishing the uncertainty-affiliation
link for abilities. That the mechanism underlying
this link is a drive for self-evaluation is an
inference, for the subjects’ stated reasons failed
to display any support for the notion. The inference
is, however, a fairly well-documented one.
Schachter’s (1959) monograph analytically
demonstrates the operation of comparison processes
in affiliative behavior; Radloff (1961), using
the same type of free response reason question
as in this study but with a much larger sample,
was able to demonstrate the presence of such
processes. In postexperimental interviews, subjects
explained that it was not that they had
affiliated for reasons unrelated to comparison,
but rather that their cognitions about the affiliation
were vague and unstructured. They preferred
to make rather innocuous and conventional
responses such as “I just wanted to be with
others.” Such statements are as noninformative
as they are noncommittal; after the session, subjects
volunteered that for some of them comparison
with others was motivating. Unfortunately,
they did not so indicate on the form
provided for that purpose.

The absence of an affiliation effect with discrepant
information may be attributable to any
of several factors. The subjects may have received
enough evaluation so that the normative
information was not discrepant enough to prevent
evaluation. Alternatively, if the subjects in
the superior and inferior subgroups had been
unable to stably evaluate themselves against a
group of midwestern coeds, the pressures for
affiliation may have been counterbalanced by
pressures to choose alone. Nonaffiliation would
protect the superior subjects against the contingency
that if many Penn State girls were
superior to the unspecified midwestern ones, these
subjects would be only average when compared
with their Penn State peers and would lose
their tendency toward superiority. Similarly, the
inferior subjects may have been guarding against
confirmation of their lack of ability. In any
event, the small sample sizes involved militate
against the reliability of any except very large
differences.

Although some first-later-born differences had
been expected, their absence is not startling.
The growing literature upon this topic reveals
that first and only borns consistently show more
affiliative behavior in situations involving severe
threat: glucose (Wrightsman, 1960) or electric
shock (Sarnoff & Zimbardo, 1961; Schachter,
1959). Although similar ordinal position effects
have been reported in ostensibly nonthreat
situations (Capra & Dittes, 1962; Radloff, 1961),
the differences were smaller despite the larger
samples. Thompson (1963) reports that when the
operant level of affiliation was tested in an
experimental situation by giving subjects an
alone-together choice before the independent
manipulation, no birth-order differences were
manifested even though his sample sizes were
comparable to those of Radloff or Capra and
Dittes. It seems reasonable to suggest that
ordinal position effects upon affiliation vary
directly with the magnitude of the threat.
TWO EXPERIMENTS ON THE CONCEPT OF SELF AND THE
REACTION OF OTHERS


HAROLD I. HAAS AND MARTIN L. MAEHR 1


Concordia Senior College


2 experiments on the concept of self as a response to the reactions of others
were conducted. The 1st experiment endeavored to determine the durability of
experimentally induced changes in self-ratings. The 2nd experiment sought to
determine the effect of dosage on such changes in self-ratings. As the result
of the 1st experiment it was concluded that experimentally induced changes
in the concept of self as reflected in self-ratings are durable over time. After
6 weeks, Ss’ self-ratings still exhibited the effects of the experimental treatment.
As the result of the 2nd experiment it was concluded that changes in the
concept of self show the effects of dosage. 2 treatments brought about greater
and longer lasting changes in self-ratings than did 1 treatment.


There is no question but that the concept of
self, or some variant thereof, has once again
become a key variable in psychological theory.
Research articles abound on the topic and, as
Wylie (1961) points out,


all theories of personality which have been put forth
within the last two decades assign importance to a
phenomenal and/or nonphenomenal self-concept with
cognitive and motivational attributes [p. 2].


Current interest in this notion has, of course,
taken many different forms, and consequently the
concept has been variously defined. For example,
Lowe (1961) has isolated six distinct approaches
to the defining of self, all of which find a place
to a greater or lesser degree in current theories.
Although there is wide variation in the specific
use of the concept there is a certain common
core of agreement. Every theory has had to deal
with development and change in the concept
of self. Most, if not all, theorists have proposed
that change and development are to a considerable
extent dependent upon social experience.
Earlier theorists such as Cooley (1902), Mead
(1934), and Sullivan (1953) are largely responsible
for the notion that the individual’s concept
of self is in fact a direct response to the way he
has been treated by “significant others.” Today
most theorists would agree that in one way or
another the individual’s concept of self is
functionally dependent upon the response of others.
Presumably, we view and value ourselves as
others have taught us to view and value ourselves.

In spite of the crucial nature of the above
proposition there is a surprising lack of empirical
evidence to support it. To be sure, a variety of
studies which have a bearing on the issue have
been conducted. For the most part, however,
these studies have been correlational in nature.
That is, they have simply compared the attitudes
that significant others (for example,
parents) hold toward subjects to the subjects’
own self-attitudes. Although such “correlational
evidence” is not to be ignored, its value in
specifying the parameters of self-concept change
is obviously limited. A more direct attack on this
problem has been initiated by Videbeck (1960)
and continued by Maehr, Mensing, and Nafzger
(1962).


1 The authors are indebted to Bruce Hartung, Kenneth
Neumann, and Lawrence Rosenkoetter for assistance
in gathering and analyzing the data and to
R. E. Stake for making helpful suggestions.

Briefly, Videbeck and Maehr selected areas of
human activity in which subjects could be expected
to have attitudes toward self. A rating
scale for measuring these attitudes was developed
and the experimental subjects’ initial self-ratings
determined. The subjects were then required to
perform certain activities in the selected area of
behavior. This behavior was arbitrarily praised
(approval) or criticized (disapproval) by an
“expert.” The subjects’ self-ratings were taken
again immediately following the approval-disapproval
treatment. Videbeck had college students
in a speech class read poems in the
presence of an expert. Significant differences
were found between the before and the after
self-ratings in the predicted direction for disapproval
but not for approval treatment. To test
the replicability of Videbeck’s findings, Maehr
had high school freshmen in a physical
education class do simple physical tasks which
were then praised or criticized. The Maehr findings
not only support Videbeck’s demonstration
that changes in self-ratings can be produced as
the result of disapproval, they show that similar
changes can be produced by approval and, more
significantly, that there is a spread of effect to
areas of self-regard not directly praised or
criticized.

The present study is an extension of this research.
Specifically, it is concerned with two
critical questions related to the phenomenon
demonstrated by Videbeck and Maehr. First, if
the phenomenon of changes in self-concept as
reflected in self-ratings is a real and stable one,
some evidence of its durability over time should
be demonstrable. Second, it is desirable to determine
whether the phenomenon is subject to
dosage, that is, whether greater amounts of approval
or disapproval treatment produce greater
or more durable changes in the self-ratings.


METHOD 
Subjects


The subjects in these experiments were all male,
eighth grade students. The administration of the
self-rating scale was described to them as an investigation
of “how people feel about themselves.” The physical
tasks were described as a “physical development test
to show whether a person has lived up to his potential
for his age and size.”


Measuring Instrument 


The measuring instrument was the Physical Self
Test.2 It consists of 30 items which require the
subject to rate himself on a 9-point scale ranging from
“extremely adequate” to “extremely inadequate.” It 
contains three subtests of 10 items each: The Criti- 
cized scale contains body coordination and motor 
skill items directly related to the tasks the subjects 
performed. The approval and disapproval evaluations 
given by the experts to the subjects were directly re- 
lated to these items. The Related scale items concern 
general athletic skill, and the Unrelated scale items 
concern general physical fitness; these items were 
not directly related to the physical tasks the sub- 
jects performed and were not referred to by the ex- 
perts. The Related and Unrelated scales permitted 
measurement of the spread of effect of the approval- 
disapproval treatment. 


Procedure 


The general procedure, which was the same in both 
experiments, consisted of obtaining the subjects’ self- 
ratings before and after the administration of ap- 
proval or disapproval treatment by an expert. The 
pretests were administered 1 week in advance of the 
approval and disapproval treatments; posttests were 
administered at selected intervals following the ex- 
perimental treatment. Both pre- and posttests were 
administered under conditions which were separated 
from the treatment phase of the experiment. They 
were given in a different classroom setting than that 
used in the treatment phase and by a different ex- 
perimenter. 

The “treatment phase” of the experiment was con- 
ducted as follows: The experimenter explained to the 
subjects that they were to participate in a “physical 
development test.” Following a brief description of 
the general nature of this test, the experimenter in- 


2 Reproductions of this test, including a descrip- 
tion of the physical tasks and praise and criticism 
administered by the “experts,” have been deposited 
with the American Documentation Institute. Order 
Document No. 8132 from ADI Auxiliary Publica- 
tions Project, Photoduplication Service, Library of 
Congress, Washington, D. C. 20540. Remit in ad- 
vance $1.25 for microfilm or $1.25 for photocopies 
and make checks payable to: Chief, Photoduplica- 
tion Service, Library of Congress. 


TABLE 1
DURATION OF CHANGES IN SELF-RATINGS

Mean self-ratings before and after approval and disapproval treatments

                                                   After
               Before        Immediate      1 day          6 days         6 weeks
Subscale       M     SD      M     SD       M     SD       M     SD       M     SD
Approval group (N = 19)
  Criticized   [illegible]   7.3   1.       7.11   .95     7.08  1.00     6.83   .98
                             (t = 5.39**)   (t = 4.42**)   (t = 4.33**)   (t = 2.43*)
  Related      6.50  1.19    6.97  1.11     6.74  1.07     6.59  1.05     6.57  1.08
                             (t = 2.32*)    (t = 1.70)     (t = .56)      (t = .43)
  Unrelated    6.02  1.04    6.50   .91     No data        No data        6.05  1.04
                             (t = 2.82**)                                 (t = .18)
Disapproval group (N = 18)
  Criticized   6.50   .61    5.20   .82     5.74   .70     6.14   .72     6.18   .66
                             (t = 5.45**)   (t = 3.81**)   (t = 1.73)     (t = 1.77*)
  Related      6.18  1.47    5.82  1.38     5.98  1.25     6.04  1.22     6.08  1.06
                             (t = 1.81*)    (t = .86)      (t = .73)      (t = .39)
  Unrelated    6.14  1.03    5.72   .84     No data        No data        6.14   .91
                             (t = 2.12*)                                  (t = .00)

Note.—All tests are one-tailed. The t’s refer to the difference between the before and after means.
* p < .05.
** p < .01.


troduced the subjects to the “physical development 
experts” who were to serve as judges of the subjects’ 
physical development. The subjects were then dis- 
missed and called back individually for “testing.” 
Each subject performed several simple physical tasks 
(such as walking a straight line, dribbling a basket- 
ball, a series of knee bends, etc.) in the presence of 
one of the experts. After all the subjects had been
tested, each subject was brought back, and the expert
responded with a series of approval or disapproval
statements. These statements followed a standardized
script and were administered without regard
for the actual performance of the subjects. Following
the administering of the approval or disapproval
statements, the subjects were dismissed and returned
to their classroom where they later received the first
posttest.


FIG. 1. Duration of changes in self-ratings. (Panels: approval treatment, disapproval treatment. Curves: Criticized scale, Related scale.)


TABLE 2
EFFECT OF DOSAGE ON SELF-RATINGS

Mean self-ratings before and after approval treatments

                                                       After
                          Before       First         Second
                                       treatment     treatment     6 days        6 weeks
Subscale                  M     SD     M     SD      M     SD      M     SD      M     SD
Criticized                6.59  1.27   7.38   .92    7.56  1.01    7.56  1.04    7.53  1.04
  t (vs. before)                       4.32**        5.59**        5.71**        4.85**
  t (vs. first treatment)                            1.70          1.70          1.15
Related                   6.51  1.41   6.61  1.19    6.69  1.22    6.82  1.10    6.70  1.16
  t (vs. before)                        .64          1.29          [illegible]   1.33
  t (vs. first treatment)                             .85          2.03*          .72
Unrelated                 6.49  1.06   6.61  1.08    6.57  1.16    6.70  1.07    6.61  1.00
  t (vs. before)                       1.19           .15          2.06*          .83
  t (vs. first treatment)                             .63          1.01           .00

Note.—All tests are one-tailed. The t’s refer to the differences between the before and after means and the first treatment and
second treatment means. N = 30.
* p < .05.
** p < .01.


Experiment I: Duration


Hypothesis. Changes in self-ratings in response to
the reactions of others demonstrated by the above
procedure are durable over time.

Procedure. Subjects were divided into two matched
groups on the basis of total score on the pretest. One
group (N = 19) was given approval treatment; the
second group (N = 18) was given disapproval treatment.
Posttreatment measures were obtained immediately
(within 1 hour) after the treatment, 1 day
later, 6 days later, and 6 weeks later. (The Unrelated
scale was omitted on the 1 day later and 6
days later tests for various practical reasons. The
Maehr study had revealed no significant difference
on this scale.)

Results. The results are presented in detail in
Table 1 and illustrated graphically in Figure 1. It is
apparent that the treatment had not only immediate
but also rather lasting effects on the subjects’
self-ratings. Changes on the Criticized scale were still
significant at the end of 6 weeks for both approval
and disapproval treatment.

The hypothesis that changes in self-ratings are
durable over time was retained.

Discussion. The results duplicated the findings of
the Maehr study except that in addition the change
on the Unrelated scale for both approval and disapproval
treatments was significant on the immediate
posttest. Because changes in self-rating on the
Criticized scale persisted after 6 weeks, disapproval
treatment was abandoned in subsequent experiments.


TABLE 3
EFFECT OF DOSAGE ON SELF-RATINGS

Mean differences in self-ratings between groups with one and two approval treatments

                                    After       After second versus no additional treatment
Subscale              Pretest       first
                                    treatment   24-48 hours     6 days        6 weeks
Criticized
  M1 - M2              .26           .00          .45            .48           .70
  t                    .70           .00         1.51           1.55          2.30*
Related
  M1 - M2              .01           .36          .05            .23           .13
  t                    .03          1.04          .14            .69           .37
Unrelated
  M1 - M2              .47           .11         No data        No data        .56
  t                   1.48           .36                                      1.86*

Note.—Based on approval data from Experiments I and II. Ns = 19 and 30 for
groups with one and two treatments, respectively. The t’s in pretest and after
first treatment columns require two-tailed tests; remaining columns permit
one-tailed tests.


FIG. 2. Effect of dosage on self-ratings. (Vertical axis: mean self-rating. Horizontal axis: pretest, after 1st treatment, after 2nd treatment.)


Experiment II: Dosage


Hypothesis. Changes in self-ratings in response to 
the reactions of others are subject to dosage. 

Procedure. Thirty subjects participated in this ex- 
periment. All subjects were given approval treatment 
twice. The second treatment followed the first within 
48 hours and was given to each subject by an expert 
who had not administered the first treatment. Post- 
treatment measures of the subjects’ self-ratings were 
made immediately (within 1 hour) after the first 
and second treatments, 6 days later, and 6 weeks 
later. Two types of comparisons were made. In the 
first comparison, the group was compared with itself, 
so that changes in self-ratings following treatments 
were examined. In the second comparison, the self- 
ratings of the subjects in Experiment II were com- 
pared with the self-ratings of the subjects in Ex- 
periment I who had received approval treatment. 

Results. Data relating to the first comparison
(two-treatment group with itself) are presented in Table 2.
Table 3 contains the data for the comparison between
the two-treatment group of Experiment II and the
one-treatment group of Experiment I. The results are
illustrated graphically in Figure 2.

The two-treatment group exhibited the expected
change on the Criticized scale following the first
treatment. However, in contrast to Experiment I
there was much less spread of effect to the other
scales. The changes on the Related and Unrelated
scales were not significant. Following the second
treatment, there was some additional change in the
expected direction on the Criticized scale (p < .10).
The Related and Unrelated scales still did not show
a significant increase. However, 6 days after the
second treatment, the mean self-rating on the [...]
pretest ratings (p < .025 in each instance). After 6
weeks, there was only a slight drop in the Criticized
scale mean. The change from the pretest mean on
the Related and Unrelated scales no longer met the
criterion of significance.

Comparing the results of the two-treatment group
in Experiment II with those of the one-treatment
group in Experiment I, it was found that the groups
did not differ significantly on the Criticized scale
before treatment. Following a single treatment, the
mean self-ratings for this scale were identical. After
the Experiment II group had a second treatment, the
means differed in the expected direction (p < [...]).
After 6 days this difference had increased [...]
effect of the single treatment over time. The
Experiments I and II groups did not differ significantly on
the Related scale at any point. On the Unrelated
scale they did not differ significantly on the pretest
or after a single treatment, but they did differ at
the end of 6 weeks (p < .05).

The hypothesis that changes in self-ratings are
subject to dosage was retained.

Discussion. The changes in the expected direction
on the Criticized scale constituted the third time [...]
the effect of the reactions of others on the [...]
the results are striking. Changes on the Related and
Unrelated scales showing spread of effect to
noncriticized areas of self appear less stable. [...]
[...]lated scales; Experiment II of this study did not
find it after the first treatment for either scale.

The effect of dosage is not entirely clear. Changes
on the Criticized scale following the second treatment
suggest both an increased change in the expected
direction and greater durability of the change.
On the other hand, the changes on the Related and
Unrelated scales suggest some kind of delayed reaction
effect, since the change did not reach an acceptable
level of significance until 6 days after the
second treatment. One possible explanation is that
a phenomenon of cognitive consistency and reduction
of dissonance is involved. The activities criticized by
the expert were quite specific. In the past they had
probably not received much direct thought by the
subjects or direct criticism from significant others.
Thus when the criticism of the expert was received
there was relatively little dissonance to be reduced,
and the criticism could be quickly incorporated into
the concept of self. The effect of the criticism on
the disparate but logically related self-attitudes
reflected in the Related and Unrelated scales was
another matter. Here the effects had to generalize,
dissonance had to be reduced, and consistency had to
be attained. This took time and the effect was not
as great. Determination of whether this interpretation
has merit will depend on future research. It is
entirely possible that these changes may be attributable
to uncontrolled effects of repeated testing or
other artifacts in the experimental situation.
The findings of the APA’s Project on Scientific Information Exchange in Psychology call attention
to the difficulties of the scientist in establishing and maintaining contact with current work relevant
to his interests. Accordingly, the Project is sponsoring a series of changes during the next [...]
months which are designed to enhance scientific communication. These changes are planned as
limited trials. Moreover, they are wholly supported by the Project and hence do not cut into
the page allotment.

The Editor has agreed to make possible a test of one of the changes. Beginning with this issue
and continuing through the remainder of Volumes 1 and 2 (1965), the titles and authors of accepted
papers will be listed following the last article in the journal. Such listing will make it possible
for our readers to become aware of research many months in advance of its journal publication.
At the end of the year’s trial the results of the trial will be evaluated and further consideration
will be given at that time as to whether the service is worth continuing.
ACCURACY OF INTERPERSONAL PERCEPTION AS A
FUNCTION OF CONSISTENCY OF INFORMATION 1


BERNARD PYRON


University of Wisconsin


96 female Ss (Judges) predicted the choices of 8 female Others on 48 concrete
situations and learned the Other’s choice after each prediction. The Others
(tape-recorded voices giving the choices) were “designed” to test the
hypothesis that Others whose behavior could be more easily ordered on a
single, global dimension of discrimination would be predicted more accurately
than those who were inconsistent with a single dimension. Consistency with a
single dimension (global consistency) was studied orthogonal to consistency
within 4 different types of situations (situation consistency) in a 2 X 2 X 2 X 6
repeated measures design. The effects of both global consistency and situation
consistency were significant. There was a significant rise in accuracy over trials
for all conditions. The response set of assumed similarity did not account for
all of the accuracy of prediction.


In the interpersonal perception study, an
individual, conventionally called a Judge, predicts
the behavior of another person, called
an Other. It is our hypothesis that the predictions
of the Judge are derived from his
positions regarding the Other along dimensions
of discrimination. Dimensions of discrimination
are ordering processes imposed
by the Judge upon the behavior of the Other
(Pyron, 1962, 1963). We hypothesize that
most Judges tend to order Others along a
very small number of dimensions of discrimination.
De Soto (1961) has discussed the
idea that people have a predilection for single
orderings (the use of a single dimension of
discrimination) and have an antipathy toward
multiple orderings. He has shown that multiple
orderings (De Soto, 1960) are more difficult
than single orderings when the multiple
orderings are uncorrelated. Most Judges in
interpersonal perception studies, we hypothesize,
tend to use a very small number of
global dimensions to predict an Other and
tend not to make use of the concrete data
of his actions at a given time and in a given
situation. Gage and Cronbach (1955) suggest
that “various global dispositions of the Judge
appear to account for much of the variance
in accuracy scores [p. 420].”


1 This paper is based on a PhD thesis at the
University of Wisconsin in 1963. The research was
done under a predoctoral research fellowship,
Mu +11,075-C2, from the National Institute of
Mental Health, United States Public Health Service.


In accord with this hypothesis we would 
expect that Others whose behavior (to be 
predicted) could easily be ordered along a 
single dimension would be predicted more 
accurately than Others whose behavior could 
not be ordered along a single dimension. 
Thus, the present study focuses upon the 
behavior of the Other. 

If an Other were ethical, conventionally 
religious, submissive to the expectations of 
others, and liked popular music (the globally 
consistent Other), his behavior, we suggest, 
could be more easily ordered along a global 
dimension such as conformity-nonconformity. 
The behavior of an Other who was unethical, 
but conventionally religious, submissive in 
interpersonal situations, but preferred con- 
temporary avant-garde music (the globally 
inconsistent Other) could not be predicted 
from a single dimension like conformity- 
nonconformity. 

Since more than one dimension would be 
needed to predict the globally inconsistent 
Other, which is contrary to the global process 
of dimensional formation during a brief con- 
tact with an Other, we would expect that the 
globally inconsistent Other would be pre- 
dicted less accurately than the globally 
consistent Other. 

More explicitly, we are testing the hypothesis
that the Other who is ethical, conventionally
religious, submissive in interpersonal
situations, and likes popular music (the
consistent conformist) would be predicted more
accurately than an Other who is, for example,
ethical, but not religious, is not submissive
in interpersonal situations, but prefers popular
music (the globally inconsistent Other).
In addition, we are testing the hypothesis that
the Other who is unethical, nonreligious,
nonsubmissive in interpersonal situations, and
prefers contemporary avant-garde music (the
consistent nonconformist) will be predicted
more accurately than a globally inconsistent
Other. We are also hypothesizing that
accuracy will not be significantly different for
the two globally consistent Others, the conformist
and the nonconformist, or for the two
globally inconsistent Others.

In this study, the Other is globally con- 
sistent or inconsistent across four different 
types of situations: ethical; religious;
interpersonal situations, not involving ethical
codes; and situations involving the choice of
musical selection. The Other’s behavior might
also be consistent or inconsistent within each 
type of situation. If a Judge categorized an 
Other as being submissive toward others, he 
would be expected to be submissive to 
teachers as well as to parents and peers. 
Since it is more difficult for Judges to form 
several dimensions appropriate, for example, 
to interaction with peers, parents, and 
teachers within the interpersonal type of 
situation, we would expect that an Other 
who was consistent within a type of situation 
would be predicted more accurately than one 
who was inconsistent. In this study we are 
concerned with two kinds of consistency: the 
consistency with a single, global dimension 
across situational types (global consistency);
and consistency within situational types
(situation consistency). These two kinds of
consistency are studied orthogonal to each other.
Since it would be difficult to select Others
who fitted perfectly into the four resulting
combinations, the Others in this study were 
not real individuals although they were 
presented to the Judges as such. 


Additional Hypotheses 


1. The number of dimensions verbalized
by the Judges about the Others will be small
(three to five), and, in accord with the
hypothesis of global dimension formation and
antipathy toward multiple uncorrelated
dimension formation, we would expect the
number of dimensions verbalized about all
the Others to be the same. Judges will not
tend to use more dimensions to categorize
the more complex Others.

2. Accuracy for all Others will increase
significantly during the predictive sequence.
Unlike nearly all other interpersonal perception
studies, in this study the Judges were
given the actual choice of the Other following
each of the 48 predictions, and curves
relating accuracy to blocks of predictions
were constructed.

3. Accuracy for all Others will be signifi- 
cantly greater than the level of accuracy that 
would have occurred if the Judges had 
received no information (no-information 
level) about the Other. 

4. Since the inconsistent Others tend to 
disconfirm the Judge’s expectations, the 
Judges will tend to dislike them. The con- 
sistent nonconformist stands out clearly from 
the Judge’s dimensional position about the 
average person and will also be disliked. 


METHOD 
Design 


Global consistency and situation consistency were
manipulated independently in a basic 2 X 2 design.
In addition, a third variable, direction, was made 
orthogonal to the two consistency variables. Direc- 
tion 1 for the global consistent, situation consistent 
condition refers to the conformist Other, while 
Direction 2 refers to the nonconformist. For the 
global inconsistent conditions direction was defined 
by the pattern of inconsistency across the four 
situational types. 

The predictive sequence was divided into blocks 
of trials so that there are six blocks and eight 
predictions within each block. The total design thus 
fits a 2X2X2X6 repeated measures analysis of 
variance. 
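The layout described above can be sketched in a few lines of Python. This is only an illustrative enumeration of the design cells, not part of the original study; the names `design_cells`, `JUDGES_PER_CONDITION`, and the level labels are assumptions introduced here, while the counts (two levels per between-Judges factor, six blocks of eight predictions, twelve Judges per condition) come from the text.

```python
from itertools import product

# Level labels are illustrative; the factor structure follows the Design section.
GLOBAL_CONSISTENCY = ("consistent", "inconsistent")
SITUATION_CONSISTENCY = ("consistent", "inconsistent")
DIRECTION = (1, 2)

N_BLOCKS = 6                 # blocks of trials (the repeated measure)
PREDICTIONS_PER_BLOCK = 8    # predictions within each block
JUDGES_PER_CONDITION = 12    # stated in the Subjects section


def design_cells():
    """Return the between-Judges conditions of the 2 X 2 X 2 layout."""
    return [
        {"global": g, "situation": s, "direction": d}
        for g, s, d in product(GLOBAL_CONSISTENCY, SITUATION_CONSISTENCY, DIRECTION)
    ]


cells = design_cells()
total_judges = len(cells) * JUDGES_PER_CONDITION
total_predictions = N_BLOCKS * PREDICTIONS_PER_BLOCK
print(len(cells), total_judges, total_predictions)
```

Under these assumptions the enumeration gives eight conditions, 96 Judges, and 48 predictions per Judge, which matches the counts reported elsewhere in the article.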

Within each of the four situational types there
were several kinds of subsituations. For the
situation inconsistent conditions, the Other chose the
alternative reflecting either a conforming or
nonconforming attitude in 66% of the situations and
the opposite attitude in the remaining 33% within
a type of situation. This inconsistency within
situational types was not random. For example, for the
religious situational type, 4 of 12 subsituations
concerned church membership. An Other was either
conventionally religious or nonreligious in the 8
subsituations not involving church membership and
reflected the opposite attitude for those involving
church membership.
The eight Others thus exhibited the attitude
combinations given in Table 2.


Predictive Task 


Within each type of situation there were 13
hypothetical concrete situations, and for each of the
52 total situations there were four alternatives. The
four types of situations are described below:


Ethics. The 13 ethical situations were considered
to be such that nonconformity could lead to social
disapproval or even a jail sentence. They were
designed so that many people, it was believed, would
feel that nonconformity might be partially or wholly
justified in the situation. In no situation would
nonconformity be a serious criminal offense. Since the
Judges predicted an Other for only 48 situations,
there were 12 situations within each situation type.
Four of the 12 ethical situations concerned sexual
ethics and the other 8 were concerned with cheating
on exams, nonpayment of bills, lying in court, failure
to follow a contract, and violence toward a thief.
One example of an ethical situation is:


Suppose you are an architect and want very much
to set up your own architectural office, which
would cost you $5,000. You have saved $5,000,
but you have just undergone a series of surgical
operations, the bill for which is $4,000, which
you feel is outrageous. I would:


1. Pay the medical bill, and wait another three
years to set up the architectural office.

2. Pay part of the bill and wait a year to set
up the office.

3. Not pay the bill at all and set up the office
immediately.

4. Pay only $3,000 and try to avoid paying the
remainder.


The ethical choice was Alternative 1 and the
nonethical choice was Alternative 3. For each type
of situation the most extreme choice was given by
the Other for one direction and the most extreme
opposite choice for the opposite direction.

For the situation inconsistent conditions the Other
was either ethical or nonethical in all situations not
involving sex, and was the opposite in the four
sexual situations.

Religion. The 13 items and hypothetical situations
were developed from the Extrinsic Religious Values
Scale (Wilson, 1960). These items reflect


(a) . . . an allegiance to, and dependence upon,
the external or institutional structure of a Church;
and (b) . . . a utilitarian orientation toward
religion, i.e., acceptance of religion as a means
[p. 286].

For one direction the Other accepted the extrinsic
or conventionally religious items and for the opposite
direction the Other accepted the items which
reflected either a nonreligious or an unconventionally
religious attitude. Four of the 12 items dealt with
church membership, and the other 8 concerned the
Lord’s Prayer, the purpose of religion, the function
of prayer, acceptance of nonmembers of one’s
Church, and attitudes toward the Bible.

For the situation inconsistent conditions, the Other 
was conventionally religious or nonreligious for the 
situations not involving church membership and re- 
flected the opposite attitude in those situations 
dealing with church membership. 

Submission in interpersonal situations. For one 
direction the Other accepted the alternative which 
reflected a desire to submit to people’s demands, 
and for the opposite direction, indicated non- 
submission to the demands of people. In these 
situations nonconformity would not involve viola- 
tion of an ethical code as in the ethical situational 
type. Four of the 12 situations dealt with submis- 
sion to teachers and the other 8 concerned submission 
to parents and peers. 

For the situation inconsistent conditions, the Other 
was submissive or nonsubmissive toward parents 
and peers, and the opposite toward teachers. 

Musical preferences. Each musical choice situation 
presented four 1-minute excerpts from either ro- 
mantic or classical music, a kind of “classical” 
popular music like Beer Barrel Polka, and contempo- 
rary music (Bartok Quartets and Webern Cantatas), 
and experimental electronic or other types of avant- 
garde works (by Varese, Stockhausen, Cowell, and 
Murphy). For one direction the Other chose the 
popular and for the opposite direction the con- 
temporary was chosen. 

For the situation inconsistent conditions the Other
preferred either popular or contemporary music
for eight of the choice situations and preferred Bach
and Beethoven for the remaining four.


Procedure 


The 52 situations were first presented to the Other 
who gave her choice upon completion of the situa- 
tion description. Her verbal choices and the voice of 
the experimenter describing the situations were tape 
recorded and played to the Judges. The musical 
excerpts were recorded on the same tape. In addition, 
the Judges were given the 52 situations on
mimeographed sheets.

Judges were given the task in groups of four and 
first responded to the 52 situations as they thought 
they would react themselves and as they thought 
the average university student would react. Then 
the Judges were told they were to predict the 
choice of the Other before she actually gave her 
choice, and that as the experiment progressed they 
would have more and more information about the 
person they were to predict. In addition, they were 
told they would be asked to verbalize the concepts
they were using to predict the Others.

The Judges were given the Other’s choices for 
the first four situations before they began pre- 
dicting. During the predictive task of 48 situations, 
the Judges first predicted the Other’s choice for a 
given situation and then heard her actual choice
on tape.
Each group of four situations contained one
situation from each of the types of situations, presented
in a constant order. The order of the groups of
situations was, however, not the same for all Judges.
Within each of the eight conditions of the design,
there were three orders, so that a given group of
situations occurred once near the beginning, near
the middle, and near the end of the sequence. The
orders were designed so that those items on which
the situation inconsistent Other was inconsistent
occurred once in each aggregate of 12 situations
(three groups), or four times during the task.

At the end of each aggregate of 12 situations the 
Judges were asked to write down at least one 
concept about the Other and were permitted to 
write as many as four each time. The number of 
concepts could range from 1 to 16. At the end of 
the predictive sequence the Judges rated the Others 
on nine evaluative scales (9-point scales) of the 
semantic differential (Osgood, Suci, & Tannenbaum, 
1957). These nine scales were used because Osgood
et al. reported evaluative factor loadings of .75 or
better for them.

The eight Others were not real individuals but 
were invented according to the design given above. 
A female graduate student was given the choices
for each Other and her voice was heard on tape
giving the Others’ choices.2 The same voice was
used for all eight Others.


Subjects 


Subjects (Judges) were 96 females from
introductory psychology classes at the University of
Wisconsin. Each Judge predicted one Other. Twelve
Judges were used in each of the eight conditions. 


RESULTS AND DISCUSSION 


Mean Number of Concepts used by Judges 
to Describe the Others 


There were no significant differences between
the mean number of concepts made
by the Judges about the eight Others. However,
fewer concepts were used to describe
the global consistent, situation consistent
Others (the mean was 3.6) than for all the
remaining Others (the mean was 4.4 for the
remaining six Others). The small number of
concepts verbalized about the Others throughout
the predictive sequence tends to support
the hypothesis that Judges use a small number
of global dimensions to predict Others.
In spite of the greater number of dimensions
needed to predict the inconsistent Others,
there was only a slight tendency (not
statistically significant) to verbalize more
concepts.


2 The author wishes to thank Gloria Welniak for
making the eight tapes.


Effects of the Three Variables, Not Con- 
sidering Trials 

The accuracy of prediction score for each 
Judge was the total number of predictions by 
the Judge which were identical to the choices 
of the Other in the 48 situations. Scores 
could range from 0 to 48. 
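The scoring rule just described amounts to counting exact matches between a Judge's predictions and the Other's choices. A minimal sketch follows; the function names and the short example sequences are hypothetical and are not data from the study.

```python
def accuracy_score(predictions, choices):
    """Count predictions that are identical to the Other's choices."""
    if len(predictions) != len(choices):
        raise ValueError("predictions and choices must have the same length")
    return sum(p == c for p, c in zip(predictions, choices))


def block_scores(predictions, choices, block_size=8):
    """Split the sequence into consecutive blocks and score each block."""
    return [
        accuracy_score(predictions[i:i + block_size], choices[i:i + block_size])
        for i in range(0, len(predictions), block_size)
    ]


# Hypothetical alternatives (1-4) for a short sequence of four situations.
example_predictions = [1, 2, 3, 4]
example_choices = [1, 2, 4, 4]
print(accuracy_score(example_predictions, example_choices))
```

With the full 48-situation sequence and the default block size of eight, `block_scores` would return six per-block scores whose sum equals the total accuracy score.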

It should be noted that the mean accuracy scores
for the three orders of presentation were 24.8,
23.9, and 24.7; the differences among them do not
approach statistical significance.

The analysis of variance reported in Table
1 indicates that the effects of global consistency
and situation consistency were both
significant beyond the .01 level. Mean
accuracy scores for all eight conditions are
reported in Table 2. The overall situation
consistency means were 31.0 for the consistent
and 17.9 for the inconsistent; and the overall
global consistency means were 26.4 for the
consistent and 22.5 for the inconsistent.
Thus, the hypotheses that both global consistent
and situation consistent Others would
be predicted more accurately than the
inconsistent Others were confirmed.
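As a check on the arithmetic, the overall means quoted above can be recovered from the eight condition means in Table 2. The sketch below assumes equal cell sizes (12 Judges per condition, as stated under Subjects), so each marginal mean is the unweighted average of four condition means; the dictionary layout and the name `marginal_mean` are illustrative, and the assignment of Table 2 rows to cells is a reading of the table rather than something the text spells out.

```python
# Keys: (global consistency, situation consistency).
# Values: mean accuracy for (Direction 1, Direction 2), read from Table 2.
CELL_MEANS = {
    ("consistent", "consistent"): (33.1, 36.6),
    ("consistent", "inconsistent"): (16.3, 19.5),
    ("inconsistent", "consistent"): (28.5, 25.8),
    ("inconsistent", "inconsistent"): (17.2, 18.8),
}


def marginal_mean(factor, level):
    """Average the condition means at one level of 'global' or 'situation'."""
    index = {"global": 0, "situation": 1}[factor]
    values = [
        mean
        for key, pair in CELL_MEANS.items()
        if key[index] == level
        for mean in pair
    ]
    return sum(values) / len(values)


for factor in ("situation", "global"):
    for level in ("consistent", "inconsistent"):
        print(factor, level, marginal_mean(factor, level))
```

The computed values agree with the reported 31.0, 17.9, 26.4, and 22.5 to within rounding of the tabled means.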

Direction was not significant. However, the
interaction between global consistency and
direction indicates that for the global consistent
conditions, the nonconformist was
predicted more accurately than the conformist.
The Duncan range test (Duncan, 1955)
shows that the nonconformist was predicted
significantly more accurately for both the
global consistent conditions. For all Duncan
range tests reported, the significant
studentized ranges were for the .05 level.


TABLE 1

SUMMARY OF ANALYSIS OF VARIANCE FOR GLOBAL CONSISTENCY,
SITUATION CONSISTENCY, DIRECTION, AND TRIALS

Source                          df       MS         F
Global consistency (GC)          1      58.08      20.1**
Situation consistency (SC)       1     682.10     235.9**
Direction (D)                    1       7.80     [illeg.]
GC X SC                          1      62.10      21.4**
GC X D                           1      15.30       5.2
SC X D                           1       3.80     [illeg.]
GC X SC X D                      1       5.30     [illeg.]
Ss/groups (error a)             88       2.89       1.9
Trials (T)                       5      45.20      29.1**
T X GC                           5        .80     [illeg.]
T X SC                           5       8.4        5.4
T X D                            5       1.00     [illeg.]
T X GC X SC                      5       1.42     [illeg.]
T X GC X D                       5       2.00     [illeg.]
T X SC X D                       5       3.40     [illeg.]
T X GC X SC X D                  5       1.40     [illeg.]
Ss X T/groups (error b)        440       1.55

*p < .05.
**p < .01.


TABLE 2

MEAN ACCURACY AND MEAN NO-INFORMATION LEVEL
FOR ALL EIGHT CONDITIONS

                                                   Mean       Mean no-
Condition                                          accuracy   information   t
                                                              level
Ethical (E), religious (R), submissive (S),
  popular music (P) (100% conformist)               33.1       23.1          6.49**
Nonethical (NE), nonreligious (NR),
  nonsubmissive (NS), contemporary music (C)
  (100% nonconformist)                              36.6       18.7         13.30**
.66 E, .66 R, .66 S, .66 P (66% conformist)         16.3       19.4         [illeg.]
.66 NE, .66 NR, .66 NS, .66 C
  (66% nonconformist)                               19.5       15.8          2.43*
E, NR, NS, P (global inconsistent,
  situation consistent, Direction 1)                28.5       24.2          4.70**
NE, R, S, C (global inconsistent,
  situation consistent, Direction 2)                25.8       16.7          5.50**
.66 E, .66 NR, .66 NS, .66 P (global
  inconsistent, situation inconsistent,
  Direction 1)                                      17.2       18.7         [illeg.]
.66 NE, .66 R, .66 S, .66 C (global
  inconsistent, situation inconsistent,
  Direction 2)                                      18.8       15.0          3.06*


The significant interaction (Table 1) between
global consistency and situation consistency
shows that the effects of global
consistency occur only for the situation
consistent conditions. Again, the Duncan range
test for the four mean accuracy scores
(direction pooled) indicates that the means
for the global consistent, situation inconsistent
and global inconsistent, situation
inconsistent conditions do not differ from
each other while all other means are significantly
different. It is quite likely that this
interaction is due to the powerful effect of the
situation consistency variable which accounts
for a sizable portion (62%) of the total
variance. Perhaps the percentage of situation
consistency must occur at a certain level,
higher than 66%, before global consistency
can exert an effect on accuracy of prediction.
The percentage of consistency value which
would permit global consistency to operate
would have to be determined by varying
situation consistency at several levels from
66% to 100%. Such an experiment might
show global consistency to be a more important
factor in determining accuracy than it
appears in the present study.

The powerful nature of the situation con- 
sistency variable in this study may account 
in part for the small subject (Judge) effect 
reported in Table 1. Although the overall 
subject effect barely reaches the .05 level 
(error a/error b, which would bias the F 
negatively to the extent that Ss X T inter- 
action was present), separate analyses of 
variance for the four conditions (pooled for 
direction) reveal that the subject (Judge) 
effect for all situation inconsistent conditions 
fails to reach significance. Thus, the subject 
effect in this study is small compared to 
the usual individual difference effect found 
in most studies in social psychology. 
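The F ratios in Table 1, including the subject effect discussed above (error a divided by error b), follow directly from the mean squares. The sketch below recomputes a few of them under the usual assumption for this kind of repeated measures layout: between-Judges effects are tested against error a and within-Judges effects against error b. The constant and function names are illustrative; the mean squares are those legible in Table 1.

```python
ERROR_A = 2.89  # Ss/groups: error term for between-Judges effects
ERROR_B = 1.55  # Ss X T/groups: error term for within-Judges effects

# Mean squares read from Table 1 (only entries that are clearly legible).
BETWEEN_MS = {"GC": 58.08, "SC": 682.10, "GC X SC": 62.10}
WITHIN_MS = {"T": 45.20}


def f_ratio(ms_effect, ms_error):
    """Return the F ratio of an effect mean square to its error mean square."""
    return ms_effect / ms_error


for name, ms in BETWEEN_MS.items():
    print(name, round(f_ratio(ms, ERROR_A), 1))
for name, ms in WITHIN_MS.items():
    print(name, round(f_ratio(ms, ERROR_B), 1))

# Subject (Judge) effect as described in the text: error a / error b.
subject_f = f_ratio(ERROR_A, ERROR_B)
print("Ss/groups", round(subject_f, 1))
```

The recomputed ratios agree with the tabled F values to within about 0.1, the small discrepancies presumably reflecting rounding of the printed mean squares.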


Fig. 1. Mean accuracy of prediction as a function
of blocks of trials for the situation consistent
conditions (open circles) and situation inconsistent
conditions (closed circles). (Each data point is based
upon 48 Judges. The no-information level for the
situation consistent conditions is indicated by the
broken lines between the open circles and for the
inconsistent by the closed circles.)
Trials Effect


The overall Trials effect shown in Table 1
is significant beyond the .01 level. The significant
interaction between trials and situation
consistency reveals that the trials effect was
greater for the situation consistent conditions
than for the situation inconsistent conditions.
This interaction is shown in Figure 1, which
shows the mean accuracy scores for all situation
consistent conditions and for all situation
inconsistent conditions as a function of blocks
of trials.

An analysis using five orders of orthogonal 
polynomials shows the linear component to 
be significant for all four conditions (direc- 
tion pooled). The quadratic component is 
also significant for all the situation consistent 
conditions, but not for the situation incon- 
sistent conditions. All the accuracy curves 
tend to level off soon after the middle of the 
predictive sequence. The finding that the 
linear component was significant for all condi- 
tions (direction pooled) tends to support 
the hypothesis that accuracy of prediction 
would increase significantly for all Others 
during the predictive sequence. 
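For readers unfamiliar with trend analysis, the linear and quadratic components mentioned above are contrasts on the six block means using the standard orthogonal polynomial coefficients for six equally spaced levels. The sketch below shows only the contrast arithmetic; the block means are hypothetical values chosen to rise and then level off, not data from the study, and the significance tests themselves are not reproduced.

```python
# Standard orthogonal polynomial coefficients for six equally spaced levels.
LINEAR = (-5, -3, -1, 1, 3, 5)
QUADRATIC = (5, -1, -4, -4, -1, 5)


def contrast(block_means, coefficients):
    """Weighted sum of block means; its sign gives the direction of the trend."""
    if len(block_means) != len(coefficients):
        raise ValueError("need one coefficient per block mean")
    return sum(m * c for m, c in zip(block_means, coefficients))


# Hypothetical per-block accuracy means (maximum of 8 per block).
hypothetical_means = [3.0, 3.8, 4.3, 4.6, 4.7, 4.7]

linear_value = contrast(hypothetical_means, LINEAR)
quadratic_value = contrast(hypothetical_means, QUADRATIC)
print(linear_value, quadratic_value)
```

A positive linear value with a negative quadratic value is the pattern expected for a curve that increases over blocks and then flattens, which is the shape the text describes for the situation consistent conditions.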


Accuracy Compared to the No-Information 
Base Level 


The no-information level is defined as that 
level of accuracy which would occur if the 
Judge had received no information about the 
Other. Since we have the Judge’s own posi- 
tion on each situation and the position he 
would attribute to the average student, we 
can estimate the no-information level. We 
first computed the accurate assumed similar- 
ity (Cronbach, 1955), and in addition, the 
accurate assumed similarity for the average. 
These two assumed similarity scores, when 
independent, were added together and the 
sum was subtracted from the total accuracy 
score for each Judge. The resulting residual 
accuracy could, conceivably, result from both 
accurate assumed dissimilarity and chance 
accuracy. This possible chance component of 
the accuracy score (one-fourth of the residual 
accuracy, since there were four alternatives 
for each situation) was also added to the two 
assumed similarity scores. The no-information 
level is, quite likely, overly estimated since a 
Judge may have been making use of information 
about the Other in predicting her 
choices even though the prediction was iden- 
tical to either the Judge’s own position or 
that she attributed to the average. But, if 
accuracy is significantly greater than the 
no-information level, we have evidence for 
assuming the Judges were making use of the 
information about the Other. 
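
The computation of the no-information level described above can be sketched as follows. This is one reading of the procedure, not the authors' own scoring routine: "when independent" is taken to mean that a correct prediction matching both the Judge's own position and the position attributed to the average student is counted only once. The function name, argument names, and example responses are hypothetical.

```python
def no_information_level(predictions, own, average, actual, n_alternatives=4):
    """Return (accuracy, no_information_level) for one Judge.

    predictions: the Judge's predicted choice for the Other, per situation
    own:         the Judge's own position, per situation
    average:     the position the Judge attributes to the average student
    actual:      the Other's actual choice, per situation
    """
    accuracy = 0
    assumed_similarity = 0  # correct predictions identical to own or average position
    for pred, o, avg, act in zip(predictions, own, average, actual):
        if pred != act:
            continue
        accuracy += 1
        if pred == o or pred == avg:
            assumed_similarity += 1
    residual = accuracy - assumed_similarity
    chance = residual / n_alternatives  # one-fourth of the residual accuracy
    return accuracy, assumed_similarity + chance


# Hypothetical responses for six situations with four alternatives each.
accuracy, base_level = no_information_level(
    predictions=[1, 2, 3, 4, 1, 2],
    own=[1, 2, 1, 1, 3, 3],
    average=[2, 2, 3, 2, 2, 4],
    actual=[1, 2, 3, 4, 1, 1],
)
```

In this example five predictions are correct, three of them coincide with the Judge's own or attributed-average position, and one-fourth of the remaining two is credited to chance, so the base level is 3.5 against an accuracy of 5.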

The mean accuracy scores (summed across 
the six trials) and the mean no-information 
level scores for each Judge were computed and 
correlated t tests (two-tailed) were performed 
for all eight conditions. Mean accuracy scores, 
mean no-information level scores, and t tests 
are reported in Table 2. Accuracy is sig- 
nificantly greater than the no-information 
level for all conditions except the 66% con- 
formist, and the .66 E, .66 NR, .66 NS, .66P 
(see Table 2). 


Assumed Similarity Scores 


Means for Judge-Other total assumed similarity, 
accurate assumed similarity, and inaccurate 
assumed similarity were computed. 
An analysis of variance reveals that accurate 
Judge-Other assumed similarity is significantly 
greater for the situation consistent 
conditions (with a mean of 6.4 for the consistent 
and 4.0 for the inconsistent). Inaccurate 
assumed similarity is significantly greater 
for the situation inconsistent conditions (with 
a mean of 6.2 for the consistent and 10.1 for 
the inconsistent). Total assumed similarity 
(total number of predictions which are 
identical to the Judge’s own position) is 12.7 
for the situation consistent Others and 14.1 
for the situation inconsistent Others. The 
situation inconsistent Others were, we suggest, 
ambiguous persons, while the situation consistent 
Others were less ambiguous. The 
greater the ambiguity, the greater is the tendency 
of the Judge to rely upon his own 
position in predicting the Other, and the 
greater is his inability to differentiate the 
Other from himself. When the Other is ambiguous 
because of situation inconsistency, 
the Judge tends to wrongly assume that the 
Other is like himself. 

Correlations between accuracy, Judge-Other 
total assumed similarity, accurate assumed 
similarity, and inaccurate assumed 
similarity were computed. The correlations 
between accurate assumed similarity (Judge-Other) 
and inaccurate assumed similarity 
reveal that this relationship is nearly zero 
in five cases and significantly positive in 
only one case. All are positive, which 
indicates that high accurate assumed simi- 
larity tends to be slightly associated with 
high inaccurate assumed similarity. This may 
suggest that the tendency to assume similar- 
ity, both accurately and inaccurately, is a 
general individual difference “trait” of Judges. 
Crow and Hammond (1957) found that as- 
sumed similarity scores had more consistency 
over time than their corrected accuracy 
scores. With the exception of the 66% con- 
formist and the 66% nonconformist condi- 
tions, the correlations between accuracy and 
Judge-Other total assumed similarity were 
negative, ranging from —.06 to —.61. 


Semantic Differential Evaluation 


Contrary to the original hypothesis, the 
global consistent Others were not evaluated 
more positively than the global inconsistent 
Others (means of 6.4 and 6.1), nor were 
the situation consistent Others evaluated 
more positively than the situation inconsistent 
Others (means of 6.6 and 6.0). However, 
the hypothesis that the 100% nonconformist 
would be evaluated most negatively was con- 
firmed. The mean evaluation score for her 
was 8.7, which differs significantly from all 
other means (Duncan range test). No other 
means differ significantly. 

The correlations between evaluation and 
Judge-Other similarity (number of situations 
for which the Judge’s choice is identical 
to that of the Other) suggest that Judges 
tended to like those who were closest to their 
own positions and disliked those who differed 
considerably from their positions. These 
correlations are all negative, indicating that 
high Judge-Other similarity tends to be associated 
with positive evaluation (a lower evaluative 
score), and only those for the global 
consistent, situation consistent conditions 
reached statistical significance. Perhaps the 
Judges were clearly aware of the positions of 
the 100% conformist and 100% nonconformist 
while the remaining Others were somewhat 
ambiguous. They were aware of the degree of 
similarity between themselves and the global 
consistent, situation consistent Others, and 
were less aware of the similarity for the 
remaining Others. 




Support for the Hypotheses 


The results of this study tend to support 
the hypothesis that most Judges in inter- 
personal perception studies categorize Others 
along a small number of global dimensions. 
This economy in the use of dimensions of 
discrimination would tend to result in high 
predictive accuracy when Others are in fact 
consistent with a single, global dimension. But 
when they are not, their behavior is less 
accurately predicted by most Judges. In addition, 
the findings suggest that the Judges 
were making use, at least to a small degree, 
of the information received about the Other. 
The response sets to assume similarity do not 
account for all of the accuracy even for some 
of the inconsistent Others. 

THE RELATIONSHIP OF EXPECTATION OF SUCCESS 
TO NEED ACHIEVEMENT AND TEST ANXIETY 1 


N. T. FEATHER 
University of New England, Australia 


The experiment investigated the prediction that Ss’ expectations of success 
prior to task performance would be positively related to n Ach in a situation 
providing the opportunity for personal accomplishment. It was also predicted 
that these initial expectations would be negatively related to test anxiety but 
that, following task performance, relationships would be less apparent. In a 
test of these predictions, 168 male Ss rated their chances of success before 
commencing an anagrams task. The task was presented as “moderately difficult” 
to 82 Ss and as “easy” to 86 Ss. In a postperformance questionnaire, 
Ss rated their chances of success for the middle and terminal stages of performance. 
Results supported predictions and were interpreted in terms of the 
influence of past experience in related situations, and as also possibly reflecting 
the effect of subjective attractiveness of success and subjective repulsiveness of 
failure on initial expectations. 


In a recent investigation (Feather, 1963c), 
subjects were required to estimate their 
chances of successfully completing an ana- 
grams task prior to actual performance. They 
were provided with fictitious information 
about the average probability of success of a 
peer group who had performed the task in the 
past. The results indicated that subjects’ initial 
estimates of probability of success at the task 
were positively related to facilitating anxiety 
and negatively related to debilitating anxiety 
as assessed by the Achievement Anxiety Test 
(Alpert & Haber, 1960). The correlations 
were very low but in the predicted direction. 
Initial estimates of probability of success were 
not, however, positively related to need for 
Achievement (n Ach) assessed by the TAT 
procedure. It is possible that this predicted 
positive relationship would appear in a test 
situation where instructions imply that suc- 
cess at the task is by no means certain and 
where there are fewer reality constraints than 
in the situation investigated in the previous 
study. The experiment reported in the present 
paper was designed to explore this possibility. 

1 Parts of this paper were written while the 
author was visiting scientist and research associate 
at the University of Michigan under the sponsorship 
of the Institute of Science and Technology and the 
National Science Foundation (Project GS-9). 

The present experiment is a modification 
of a study by Sarason (1961) in which he 
examined the effect of the interaction of 
instructions and anxiety on the solution of a 
difficult task, viz., solving complex anagrams. 
Sarason used two different sets of instruc- 
tions. Subjects given the first set were told 
that solving anagrams is directly related to 
intelligence level. But they were also informed 
that high school students of above average 
intelligence and most college students should 
be able to complete the task successfully in 
the time allowed. Subjects given the second 
set of instructions were told that the anagrams 
were harder than most and that they may not 
be able to finish all of them. We assume that 
these instructions influence subjects’ initial 
expectations of success so that those given the 
first set of instructions begin the task with 
a high expectation of success, believing it to be 
easy and well within their competence, while 
those given the second set of instructions 
begin the task with a lower expectation of 
success, believing it to be moderately difficult. 
If we assume that expectations of success in 
this second condition are relatively closer to 
a 50/50 subjective probability of success 
(Ps) then it follows from Atkinson’s (1957) 
model that both achievement motivation and 
motivation to avoid failure would be elicited 
at greater strength when the second set of 
instructions is employed. 

It is predicted that subjects’ initial esti- 
mates of probability of success at the task 
will be positively related to n Ach where the 
task is presented as moderately difficult, i.e., 
where achievement motivation is assumed to 
be elicited at a relatively high level. The positive 
relationship may be less evident, however, 
where the task is presented as easy, i.e., where 
strength of achievement motivation is assumed 
to be relatively weak. The basis of the 
above prediction is that presented in the previous 
paper (Feather, 1963c). Specifically, it is 
assumed that subjects high in n Ach have in 
the past performed relatively better than subjects 
low in n Ach, in situations where 
achievement motivation has been aroused 
(Atkinson, 1958; McClelland, 1961), particularly 
where it has been aroused at a high 
level. This superiority in past performance 
would act as a frame of reference to determine 
relatively higher initial expectations of success 
among subjects high in n Ach when they come 
to a task which can be seen as providing the 
opportunity for exercising personal skill and 
the possibility of a praiseworthy accomplishment. 

In line with the results of the previous 
study (Feather, 1963c) we also predict that 
subjects’ initial estimates of probability of 
success at the task will be negatively related 
to test anxiety assessed by the Test Anxiety 
Questionnaire (Mandler & Sarason, 1952), 
since Test Anxiety scores and debilitating 
anxiety scores are positively correlated (Al- 
pert & Haber, 1960). This negative relation- 
ship may be rather more apparent where the 
task is presented as moderately difficult, i.e., 
where motivation to avoid failure is assumed 
to be elicited at a relatively high level. This 
prediction is based on the assumption that, 
when the task is complex, subjects high in 
test anxiety have generally performed less 
well in stressful situations than subjects low 
in test anxiety (Mandler & Sarason, 1952; 
Sarason, 1961, 1963), particularly when a 
high level of motivation to avoid failure has 
been aroused. This inferior past performance 
would tend to determine relatively lower 
initial expectations of success among subjects 
high in test anxiety when they are presented 
with a complex task under test conditions. 

Finally, we assume that once subjects are 
involved in performance of a task, the frequency 
and patterning of their actual successes 
and failures will become the dominant 
influence in shaping their expectations. 
Hence the predicted relationships between 
estimates of probability of success and either 
n Ach or test anxiety should become less 
apparent once expectations of success have 
been modified by task performance (cf. 
Feather, 1963a). 


PROCEDURE 


Subjects were 168 male, external students attending 
a vacation school in psychology at the University 
of New England in 1962.2 These subjects first completed 
the Test Anxiety Questionnaire (TAQ; 
Mandler & Sarason, 1952). A subject’s Test Anxiety 
score was obtained by summing his scores on the 
individual items of the TAQ. As in previous investigations 
(for example, Feather, 1963b) these 
item scores were obtained by dividing the rating scale 
for each item into five equal parts and by scoring 
responses from one to five in the direction of increasing 
anxiety. 

One week after administration of the TAQ sub- 
jects were tested for n Ach under neutral con- 
ditions according to the standard procedure (Mc- 
Clelland, Atkinson, Clark, & Lowell, 1953). The 
following four pictures were presented: 2, 48, 1, 
and 7 (using numbers assigned by Atkinson, 1958). 
Interscorer reliability 3 for scoring the TAT n Ach 
was r = .89. 

Following the measurement of n Ach, subjects 
were presented with an anagrams test. Two different 
sets of instructions, corresponding to those used by 
Sarason (1961), were randomly distributed among 
subjects to vary initial expectation of success. In 
the moderately difficult experimental condition the 
instructions were as follows: 


On the next page you will see a set of dis- 
arranged words (anagrams). Your job will be to 
rearrange each group of letters so that they make 
a meaningful word. Start when you are so in- 
structed. Stop at the stop signal. 

Most of you probably have worked anagrams. 
The task on the next page works the same way. 
These anagrams, however, are harder than most 
you have seen in books and magazines. Con- 
sequently, you may not finish all of them and 
you may find some of the anagrams very difficult. 
If this happens, don’t worry about it. No one will 
find the anagrams easy. 


In the easy experimental condition the instructions 
were the same as above except for the second para- 
graph which was as follows: 


Ability to organize material such as the letters 
on the next page has been found to be directly 
related to intelligence level. High school students 
of above average intelligence (IQ greater than 100) 
and most college students should be able successfully 
to complete the task. You will have 18 minutes 
in which to complete it. 


2 The total group was split into two separate 
groups of approximately equal size for purposes of 
testing. 

3 The author wishes to thank Graeme Halford for 
his assistance in the reliability check. 


TABLE 1 

CORRELATIONS OF PROBABILITY ESTIMATES WITH n ACH 
AND TEST ANXIETY FOR MODERATELY DIFFICULT 
(N = 82) AND EASY (N = 86) EXPERIMENTAL 
CONDITIONS 

               Moderately difficult       Easy 
               condition                  condition 
Probability 
estimate       n Ach    Test anxiety      n Ach    Test anxiety 

Initial         .20*     -.27**           -.20     -.39*** 
Middle          .12      -.04             -.14     -.19 
Terminal       -.05      -.06             -.11     -.03 

* p < .05, one-tailed test. 
** p < .01, one-tailed test. 
*** p < .0005, one-tailed test. 


Before commencing the task all subjects were 
required to estimate their chances of solving all the 
anagrams by checking a 5-inch scale numbered from 
0 to 100 in equal steps of 20, with the statement 
“No chance at all” at one extreme of the scale, the 
statement “An even chance” at the middle of the 
scale, and the statement “Completely certain” at 
the other extreme of the scale. This procedure pro- 
vided a measure of a subject’s expectation of success 
prior to task performance. In a postperformance 
questionnaire subjects were also required to estimate 
what they considered their chances of success to 
be when they had worked at the test for 9-10 
minutes (about half the total time allowed), and 
just before they were asked to stop working at the 
test. This procedure provided measures of a subject’s 
“middle” and “terminal” expectations of success, i.e., 
expectations modified by task performance. 

All subjects worked for 18 minutes at the same 
set of 13 difficult anagrams which are listed in 
Sarason’s (1961) report. Eighty-two subjects worked 
under the instructions for the moderately difficult 
condition; 86 subjects worked under the instructions 
for the easy condition. 


TABLE 2 

MEAN INITIAL PROBABILITY ESTIMATES 
IN RELATION TO n ACH 

               Moderately difficult      Easy 
               condition                 condition 
n Ach 
                M     SD    N             M     SD    N 

High           .45   .23   19            .56   .30   13 
Upper middle   .45   .19   22            .56   .11   21 
Lower middle   .40   .23   27            .55   .18   19 
Low            .30   .19   14            .63   .23   33 


The mean n Ach and Test Anxiety scores for subjects 
in the moderately difficult and easy conditions 
were as follows: for the moderately difficult condition, 
mean n Ach = 4.49, SD = 4.04, mean Test 
Anxiety = 95.82, SD = 25.59; for the easy condition, 
mean n Ach = 2.92, SD = 4.37, mean Test Anxiety 
= 96.21, SD = 28.78.4 


RESULTS 


Table 1 presents the correlations of initial, 
middle, and terminal probability estimates 
with n Ach and Test Anxiety scores for both 
experimental conditions. Initial probability 
estimates were those obtained prior to task 
performance. Middle and terminal probability 
estimates were those obtained in the postperformance 
questionnaire, and related to the 
middle and terminal stages of performance. 
Table 2 presents mean initial probability 
estimates for approximate quarters of the 
total distribution of n Ach scores. N Ach [...] 
from 4 to 7 as upper middle, from [...] 
to 3 as lower middle, and below 1 as low. [...] 
total distribution of Test Anxiety scores. Test 
Anxiety scores of 117 and above were classified 
as high, 96 to 116 as upper middle, 
75 to 95 as lower middle, and below 75 
as low. 


TABLE 3 

MEAN INITIAL PROBABILITY ESTIMATES 
IN RELATION TO TEST ANXIETY 

               Moderately difficult      Easy 
               condition                 condition 
Test anxiety 
                M     SD    N             M     SD    N 

High           .29   .17   18            .45   .23   [...] 
Upper middle   .40   .17   22            .60   .16   [...] 
Lower middle   .47   .25   22            .60   .24   [...] 
Low            .45   .24   20            .69   .16   [...] 


4 The mean n Ach score of subjects in the 
moderately difficult condition is significantly greater 
than the mean n Ach score of subjects in the easy 
condition. Since subjects were randomly assigned 
to each condition this difference may be considered 
accidental rather than as reflecting a systematic 
influence. When the n Ach distributions are matched 
by deleting subjects the analysis of results indicates 
the same trends as described below, but the relationships 
are attenuated because of the restriction in the range of 
n Ach scores. 


In general, the results shown in Tables 1, 
2, and 3 support the predictions. In the first 
place, Table 1 indicates that initial probabil- 
ity estimates and n Ach scores are positively 
correlated in the moderately difficult condi- 
tion (r = .20, df = 80, p < .05). This posi- 
tive relationship is also apparent in Table 2 
where the mean initial probability estimate 
for the moderately difficult condition is 
greater when n Ach is high than when it is 
low. These results imply that, in a situation 
which is assumed to elicit a relatively high 
level of achievement motivation, subjects 
high in n Ach tend to have relatively higher 
expectations of success prior to task per- 
formance. Initial probability estimates and 
n Ach scores are not positively related in 
the easy condition. In fact, the relationship 
is negative (r = —.20) and would be sta- 
tistically significant if a one-tailed test were 
applied. This negative relationship is also 
evident in Table 2 where the mean initial 
probability estimate for the easy condition is 
less when n Ach is high than when it is low. 
The correlation of initial probability esti- 
mates with n Ach scores differs significantly 
between the two experimental conditions 
(z = 2.58, p < .01). These results imply 
that, in a situation which is assumed to elicit 
a relatively low level of achievement motiva- 
tion, subjects low in n Ach may tend to have 
relatively higher expectations of success prior 
to task performance. 
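
The reported difference between the two correlations can be checked with the usual Fisher r-to-z test for independent samples. The sketch below is offered as an illustration of that standard test; the text does not state which procedure was used, and the function name is hypothetical.

```python
import math


def fisher_z_difference(r1, n1, r2, n2):
    """z statistic for the difference between two independent correlations."""
    z1 = math.atanh(r1)  # Fisher r-to-z transformation
    z2 = math.atanh(r2)
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (z1 - z2) / standard_error


# r = .20 with N = 82 (moderately difficult) versus r = -.20 with N = 86 (easy).
z = fisher_z_difference(0.20, 82, -0.20, 86)
```

With these inputs the statistic comes to about 2.58, which agrees with the value given in the text.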

An important assumption involved in the 
foregoing prediction is that subjects should 
tend to perceive the task in the moderately 
difficult condition as fairly difficult, as one 
which permits the opportunity for personal 
accomplishment which can be evaluated 
against standards of excellence. In contrast, 
we assume that subjects should tend to 
perceive the task in the easy condition as 
relatively easy, as one for which successful 
performance should be possible for most 
people. In terms of Atkinson’s (1957) model, 
these assumptions imply that a higher level 
of achievement motivation would be elicited 
in the moderately difficult condition. The 
results do indicate that the mean initial 
probability estimate is significantly lower in 
the moderately difficult condition (M = .41 
for moderately difficult condition, M = .59 
for easy condition, F = 27.77, df = 1/166, 
p < .001). Hence the different instructions 
appear to have determined the expected difference 
in subjects’ initial expectations of success, 
although the mean initial probability 
estimate in the easy condition is not as high as 
we had anticipated. According to Atkinson’s 
(1957) model, Ps values of .41 and .59 would 
be associated with equal levels of achievement 
motivation for an individual in a test 
situation. The results do indicate, however, 
that there is a significant positive correlation 
in the moderately difficult condition (r = .25, 
df = 80, p < .025, one-tailed test) between 
performance scores (number of anagrams cor- 
rectly solved) and n Ach scores, but no 
significant relationship between these scores 
in the easy condition (r = —.11, df= 84, 
ns). This result suggests that in an achieve- 
ment situation where a deliberate attempt 
is made to reduce anxiety about failure by 
reassurance (as in the moderately difficult 
condition) a positive relationship between n 
Ach scores and performance may be more 
apparent. 
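
The remark that Ps values of .41 and .59 imply equal levels of achievement motivation follows from the multiplicative form of Atkinson's (1957) model, in which the tendency to achieve success is the product of motive strength, subjective probability of success, and incentive value of success, with incentive taken as 1 minus Ps. A minimal sketch, with a hypothetical function name and an arbitrary unit motive strength:

```python
def achievement_tendency(motive_strength, p_success):
    """Tendency to achieve success: Ms x Ps x Is, with Is = 1 - Ps."""
    incentive = 1.0 - p_success
    return motive_strength * p_success * incentive


# Motive strength is set to 1.0 purely for illustration.
at_moderately_difficult = achievement_tendency(1.0, 0.41)
at_easy = achievement_tendency(1.0, 0.59)
at_even_chance = achievement_tendency(1.0, 0.50)
```

Because Ps(1 - Ps) is symmetric about .50, the two obtained means yield the same product (about .24), only slightly below the maximum of .25 at an even chance. This is why the obtained difference in mean estimates does not by itself establish a difference in aroused achievement motivation.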

Table 1 also shows that initial probability 
estimates and Test Anxiety scores are nega- 
tively correlated in the moderately difficult 
condition (r = —.27, df = 80, p < .01) and 
in the easy condition (r = —.39, df = 84, 
p < .0005). This negative relationship is also 
evident in Table 3 where the mean initial 
probability estimates for both moderately 
difficult and easy conditions are less when 
test anxiety is high than when it is low. 
These results are consistent with predictions 
and with the evidence of the previous study 
(Feather, 1963c). They imply that subjects 
high in test anxiety tend to have relatively 
lower expectations of success prior to per- 
formance of a complex task in a test situa- 
tion. Contrary to prediction there is no evi- 
dence that the negative correlation between 
initial probability estimates and test anxiety 
is higher in the moderately difficult condition. 
In fact the reverse occurs although the dif- 
ference in correlations is not statistically sig- 
nificant. The lower negative correlation in 
the moderately difficult condition may reflect 
the influence of the greater reassurance provided 
by the instructions for this condition. 
Subjects were told not to worry if they encountered 
some very difficult anagrams. This 
instruction would tend to reduce anxiety 
about failure. In terms of Atkinson’s (1957) 
model such a reassuring instruction may have 
the effect of reducing the magnitude of the 
negative incentive value of failure (If) across 
the entire range of subjective probabilities 
(Ps), and hence would determine lower levels 
of motivation to avoid failure. This would 
be an example of a situational effect on 
value (cf. Feather, 1959). 
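
The significance levels attached to the correlations between initial probability estimates and the two motive measures can be checked by converting each r to a t statistic with N - 2 degrees of freedom. The sketch below uses the standard conversion; the critical values in the comments are approximate tabled one-tailed values for 80 degrees of freedom and are supplied here as an assumption, not taken from the text.

```python
import math


def t_from_r(r, df):
    """t statistic for testing a Pearson correlation against zero."""
    return r * math.sqrt(df / (1.0 - r * r))


# Correlations of initial probability estimates reported in Table 1.
t_n_ach_difficult = t_from_r(0.20, 80)     # about 1.83
t_anxiety_difficult = t_from_r(-0.27, 80)  # about -2.51
t_anxiety_easy = t_from_r(-0.39, 84)       # about -3.88

# Approximate one-tailed critical values of t for df = 80 (assumed from
# standard tables): 1.664 (p = .05), 2.374 (p = .01), 3.416 (p = .0005).
```

Each statistic exceeds, in absolute value, the critical value for the level reported in the text (.05, .01, and .0005 respectively).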

In Table 4 mean initial probability esti- 
mates are presented for the simultaneous 
classification of n Ach and test anxiety. 
Subjects are classified as high or low in n Ach 
or test anxiety according to whether they 
are above or below the median of the respec- 
tive total distribution of scores. N Ach scores 
of 4 and above are classified as high, below 4 
as low. Test Anxiety scores of 96 and above 
are classified as high, below 96 as low. 

Table 4 represents the joint influence of 
n Ach and test anxiety. The mean initial 
probability estimates differ significantly for 
both the moderately difficult condition 
(F = 3.37, df = 3/78, p < .05) and the easy 
condition (F = 3.28, df=3/82, p< .05). 
For both experimental conditions subjects 
classified as high in n Ach and low in test 
anxiety tend to have higher initial probabil- 
ity estimates than subjects classified as low 
in n Ach and high in test anxiety, although 
this difference is negligible in the easy condi- 
tion where it is overshadowed by the rela- 
tively high mean initial probability estimate 
for subjects classified as low in both n Ach 
and test anxiety. This relatively high mean 
reflects the negative correlation of initial 
probability estimates with both n Ach and 
test anxiety in the easy condition. In a 
sense, subjects low in both n Ach and test 
anxiety are behaving realistically since they 
rate their chances quite highly when told 
that the task should in fact be easy. 


TABLE 4 

MEAN INITIAL PROBABILITY ESTIMATES IN RELATION 
TO n ACH AND TEST ANXIETY 

                    Moderately difficult      Easy 
                    condition                 condition 
n Ach    Test 
         anxiety     M     SD    N             M     SD    N 

High     High       .43   .14   20            .56   .19   19 
High     Low        .47   .25   21            .56   .22   15 
Low      High       .28   .18   20            .51   .21   23 
Low      Low        .45   .23   21            .68   .19   29 


TABLE 5 

CORRELATIONS OF PROBABILITY ESTIMATES 
WITH TASK PERFORMANCE 

               Task performance 
Probability 
estimate       Moderately             Easy 
               difficult condition    condition 

Initial        [illegible]            .14 
Middle         .60**                  [illegible] 
Terminal       .60**                  .61** 

* p < .005, one-tailed test. 
** p < .0005, one-tailed test. 

Finally, the results in Table 1 show that 
neither middle nor terminal probability esti- 
mates are significantly correlated with n Ach 
or Test Anxiety scores. This lack of signifi- 
cant relationships is consistent with the as- 
sumption that, once a person is involved in 
performance of a task, his present perform- 
ance becomes the dominant influence in 
shaping his expectations of success. Whereas 
prior to performance his expectations may 
tend to be shaped by his past history of 
success and failure in similar situations, once 
he begins the task the frequency and pat- 
terning of present success and failure will 
assume the major role in determining expecta- 
tions. The effect of present performance on 
expectations is in fact evident in the corre- 
lation of performance scores with middle and 
terminal probability estimates obtained in the 
postperformance questionnaire. We would 
expect these estimates to be influenced by 
success and failure at the task, those sub- 
jects with relatively high performance scores 
being those with relatively high middle and 
terminal probability estimates. Table 5 presents 
the correlations of initial, middle, and 
terminal probability estimates with performance 
scores for both experimental conditions.5 
The highly significant positive correlations of 
performance scores with middle and terminal 
probability estimates are taken as indicating 
the modifying effect of success and failure 
at the task on expectations of success. 


5 The positive correlation of performance scores 
with initial probability estimates in the moderately 
difficult condition parallels the positive relationship 
of persistence scores to initial probability estimates 
obtained in a previous investigation (Feather, 1963b) 
where an insoluble task was presented to subjects 
as very difficult (see Feather, in press, for full discussion 
of performance data from the present 
study). 


DISCUSSION 


These results generally support predictions 
and are consistent with the underlying assumptions. 
They suggest that, in order to 
predict the relationship between expectation 
of success prior to task performance and 
achievement-related motives, one requires 
two main types of information. First, it is 
necessary to determine whether the situation 
is one which would arouse achievement 
motivation or anxiety about failure. Second, 
one would need to know what the effect of 
this aroused motivation is likely to have been 
when subjects performed similar tasks in the 
past under conditions similar to those now 
present. In particular, has this aroused motivation 
determined relatively more success 
than failure in the past, or vice versa? We 
believe that subjects tend to categorize performance 
situations in terms of their past 
experience. For example, a subject high in 
n Ach might tend to categorize a situation, 
where he has the opportunity to excel, as one 
in which he usually does rather well in 
comparison to others. In contrast, a subject 
high in test anxiety might tend to categorize 
the same test situation as one in which 
he usually performs badly, particularly if the 
task is complex. 

In general, there are probably several cues 
in a test situation to which a subject responds 
when estimating his chances of success prior 
to task performance. The important cues are 
those which have been consistently associated 
with his success and failure in the past. 
Some cues may have similar implications 
about present performance; other cues may 
have conflicting implications. For example, a 
subject high in test anxiety may feel pessimistic 
about his chances of success at a 
complex task since he has done poorly when 
working at complex tasks in test situations 
in the past. But the task may be verbal in 
character and he may have found that in 
the past he has often excelled at verbal activi- 
ties. Thus the verbal nature of the task may 
determine some optimism about chances of 
success. In this example the different charac- 
teristics of a complex, verbal task have dif- 
ferent implications for present performance. 
For other cases task characteristics may have 
similar rather than conflicting implications 
for present performance. Somehow the subject 
has to resolve these implications when he 
estimates his chances of success at the task. 
Future research might investigate which 
features of a test situation have greatest 
weight in determining a subject’s initial 
expectation of success. It is likely that they 
would be those characteristics of the situa- 
tion which have been closely and reliably 
related to the quality of task performance 
in the past. 

Since ability measures are customarily 
taken as important predictors of performance, 
the above line of argument suggests that 
subjects with high verbal ability may tend 
to report relatively high initial probability 
estimates in both the moderately difficult and 
easy experimental conditions. The anagrams 
test is verbal in nature, Subjects with high 
verbal ability may therefore react to it in 
terms of their generally superior performance 
on previous verbal tasks, and estimate their 
chances of success as fairly high. Table 6
presents data relevant to this prediction.


TABLE 6

CORRELATIONS OF PROBABILITY ESTIMATES AND TASK
PERFORMANCE WITH VERBAL ABILITY FOR
MODERATELY DIFFICULT (N = 73) AND
EASY (N = 73) EXPERIMENTAL CONDITIONS

                                 Verbal ability (Test AL)
Variable                         Moderately difficult   Easy
                                 condition              condition
Initial probability estimate     .01                    [illegible]
Middle probability estimate      .09                    .21*
Terminal probability estimate    .09                    .10
Task performance                 .24*                   .23**

* p < .05, one-tailed test.
** p < .01, one-tailed test.


Scores on the Test AL were available for 
the majority of subjects. This test was de-
veloped by the Australian Council for Edu-
cational Research as an advanced level test
of verbal ability. Table 6 shows that initial
probability estimates are positively related to 
scores on Test AL in the easy condition but 
not in the moderately difficult condition. 
Performance scores on the anagrams test 
and scores on Test AL are positively corre-
lated for both experimental conditions.⁶ Thus
the results for the easy condition support 
our prediction while those for the moderately 
difficult condition do not. It is possible, how- 
ever, that the instructions for the moderately 
difficult condition, in implying that the ana- 
grams were harder than most usually en- 
countered, may have suggested to subjects 
that their past experience specifically at ana- 
grams (and more generally at verbal tasks) 
would not be a reliable guide to present 
performance. They could, however, still per- 
ceive the task as a challenge and respond to 
it in terms of how they met similar challenges 
in the past. In this respect motives to achieve 
success and to avoid failure would be impor- 
tant influences on performance and we would 
therefore expect present expectations of suc- 
cess to relate to these motives as predicted. 
One rather unexpected finding in the pres- 
ent study is the negative correlation between 
initial probability estimates and n Ach scores 
in the easy condition. In terms of the pre- 
ceding argument, this result suggests that 
subjects low in n Ach may tend to perform 
better than subjects high in n Ach in situa- 
tions where a task is presented as easy and 
is in fact easy. As indicated previously, we 
would not expect a situation of this type to 
elicit much achievement motivation, so su- 
perior performance may reflect the operation 
of other types of motivation, for example, 
desire for social approval. It is possible also 
that subjects high in n Ach may be relatively 
uninterested in this type of situation, and
may not feel it worthwhile to try very hard
to do well. In studies reported by Atkinson
(1958, pp. 288-305) and by McClelland
(1961, pp. 217-221), subjects low in n Ach
tended to perform better at a task than
subjects high in n Ach, when the chances
of winning were high (3/4). Differences were
not, however, statistically significant. Atkin-
son (1955, p. 502) also found in an earlier
study that subjects low in n Ach tended to
have relatively higher performance scores
under relaxed conditions than subjects high
in n Ach but differences were not statistically
significant. Further research is needed in this
area.

6 Verbal ability scores are negatively correlated
with Test Anxiety scores for both experimental con-
ditions (for moderately difficult condition, r = -.30,
p < .01; for easy condition, r = -.26, p < .01).
These negative relationships are consistent with
previous results reported by Sarason (1963).
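
The significance levels attached to correlations such as those in Table 6 can be checked approximately with the usual t transformation of r. The sketch below is illustrative only. It assumes n = 73 in each condition (Test AL scores were available only for the majority of subjects, so the effective n may be smaller), and the critical values are approximate table values supplied here as an assumption, not figures taken from the study.

```python
import math

N_PER_CONDITION = 73   # assumption: full sample size in each condition
T_CRIT_05 = 1.667      # assumed approximate one-tailed critical t, df = 71, p = .05
T_CRIT_01 = 2.380      # assumed approximate one-tailed critical t, df = 71, p = .01


def t_from_r(r, n):
    """Convert a Pearson correlation r based on n cases to a t statistic (df = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


def one_tailed_label(r):
    """Compare |t| with the assumed critical values; valid only for n = 73 (df = 71)."""
    t = abs(t_from_r(r, N_PER_CONDITION))
    if t >= T_CRIT_01:
        return "p < .01"
    if t >= T_CRIT_05:
        return "p < .05"
    return "n.s."


if __name__ == "__main__":
    # Two of the legible easy-condition entries of Table 6.
    for r in (0.21, 0.10):
        print(r, round(t_from_r(r, N_PER_CONDITION), 2), one_tailed_label(r))
```

Under these assumptions a correlation of .21 just clears the one-tailed .05 level, while .10 does not, which agrees with the asterisks attached to those two entries.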

An alternative interpretation of the positive 
correlation of initial probability estimates and 
n Ach scores in the moderately difficult 
condition and the negative correlation of these 
scores in the easy condition is in terms of 
Brody’s (1963) assumption that subjects will 
tend to state a subjective probability of 
success that will maximize their resultant 
motivation. According to Atkinson’s (1957) 
theory of achievement motivation this would 
be at a subjective probability of .50 for 
subjects in whom the motive to achieve
success (M_s) is stronger than the motive to
avoid failure (M_af). The results do show that
the mean probability estimates of subjects
high in n Ach and low in test anxiety are
close to .50 in both the moderately difficult
and the easy conditions (Table 4). If we
assume that these subjects tend to be those
in whom M_s > M_af, then these results would
be consistent with the above interpretation.
However, this line of argument would also
imply that subjects in whom M_af > M_s
should tend to state either very low or very
high probabilities.⁷ If we assume that subjects
low in n Ach and high in test anxiety
tend to be those in whom M_af > M_s, then
these subjects should tend to avoid stating
probabilities of .50. But while the results in
Table 4 for the moderately difficult condition
are consistent with this prediction, those for
the easy condition are not. In any case, the
assumption that subjects in whom M_s > M_af
should state probabilities so as to maximize
resultant positive motivation, and subjects in
whom M_af > M_s should state probabilities so
as to minimize resultant negative motivation,
needs some further clarification. Atkinson
(1957) applies his theory to risk-taking
behavior where a subject is required to perform
an achievement task which varies in
difficulty. In making a choice the subject
commits himself to task performance at a
particular level of difficulty (for example, to
throwing the ring at a chosen distance from
the peg in a ringtoss situation). This type of
situation appears to be different from one
in which a subject is required to state probabilities
and where there may be no necessary
commitment to action. Furthermore, in Atkinson’s
theory subjective probabilities are assumed
to influence strength of motivation
which then determines choice, whereas Brody
appears to reverse the causal sequence in
assuming that a preference for a maximum
(or minimum) level of motivation determines
the stated probability. Just why subjects
would prefer a maximum (or minimum) level
of motivation needs further clarification.

We do not, however, wish to deny the possibility
of “motivational” types of effect.
Brody’s (1963) experiment is of value in
suggesting this kind of interpretation. It may
be argued, for example, that the results of
the present study demonstrate the influence
of subjective attractiveness of success and
subjective repulsiveness of failure on a person’s
probability statements.⁸ Within each
experimental condition subjects high in the
motive to achieve success (M_s) may tend to
perceive success as more attractive than do
subjects low in the motive to achieve success.
This relatively higher perceived attractiveness
may tend to dominate the judgments of
probability of success made by subjects who
are high in M_s such that these judgments are
more “wishful” than reality oriented. Such
effects should be especially likely in situations
where reality constraints are few and
where the positive incentive value of success
(I_s) tends to be high (for example, in the
moderately difficult situation). Subjects high
in M_s may, as it were, react autistically by
stating high probability estimates, expressing
the hope that they may succeed at a task
for which success is seen as very attractive.
This argument would imply a positive correlation
between initial probability estimates
and n Ach scores assuming these latter scores
reflect strength of M_s. This positive correlation
should be especially apparent in the
moderately difficult condition where perceived
attractiveness of success should tend to be
relatively high because of the higher I_s.

7 We assume that these subjects would not
maximize their resultant negative motivation but,
in fact, would tend to minimize this motivation.
Brody’s (1963) statement is rather ambiguous in
this respect. We interpret his principle as a maximizing
of resultant positive motivation by subjects
in whom M_s > M_af, and a minimizing of resultant
negative motivation by subjects in whom M_af >
M_s. This interpretation is consistent with his predictions
about changes in confidence ratings (p. 414).

8 Subjective attractiveness of success (or positive
valence or utility of success) may be conceptualized
in Atkinson’s model as the multiplicative product of
motive to achieve success (M_s) and positive incentive
value of success (I_s). Similarly, subjective repulsiveness
of failure (or negative valence or utility of
failure) may be conceptualized as the multiplicative
product of motive to avoid failure (M_af) and
negative incentive value of failure (I_f) (see Atkinson,
1964). In Atkinson’s model I_s = 1 - P_s and
I_f = -P_s. We believe, however, that incentive values
may depend on the nature of the situation (for
example, test versus nontest) as well as upon the
difficulty of the task (cf. Feather, 1959).
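
The claim that resultant motivation is extreme at a subjective probability of .50 can be illustrated numerically. The sketch below is a minimal illustration, not the author's computation: it uses the incentive definitions I_s = 1 - P_s and I_f = -P_s from Atkinson's model, adds the standard assumption that the probability of failure is 1 - P_s, and uses arbitrary, hypothetical motive strengths.

```python
def resultant_motivation(m_s, m_af, p_s):
    """Resultant achievement motivation for motive strengths m_s, m_af at probability p_s."""
    i_s = 1.0 - p_s              # positive incentive value of success
    i_f = -p_s                   # negative incentive value of failure
    p_f = 1.0 - p_s              # assumption: probability of failure is the complement
    approach = m_s * p_s * i_s   # tendency to approach success
    avoidance = m_af * p_f * i_f # tendency to avoid failure (never positive)
    return approach + avoidance  # algebraically (m_s - m_af) * p_s * (1 - p_s)


# Candidate subjective probabilities .1, .2, ..., .9
GRID = [i / 10 for i in range(1, 10)]

if __name__ == "__main__":
    # Hypothetical motive strengths chosen only for illustration.
    for m_s, m_af in ((2.0, 1.0), (1.0, 2.0)):
        values = [round(resultant_motivation(m_s, m_af, p), 3) for p in GRID]
        print(m_s, m_af, values)
```

With M_s > M_af the resultant is positive and largest at .50; with M_af > M_s it is negative and most negative at .50, so a subject minimizing negative motivation would be expected to state probabilities away from .50.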

Similarly, within each experimental condi- 
tion subjects high in the motive to avoid 
failure (M_af) may tend to perceive failure as
more repulsive than do subjects low in the
motive to avoid failure. This relatively higher
perceived repulsiveness may determine lower
judgments of probability of success among
subjects who are high in M_af. These effects
should be especially likely in situations where
reality constraints are minimal and where the
negative incentive value of failure (I_f) tends
to be high (for example, in the easy situation).
Subjects high in M_af may, as it were,
react defensively by stating low probability
estimates, since failure would not be so
repulsive if the task were accepted as fairly
difficult.⁹ This argument would imply a negative
correlation between initial probability
estimates and Test Anxiety scores assuming
these latter scores reflect strength of M_af.
This negative correlation should be especially
evident in the easy condition where perceived
repulsiveness of failure should tend to be
relatively high because of the higher I_f.

9 Probability statements are public in that the
subject knows that the experimenter will see them.
The subject’s response may therefore also be influenced
by his concern about how the experimenter
will react to the probability estimate which the
subject reports.

The above type of interpretation therefore 
assumes an autistic biasing of judgments of 
probability of success among subjects high 
in M_s, especially under conditions where
I_s is high (i.e., where the task is pre-
sented as difficult), and a defensive biasing
of these judgments among subjects who are
high in M_af, especially under conditions where
I_f is high (i.e., where the task is presented
as easy). In general, the results of the present 
experiment are consistent with this interpreta- 
tion, although it also has difficulty in ac- 
counting for the obtained negative relation- 
ship between initial probability estimates and 
n Ach scores in the easy condition. 

This alternative interpretation could be 
tested further by presenting an achievement 
task to subjects at a number of different 
levels of difficulty and investigating relation- 
ships between motive assessments and initial 
probability estimates. The predicted positive 
relationship between initial probability esti- 
mates and n Ach should be stronger when 
the task is presented as very difficult, than 
as moderately difficult, than as easy. The 
predicted negative relationship between initial 
probability estimates and test anxiety should 
be stronger when the task is presented as 
easy, than as moderately difficult, than as 
very difficult.¹⁰ These predictions are clearly
different from those which would follow from 
the type of theoretical analysis developed in 
the present paper which sees the relation- 
ships as reflecting the influence of past experi- 
ence in related situations and which considers 
differences in past performance in terms of 
Atkinson’s theory of achievement motivation. 


Our present theoretical approach would predict
that the strongest positive relationship
between initial probability estimates and n
Ach scores, and the strongest negative relationship
between initial probability estimates
and Test Anxiety scores, should both occur
when the task is presented as moderately difficult.
Here then is an instance where the
alternative theoretical approaches imply different
predictions. These predictions now
await experimental test.

10 Table 1 does in fact indicate a higher negative
relationship between initial probability estimates and
Test Anxiety scores in the easy condition.
A study of constructive conformity. Experimental studies have generally
pointed to the negative effects of group pressure. The present experiment
shows that in specifiable circumstances group pressure enables a person to
resolve conflicting forces in a direction congruent with his values. In Exp. I
Ss are instructed to administer increasingly more severe shocks to a victim.
In Exp. II, 2 confederates who defy E’s authority are introduced into
this situation and effectively free Ss from obedience to E’s destructive commands.
In Exp. III, the 2 confederates follow the E’s commands blindly,
but this does not lead to increased compliance. Some factors contributing to
the powerful effect of the disobedient group are described. The author suggests
that there is a direction to potential group effects inherent in the structure
of a given social field.


In laboratory research, the effect of group
pressure has most often been studied in its
negative aspect; the conspiratorial group is
shown to limit, constrain, and distort the
individual’s responses (Asch, 1951; Blake &
Brehm, 1954; Milgram, 1964). Edifying effects
of the group, although acknowledged,
have rarely been demonstrated with the clarity
and force of its destructive potential.
Particularly in those areas in which a morally
relevant choice is at issue, experimentalists
typically examine pressures that diminish the
scope of individual action. They have neglected
effects that enhance the individual’s
sense of worth, enlarge the possibilities for
action, and help the subject resolve conflicting
feelings in a direction congruent with his
ideals and values. Although in everyday life
occasions arise when conformity to group
pressures is constructive, in the laboratory
“thinking and investigation have concentrated
almost obsessively on conformity in its most
sterile forms [Asch, 1959].”²


1 This research was supported by two grants from
the National Science Foundation, G-17916 and
24152. The experiments were conducted while the
author was at Yale University. Pilot studies completed
in 1960 were financed by a grant from the
Higgins Fund of Yale University. My thanks to Rhea
Mendoza Diamond for her help in revising the
original manuscript.

There are technical difficulties to demonstrating
the value enhancing potential of
group pressure. They concern the nature of 
the base line from which the group effect is
to be measured. The problem is that the 
experimental subject ordinarily acts in a 
manner that is socially appropriate. If he has 
come to the laboratory to participate in a 
study on the perception of lines, he will gen- 
erally report what he sees in an honest 
manner. If one wishes to show the effects of 
group influence by producing a change in his 
performance, the only direction open to 
change is that of creating some deficiency 
in his performance, which can then be 
attributed to group influences. 

If men tend to act constructively under 
usual circumstances the obvious direction of 
an induced and measurable change is toward 
inappropriate behavior. It is this technical 
need rather than the inherently destructive 
character of group forces that has dictated the 
lines of a good deal of laboratory research. 
The experimental problem for any study of 
constructive conformity is to create a situa- 
tion in which undesirable behavior occurs with 
regularity and then to see whether group 
pressure can be applied effectively in the 
direction of a valued behavior outcome.³


3 Another solution would be to wait until people
who perform in a naturally destructive way come
to the laboratory and to use them as subjects. One
might deliberately seek out a group of recidivist
delinquents who would ordinarily behave in a disvalued
manner, and then study group effects on
their performance. This would, of course, limit the
study to an atypical population.


EXPERIMENT I: BASE-LINE CONDITION


A technique for the study of destructive
obedience (Milgram, 1963, in press) generates
the required base line. In this situation
a subject is ordered to give increasingly more
severe punishment to a person. Despite the
apparent discomfort, cries, and vehement
protests of the victim, the experimenter instructs
the subject to continue stepping up
the shock level.


Technique


Two persons arrive at a campus laboratory to
take part in a study of memory and learning. (One
of them is a confederate of the experimenter.) Each
subject is paid $4.50 upon arrival, and is told that
payment is not affected in any way by performance.
The experimenter provides an introductory talk on
memory and learning processes and then informs
the subjects that in the experiment one of them
will serve as teacher and the other as learner. A
rigged drawing is held so that the naive subject
is always assigned the role of teacher and the
accomplice becomes the learner. The learner is taken
to an adjacent room and is strapped into an electric
chair.

The naive subject is told that it is his task to
teach the learner a list of paired associates, to
test him on the list, and to administer punishment
whenever the learner errs in the test. Punishment
takes the form of electric shock, delivered to the
learner by means of a shock generator controlled
by the naive subject. The teacher is instructed to
increase the intensity of the electric shock one step
on the generator on each error. The generator contains
30 voltage levels ranging from 15 to 450 volts,
and verbal designations ranging from Slight Shock
to Danger: Severe Shock. The learner, according to
plan, provides many wrong answers, so that before
long the naive subject must give him the strongest
shock on the generator. Increases in shock level are
met by increasingly insistent demands from the
learner that the experiment be stopped because of
growing discomfort to him. However, the experimenter
instructs the teacher to continue with the
procedure in disregard of the learner’s protests.⁴

A quantitative value is assigned to the subject’s
performance based on the maximum intensity shock
he administered before breaking off. Thus any subject’s
score may range from 0 (for a subject unwilling
to administer the first shock level) to 30
(for a subject who proceeds to the highest voltage
level on the board).

4 The shock generator, the schedule of
protests from the learner, and other details of procedure
have been described elsewhere and will not
be restated here (Milgram, 1963, 1964).
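
The scoring rule described above can be sketched in a few lines. The code is an illustration under stated assumptions rather than part of the original procedure: the function names are hypothetical, the 15-volt step follows from 30 levels spanning 15 to 450 volts, and the grouping of levels under verbal designations is taken from the layout of Table 1.

```python
STEP_VOLTS = 15   # 30 levels from 15 to 450 volts imply 15-volt steps
MAX_LEVEL = 30

# Verbal designations in the order shown in Table 1; each label is assumed
# to cover four consecutive levels, with the last label covering levels 29-30.
DESIGNATIONS = [
    "Slight Shock", "Moderate Shock", "Strong Shock", "Very Strong Shock",
    "Intense Shock", "Extreme Intensity Shock", "Danger: Severe Shock", "XXX",
]


def level_to_volts(level):
    """Voltage for a shock level (0 means no shock was administered)."""
    if not 0 <= level <= MAX_LEVEL:
        raise ValueError("level must be between 0 and 30")
    return level * STEP_VOLTS


def designation(level):
    """Verbal designation printed above a given shock level (1-30)."""
    if not 1 <= level <= MAX_LEVEL:
        raise ValueError("level must be between 1 and 30")
    return DESIGNATIONS[(level - 1) // 4]


def score(levels_administered):
    """Subject's score: the maximum level given before breaking off, 0 if none."""
    return max(levels_administered, default=0)


if __name__ == "__main__":
    given = list(range(1, 15))   # hypothetical subject who stops after level 14
    s = score(given)
    print(s, level_to_volts(s), designation(s))
```

A subject who refuses the first shock scores 0, and one who proceeds to the end of the board scores 30, as in the text.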
Subjects


The subjects used in the several experimental
conditions were male adults residing in the greater
New Haven area, aged 20-50 years, and engaged in a
wide variety of occupations. Each experimental con-
dition described here employed 40 fresh subjects and
was carefully balanced for age and occupational types
(see Milgram, 1963, Table 1, for details).


Results and Discussion 


In this situation a subject is instructed to 
perform acts that are in some sense incom-
patible with his normal standards of behavior.
In the face of the vehement protests of
an innocent individual, many subjects refuse 
to carry out the experimenter’s orders to con- 
tinue with the shock procedure. They reject 
the role assignment of experimental subject, 
assert themselves as persons, and are unwilling 
to perform actions that violate personal stand-
ards of conduct. The distribution of break-off
points for this condition is shown in Table 1, 
Column 1. Fourteen of the 40 subjects with- 
draw from the experiment at some point 
before the completion of the command series. 
The majority of subjects, however, comply 
fully with the experimenter’s commands, despite
the acute discomfort they often experi-
ence in connection with shocking the victim. 
Typically these obedient subjects report that 
they do not wish to hurt the victim, but they 
feel obligated to follow the orders of the ex- 
perimenter. On questioning they often state 
that it would have been “better” not to 
have shocked the victim at the highest 
voltage levels. Consider, for example, the 
remarks of the following obedient subject: 
He has completed the experiment and is now 
questioned by an interviewer (who is not 
the experimenter). 


I’d like to ask you a few questions. How do you
feel? I feel all right, but I don’t like what happened
to that fellow in there [the victim]. He’s been
hollering and we had to keep giving him shocks.
I didn’t like that one bit. I mean he wanted to get
out but he [the experimenter] just kept going; he
kept throwing 450 volts. I didn’t like that.

Who was actually pushing the switch? I was, but
he kept insisting. I told him “No,” but he said you
got to keep going. I told him it’s time we stopped
when we get up to 195 or 210 volts.

Why didn’t you just stop? He wouldn’t let me.
I wanted to stop. I kept insisting to stop, but he
said “No.” . . . I figured the voltage we were giving
him was quite a bit. I wanted to stop but he [the
experimenter] kept insisting not to stop. I mean
the fellow in there is hollering “I don’t want to do
it. I want to get out of here. I want to get out
of here!”

Why didn’t you just disregard what the experimenter
said? He says it’s got to go on, the experiment.

Do you feel a little upset? Well, I mean I feel
concerned about the gentleman in there, I do
. . . I was getting ready to walk out . . . I
couldn’t see the point of going on when the guy is
suffering in there. I figured he was having a heart
attack or something. That’s the reason I wanted
to stop. . . .

The subject was then dehoaxed carefully
and had a friendly reconciliation with the
victim.

There is additional evidence that, in shocking
the victim to the end of the command
series, subjects are engaging in behavior which
they disvalue and see as antithetical to
personal and social ideals:

Spontaneous shock levels. In an experimental
control reported elsewhere (Milgram,
1964), subjects administer any voltage level
they wish in the absence of group or authoritarian
pressure. Under this condition the
mean maximum shock for 40 subjects is 82.5
volts (level = 5.50); in contrast, under command
of the experimenter (in the base-line
condition described above) the mean maximum
shock is 368.25 volts (level = 24.55).
When left on their own, subjects administer
far lower shock levels than when under
command.


TABLE 1

DISTRIBUTION OF BREAK-OFF POINTS BY CONDITION

Shock   Verbal designation    Base-line   Hypothetical   Disobedient   Obedient
level   and voltage level     condition   subjects       groups        groups
                              (n = 40)    (n = 40)       (n = 40)      (n = 40)
                              (1)         (2)            (3)           (4)

        Slight Shock                      3*
  1        15                 0           0              0             0
  2        30                 0           0              0             0
  3        45                 0           1              0             0
  4        60                 0           1              0             0
        Moderate Shock
  5        75                 0           7              0             1
  6        90                 1           1              0             0
  7       105                 0           1              1             1
  8       120                 0           3              0             0
        Strong Shock
  9       135                 0           2              0             0
 10       150                 6           9              3 ←           2
 11       165                 0           2              4             0
 12       180                 1           3              1             1
        Very Strong Shock
 13       195                 0           1              4             0
 14       210                 0           0              12 ←          0
 15       225                 0           1              0             0
 16       240                 0           1              0             0
        Intense Shock
 17       255                 0           1              0             0
 18       270                 2           0              4             4
 19       285                 0           0              0             0
 20       300                 1           3              2             0
        Extreme Intensity Shock
 21       315                 1           0              3             2
 22       330                 1           0              0             0
 23       345                 0           0              0             0
 24       360                 0           0              1             0
        Danger: Severe Shock
 25       375                 1           0              0             0
 26       390                 0           0              1             0
 27       405                 0           0              0             0
 28       420                 0           0              0             0
        XXX
 29       435                 0           0              0             0
 30       450                 26          0              4             29

Mean maximum shock            24.55       9.00           16.45         25.70
Percentage defiant subjects   35.0%       100.0%         90.0%         27.5%

* These 3 subjects indicated they would refuse to administer even the lowest shock.
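
The summary rows of Table 1 follow directly from the break-off distributions. The sketch below recomputes them for two columns; it is offered as a check under the assumption that the cell counts transcribed here are the ones printed in the table, and the variable and function names are hypothetical.

```python
STEP_VOLTS = 15
MAX_LEVEL = 30

# Break-off distributions: shock level -> number of subjects whose maximum
# shock was that level (nonzero cells of Table 1, Columns 1 and 3).
BASE_LINE = {6: 1, 10: 6, 12: 1, 18: 2, 20: 1, 21: 1, 22: 1, 25: 1, 30: 26}
DISOBEDIENT_GROUPS = {7: 1, 10: 3, 11: 4, 12: 1, 13: 4, 14: 12,
                      18: 4, 20: 2, 21: 3, 24: 1, 26: 1, 30: 4}


def mean_max_level(dist):
    """Mean maximum shock level over all subjects in a condition."""
    n = sum(dist.values())
    return sum(level * count for level, count in dist.items()) / n


def percent_defiant(dist):
    """Percentage of subjects who broke off before the final (30th) level."""
    n = sum(dist.values())
    return 100.0 * (n - dist.get(MAX_LEVEL, 0)) / n


if __name__ == "__main__":
    for name, dist in (("base-line", BASE_LINE),
                       ("disobedient groups", DISOBEDIENT_GROUPS)):
        level = mean_max_level(dist)
        print(name, round(level, 2), round(level * STEP_VOLTS, 2),
              round(percent_defiant(dist), 1))
```

Multiplying the mean level by the 15-volt step gives the mean maximum voltage, which is how a level of 24.55 corresponds to 368.25 volts in the text.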

Self-image. When persons who have not 
performed in the experiment are provided 
with a description of the experimental situa- 
tion, and are asked to predict their own per- 
formance, almost all subjects see themselves 
as defying the experimenter at some point in 
the command series. Moreover, they justify 
their hypothetical behavior in terms of posi- 
tive qualities of character, employing such 
statements as “I’m not the kind of person
who is willing to hurt others even for the 
cause of science.” 

Column 2 of Table 1 shows the anticipated 
break-off points of 40 adult respondents who 
were asked to predict their own experimental 
performance. Hypothetical subjects respond 
in terms of ideals, values, and positive self- 
conceptions; these values are frequently 
overridden by the forces engendered in the 
concrete laboratory situation. 

Moral judgment. Twenty undergraduates 
were asked to render a moral judgment on 
the issue of obeying or disobeying the experi- 
menter. After describing the experiment to the 
respondents in concrete detail, the experi-
menter asked:

In terms of the moral and ethical values most 
important to you, and taking into account the scien- 
tific goals served by the research, how should you 
perform in the experimental situation? Should you 


go through to the end of the experiment or should 
you break off at some point? 


Valued break-off points ranged from 0 to 255 
volts with the median at 150 volts. Subjects
were also asked to explain at length the 
values that should guide their behavior in the 
laboratory situation. A sample statement is 
given below: 


I would say that one should not even start the 
shocks, but rather, forcibly withdraw the moment 
that the outline of the experiment becomes evident.
My reasons for saying this are very complex...
The first is that the society’s morals, which I have
absorbed, teach that it is wrong to hurt a man.
Morals are usually tied in with religion and this
is especially true with Christianity. Having been
raised, then, in an old fashioned, staunchly Lutheran
family I have internalized the view that hurting a
man is wrong, and no amount of intellectual
reasoning can shake this belief.

Thus, there is a clear consensus that the
only behavior consistent with social values
and individual conscience is to defy the ex- 
perimenter at some point before the com- 
pletion of the command series. Yet there is a 
marked discrepancy between this value judg- 
ment and the actual performance of subjects 
in the laboratory. Despite their protests and 
obvious conflict in shocking the victim, a sub- 
stantial number of subjects are unable to 
defy the experimenter’s authority, and pro- 
ceed to administer the highest shock levels 
commanded by him. 


EXPERIMENT II: GROUPS FOR DISOBEDIENCE 


We may now examine in what degree group 
influence can release the subject from authoritarian
control, and allow him to act in a
direction congruent with his values and personal
standards.

The strategy is to replicate Experiment I, 
but with this difference: the subject is placed 
in the midst of two peers who defy the ex- 
perimenter and refuse to punish the victim 
against his will. In what degree will the pres- 
sure created by their action affect the naive 
subject’s behavior? 


Technique 


Forty fresh subjects were studied in this condition. 
The procedure is identical to Experiment I with these 
exceptions: Four persons appear at the laboratory to 
take part in an experiment on “the effects of col-
lective teaching and punishment on memory and
learning.” Three subjects are confederates of the
experimenter and one is a naive subject. The experi-
menter explains that three teachers and one learner
are to be used in the study, and by means of a rigged
drawing, the naive subject is assigned the position
of Teacher 3. The roles of Teacher 1, Teacher 2, and
learner are filled by the confederates. The learner is
strapped into the electric chair and the three teachers
are seated before the shock generator. Teacher 1 is
told to read the list of word pairs, Teacher 2 informs
the subject if he is correct or incorrect in his answers,
and Teacher 3 (the naive subject) administers
punishment. As in Experiment I, the subjects are
instructed to raise the shock level one step each time
the learner makes an error. (More detailed descrip-
tions of the group technique can be found in
Milgram, 1964.)

Behavior of confederates. The confederates comply
with the experimenter through the 150-volt shock.
After the 150-volt level is administered Teacher 1
informs the experimenter that he does not wish to
participate further, because of the learner’s complaints.
(It is at the 150-volt shock level that the
learner makes his first vehement protest.) The experimenter
insists that the subject continue. However,
Teacher 1 is not swayed by the experimenter’s
[commands], gets up from his chair in front of the
shock generator, and takes a seat in another part of
the room. Since the experimenter’s efforts to get
the subject to return to the generator are futile, the
experimenter instructs the remaining two subjects to
continue with the experiment. Teacher 3 (the naive
subject) is to handle the reading of word pairs, in
addition to his regular job of administering electric
shock to the learner.

After Shock Level 14 (210 volts) is administered,
Teacher 2, expressing concern for the learner, refuses
to participate further. The experimenter makes strong
[efforts] to have him continue, but he too leaves his
chair in front of the generator and seats himself at
an opposite corner of the room, saying “I’m willing
to answer any of your questions, but I’m not willing
to shock that man against his will; I’ll have no
part of it.”

At this point the naive subject is seated alone in
front of the shock generator. He has witnessed the
[defiant] actions of two peers. The experimenter orders
the naive subject to continue with the experimental
procedure by himself, indicating that it is essential
that the experiment be completed.

alts and Discussion 


e break-off points of the naive subjects 
Experiment II are shown in Table 1 h 
umn 3, The arrows at Shock Levels 10 
14 indicate the points at which each of 
the confederates defied the experimenter. 

somparing the proportion of obedient to 
it Subjects in Experiments I and II, we 
that the effect of the confederates’ pres- 
Was substantial. In Experiment I, 26 
ects proceeded to the end of the command 


j Jess than one-sixth of this number 


‘in Figure 1, The mean maximum shock 
experiment II (16.45) was also signifi- 
a than in Experiment I (24.55, 
er Shock Level 14 the second confeder- 
lefies the experimenter. Before Level 15 

istered, 25 naive subjects have fol- 
defiant group, while at the corre- 
3 point in Experiment I only 8 sub- 


Course the mean maximum shock in the 
ntal Condition is tied to the precise point 
joltage series where the confederates’ break-off 
ed. In this experiment it is not until Level 14 
th confederates have defied the experimenter. 


GROUP PRESSURE 
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Fic. 1. Proportion of obedient and defiant subjects 
in Experiments I and II. 


jects have refused to follow the experimenter’s 
orders. The confederates appear to exert some 
influence, however, even on those subjects 
who do not follow them immediately. Between 
Voltage Levels 17 and 29, 11 subjects in
Experiment II break off, while only 6 subjects
do so in Experiment I.

In sum, in the group setting 36 of the 40 
subjects defy the experimenter while the cor- 
responding number in the absence of group 
pressure is 14. The effects of peer rebellion 
are most impressive in undercutting the ex- 
perimenter’s authority. Indeed, of the score 
of experimental variations completed in the 
Yale study on obedience none was so effective 
in undermining the experimenter’s authority 
as the manipulation reported here.⁶
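The size of this contrast can be checked from the counts given in the text alone. The following is a minimal sketch, assuming an uncorrected Pearson chi-square on the 2 x 2 table of obedient and defiant subjects; the function name `pearson_chi2_2x2` and the variable names are illustrative and do not come from the original report, which may have used a different computational procedure.

```python
def pearson_chi2_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom


# Counts stated in the text: (obedient, defiant) out of 40 subjects per condition.
exp1 = (26, 14)  # Experiment I: subject alone with the experimenter
exp2 = (4, 36)   # Experiment II: two defiant peers present

defiance_rate_1 = exp1[1] / sum(exp1)
defiance_rate_2 = exp2[1] / sum(exp2)
chi2_1_vs_2 = pearson_chi2_2x2(*exp1, *exp2)

print(f"defiance rate, Experiment I:  {defiance_rate_1:.2f}")
print(f"defiance rate, Experiment II: {defiance_rate_2:.2f}")
print(f"chi-square (df = 1): {chi2_1_vs_2:.2f}")
```

The statistic is computed only from the 26/14 and 4/36 splits reported above; it is offered as an arithmetic check, not as a restatement of any value printed in the article.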

How should we account for the powerful 
effect of the experimental manipulation? It 
is probable that in Experiment I many sub- 
jects come near to performing the defiant 
action but cannot quite bring themselves to 
the point of disobedience. The additional 
pressure of the group members leads to an 
increment in the forces oriented toward defi- 
ance; the increment is of sufficient strength 
so that, in combination with pressures for 
defiance already present, many subjects are 
carried over the threshold of disobedience.

The strong liberating effect of the peers 
brings to mind the powerful effect of a partner 
in Asch’s (1951) study. In that experiment 

6 See Milgram (in press) for additional experiments.
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negative pressures originated within the group 
itself so that the conflicting agents (partner 
versus majority) were internal to the group
boundary; in the present study the peers 
free the subject from an influence that is 
external to the group. Additionally, the part- 
ner’s support in Asch’s study leads to a 
response that is fundamentally similar in form 
but different in value from that of the erring 
majority. In the present study the peers 
initiate a radically different order of response, 
one which has no antecedent in the course
of the laboratory hour, and which destroys
the very framework of the experiment. 

Reactions to the confederates. The re- 
actions of naive subjects to the defiant con- 
federates varied considerably and were in 
part dependent on the exact point where the 
subject himself defied the experimenter. A 
subject who quit simultaneously with the 
first confederate stated, “Well, I was already 
thinking about quitting when the guy broke 
off.” Most defiant subjects praised the con- 
federates with such statements as, “I thought 
they were men of good character, yes I do. 
When the victim said ‘Stop,’ they stopped 
[Shock Level 11].”⁷ “I think they were very
sympathetic people . . . and they were to- 
tally unaware of what was in store for them 
[Shock Level 14].” 

A subject who defied the experimenter at 
Level 21 qualified his approval: “Well I think 
they should continue a little further, but I 
don’t blame them for backing out when they 
did.” 

A few subjects acknowledged the impor- 
tance of the confederates in leading to their 
own defiance: “The thought of stopping 
didn’t enter my mind until it was put there 
by the other two [Shock Level 14].” “The 
reason I quit was that I did not wish to seem
callous and cruel in the eyes of the other 
two men who had already refused to go on 
with the experiment [Shock Level 14].” The 
majority of subjects, however, denied that the 
confederates’ action was the critical factor in 
their own defiance. 


7 Numerals in brackets indicate the break-off point 
of the subject quoted. 

8 Twenty-seven of the defiant subjects stated that 
they would have broken off without the benefit of 
the confederates’ example; four subjects definitely 
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The fact that obedient subjects failed to 
follow the defiant group should not suggest 
that they did not feel the pressure of the 
confederates’ action. One obedient subject 
stated: 


I felt that I would just look like a real Simon 
Legree to these guys if I just went on cooly and 
just kept administering lashes. I thought they reacted 
normally, and the first thing that came to my mind 
was to react as they did. But I didn’t, because if
they reacted normally, and stopped the experiment, 
and I did the same, I don’t know how many 
months and days you’d have to continue before 
you got done. 


Thus this subject felt the burden of the group 
judgment, but sensed that in the light of two 
defections he had a special obligation to help 
the experimenter complete his work. Another 
obedient subject, when asked about the 
nervousness he displayed in the experiment, 
replied: 

I think it was primarily because of their actions. 
Momentarily I was ready to go along with them. 
Then suddenly I felt that they were just being 
ridiculous. What was I doing following the crowd? 


... They certainly had a right to stop, but I felt 
they lost all control of themselves. 


And a third obedient subject criticized the 
confederates more directly, stating: 


I don’t think they should have quit. They came here 
for an experiment, and I think they should have 
stuck with it. 


A closer analysis of the experimental 
situation points to a number of specific 
factors that may contribute to the group's 
effectiveness: 


1. The peers instill in the subject the idea 
of defying the experimenter. It may not have 
occurred to some subjects as a response 
possibility. 

2. The lone subject has no way of knowing
whether, in defying the experimenter, he
is performing in a bizarre manner or whether
this action is a common occurrence in the 
laboratory. The two examples of disobedience 


acknowledged the confederates’ rebellion as the criti- 
cal factor in their own defiance. The remaining 
defiant subjects were undecided on this issue. In
general, then, subjects underestimate the degree to
which their defiant actions are dependent on group
support.




he sees suggest that defiance is a natural
reaction to the situation. 

3. The reactions of the defiant confederates 
define the act of shocking the victim as im- 
proper. They provide social confirmation to 
the naive subject’s suspicion that it is wrong 
to punish a man against his will, even in the 


context of a psychological experiment.


4. The defiant confederates remain in the
laboratory even after withdrawing from the
experiment (they have agreed to answer post- 
experimental questions). Each additional 
shock administered by the naive subject now 
carries with it a measure of social disapproval
from the two confederates. 

5. As long as the two confederates participate
in the experimental procedure there
is a dispersion of responsibility among the 
group members for shocking the victim. As 
the confederates withdraw, responsibility 
becomes focused onto the naive subject.⁹

6. The naive subject is a witness to two 
instances of disobedience and observes the 
consequences of defying the experimenter to 
be minimal. 

7. There is identification with the dis- 
obedient confederates and the possibility of 
falling back on them for social support when 
defying the experimenter. 

8. Additionally, the experimenter’s power 
may be diminished by the very fact of failing 
to keep the two confederates in line, following
the general rule that every failure of author- 
ity to exact compliance to its commands 
weakens the perceived power of the authority 
(Homans, 1961). 


Hypothesis of Arbitrary Direction of Group 
Effects 


The results examined thus far show that 
group influence serves to liberate individuals 
effectively from submission to destructive 
commands. There are some who will take this 
to mean that the direction of group influence 
is arbitrary, that it can be oriented toward
destructive or constructive ends with equal 
impact, and that group pressure need merely 
be inserted into a social situation on one side 
of a standard or the other in order to induce 


movement in the desired direction.


9 See Wallach, Kogan, and Bem (1962) for a treatment
of this concept dealing with risk taking.
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This view ought to be questioned. Does 
the fact that a disobedient group alters the 
behavior of subjects in Experiment II necessarily
imply that group pressure can be applied
in the other direction with similar
effectiveness? A competing view would be that 
the direction of possible influence of a group 
is not arbitrary, but is highly dependent on 
the general structure of the situation in which 
influence is attempted. 

To examine this issue we need to undertake 
a further experimental variation, one in which 
the group forces are thrown on the side of 
the experimenter, rather than directed against 
him. The idea is simply to have the members 
of the group reinforce the experimenter’s 
commands by following them unfailingly, thus 
adding peer pressures to those originating in 
the experimenter’s commands. 


EXPERIMENT III: OBEDIENT GROUPS


Forty fresh subjects, matched to the subjects in 
Experiments I and II for sex, age, and occupational 
status, were employed in this condition. The pro- 
cedure was identical to that followed in Experiment 
II with this exception: at all times the two confeder- 
ates followed the commands of the experimenter; 
at no point did they object to carrying out the 
experimental instructions. Nor did they show sym- 
pathy for or comment on the discomfort of the 
victim. If a subject attempted to break off they 
allowed the experimenter primary responsibility for 
keeping him in line, but contributed background 
support for the experimenter; they indicated their 
disapproval of the naive subject’s attempts to leave 
the experiment with such remarks as: “You can’t 
quit now; this experiment has got to get done.” 
As in Experiment II the naive subject was seated 
between the two confederates, and in his role of 
Teacher 3, administered the shocks to the victim. 


Results and Discussion 


The results, presented in Table 1, Column 
4, show that the obedient group had very 
little effect on the overall performance of 
subjects. In Experiment I, 26 of the 40 sub- 
jects complied fully with the experimenter’s 
commands; in the present condition this figure
is increased by only 3, yielding a total of 29
obedient subjects. This increase falls far short
of statistical significance (χ² = .52, df = 1,
p > .50). Nor is the difference in mean maximum
shocks statistically reliable. The failure
of the manipulation to produce a significant 
change cannot be attributed to a ceiling artifact
since an obedient shift of even 8 of the
14 defiant subjects would yield the .05 
significance level by chi square. 
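The ceiling argument can be worked through numerically. The sketch below assumes an uncorrected Pearson chi-square (which reproduces the reported value of .52) and the conventional critical value of 3.841 for df = 1 at the .05 level; the names `pearson_chi2_2x2`, `CRITICAL_05_DF1`, and `smallest_significant_shift` are hypothetical and are introduced here only for illustration.

```python
CRITICAL_05_DF1 = 3.841  # chi-square critical value for df = 1, alpha = .05


def pearson_chi2_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom


# Experiment I: 26 obedient, 14 defiant. Experiment III: 29 obedient, 11 defiant.
observed_chi2 = pearson_chi2_2x2(26, 14, 29, 11)


def smallest_significant_shift(obedient=26, defiant=14):
    """Fewest defiant subjects that must turn obedient to reach the .05 level."""
    for shift in range(defiant + 1):
        chi2 = pearson_chi2_2x2(obedient, defiant, obedient + shift, defiant - shift)
        if chi2 >= CRITICAL_05_DF1:
            return shift
    return None


print(f"observed chi-square, Experiment I vs. III: {observed_chi2:.2f}")
print(f"smallest significant shift: {smallest_significant_shift()} subjects")
```

Under these assumptions a shift of 7 subjects falls short of the critical value while a shift of 8 exceeds it, which agrees with the statement that 14 defiant subjects left ample room for a detectable effect.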

Why the lack of change when we know 
that group pressure often exerts powerful ef- 
fects? One interpretation is that the authori- 
tarian pressure already present in Experi- 
ment I has preempted subjects who would 
have submitted to group pressures. Conceiva- 
bly, the subjects who are fully obedient in 
Experiment I are precisely those who would 
be susceptible to group forces, while those 
who resisted authoritarian pressure were also 
immune to the pressure of the obedient con- 
federates. The pressures applied in Experi- 
ment III do not show an effect because they 
overlap with other pressures having the same 
direction and present in Experiment I; all 
persons responsive to the initial pressure 
have already been moved to the obedient 
criterion in Experiment I. This possibility 
seems obvious enough in the present study. 
Yet every other situation in which group 
pressure is exerted also possesses a field 
structure (a particular arrangement of stimu- 
lus, motive, and social factors) that limits 
and controls potential influence within that 
field. Some structures allow group influence 
to be exerted in one direction but not 
another. Seen in this light, the hypothesis of 
the arbitrary direction of group effects is 
inadequate. 

In the present study Experiment I defines 
the initial field: the insertion of group pres- 
sure in a direction opposite to that of the 
experimenter’s commands (Experiment II) 
produces a powerful shift toward the group. 
Changing the direction of group movement 


10 See, for example, the study of Jones, Wells, and 
Torrey (1958). Starting with the Asch situation 
they show that through feedback, the experimenter 
can foster greater independence in the subject, but 
not significantly greater yielding to the erring major- 
ity. Here, too, an initial field structure limits the 
direction of influence attempts. 
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(Experiment III) does not yield a compa- 
rable shift in the subject’s performance. The 
group success in one case and failure in 
another can be traced directly to the con- 
figuration of motive and social forces opera- 
tive in the starting situation (Experiment I).

Given any social situation, the strength and 
direction of potential group influence is pre- 
determined by existing conditions. We need 
to examine the variety of field structures that 
typify social situations and the manner in 
which each controls the pattern of potential 
influence. 
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4 INTERPRETATION OF WORDS IN SCHIZOPHRENIA* 
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The interpretation and use of words by schizophrenics is described by a theory 

reconciling previous findings that appear contradictory if viewed in terms
of other theories. The present theory states that schizophrenics suffer from an
excessive reliance on the strongest normal meaning responses, which are defined
as representations of portions of the denotative meaning of words. Confirmatory
results indicated schizophrenics (a) more than normal Ss judged similar-
meaninged word pairs to have the same meaning (p < .01), and more so
when the strongest meaning response for both words was identical (p < .001);
but (b) less than normal Ss judged them to have similar meanings (p < .07).


Schizophrenics often use common words in 
ways which indicate that they invest words 
with deviant meanings. This is seen most 
dramatically in those patients with whom 
verbal communication breaks down despite a 
continuing exchange of conversation, but it 
also is seen to some degree in almost every 
schizophrenic patient. The present investiga- 
tion examines this deviancy by studying the 
characteristics of words which schizophrenics 
regard as synonymous.

Schizophrenics substitute words for one another
in ways which normal subjects do not.
These words often have a slight similarity
which normal persons recognize, but do not
regard as sufficient to treat the words as interchangeable.
For example, Bleuler’s (1950,
p. 19) patient who said, “Brutus was an
Italian,” instead of saying, “Brutus was a
Roman,” might be viewed as regarding
“Italian” and “Roman” as interchangeable
on the basis of their similarity of meaning.
One patient encountered among those
interviewed for the present research spoke
of “handlebars” of an automobile instead
of “steering wheel.” Another substituted
“hexagon” for the number “six.”

The phenomenon of schizophrenics speak- 
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ing in symbols appears also to reflect a treat- 
ment of similar-meaning words as if they 
were synonymous. Freud (1953), in ana- 
lyzing the Schreber case, believed that when 
Schreber wrote of birds, in his delusional 
system, he really was saying something about 
girls. Freud concluded this from the fact that 
the things which Schreber said about the birds 
seemed appropriate as a description of girls— 
in particular, the birds chattered away in an 
empty-headed manner, monotonously repeat- 
ing clichés. Freud suggested that the real-life 
twittering of birds resembled the conversation 
of girls, and on the basis of this similarity, 
Schreber chose birds as a suitable representa- 
tion of girls. Similarly, Freud believed that 
when Schreber wrote of God, in his delusional 
system, he was using God as a symbol of his 
father. Again, Freud explained the origin of 
this symbol on the basis of the similarity of 
the attributes commonly ascribed to God, and 
the attributes that Schreber saw in his father.
Most, or perhaps all, clinical analyses of the 
formation of symbols by schizophrenics indi- 
cate that the choice of a symbolic equivalent 
is based on a partial similarity, which normal 
people would recognize but would not regard 
as sufficient to warrant treating the two words 
or things as equivalent. 

Chapman and Taylor (1957) and Chapman 
(1961) have studied a schizophrenic disturb- 
ance in the use of common concepts which 
is closely related to the treatment of similar 
meanings as synonymous. They found that 
when schizophrenics use names of common 
concepts in an excessively broad manner, they 
tend to do so by including in the concept 
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items which, although incorrect, are similar 
to correct ones. For example, the schizo- 
phrenics often labeled various names of 
vegetables as “fruit.” 

Phenomena of this sort might tempt one 
to describe schizophrenia as being character- 
ized by a heightened and flattened gradient 
of semantic generalization, since in one vari- 
ety of semantic generalization a subject gives 
a response to a word which he has previously 
learned to a different word of similar meaning. 
However, the more direct evidence concerning 
semantic generalization in schizophrenia con- 
tradicts this interpretation. At least three 
different investigators have compared schizo- 
phrenics and normals on tasks which are 
usually thought of as measuring semantic 
generalization. These three studies uniformly 
obtained negative results, that is, they found 
no difference between schizophrenics and nor- 
mals on degree of semantic generalization. 
Carson (1958), using rote serial learning, 
found no difference between these groups in 
their response to variations in degree of simi- 
larity between adjectives. Thornton (1958) 
reported a similar finding for paired-associate 
learning. Peastrel (1961), using a conditioned 
galvanic skin response, found no difference on 
generalization to synonyms. 

To complicate further the picture, it ap- 
pears that in some kinds of situations, schizo- 
phrenics show the very opposite kind of 
behavior, that is, they treat words as less alike 
in meaning than normals. For example, when 
a schizophrenic is asked on a Wechsler test 
in what way a dog and a lion are alike, he 
will often answer something like, “They are 
not alike because a lion roars and a dog 
barks.” Similarly, Chapman (1961) found 
that when schizophrenics are asked to sort 
objects into designated categories which are 
fairly broad, for example, to indicate which 
of a series of names of objects belong in the 
category of “things that grow,” schizophrenics 
exclude from that category many more ob- 
jects than normals. If we were to describe 
these phenomena in terms of semantic gen- 
eralization, we would have to say that the 
schizophrenics respond in accordance with a 
steeper and a lower gradient of semantic 
generalization than normals, instead of a 
higher and flatter gradient. 
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These apparent contradictions in the re- 
sponse of patients to the similarity dimension, 
when viewed in terms of semantic generaliza- 
tion, together with the consistent lack of dif- 
ference between schizophrenics and normals 
on conventional measures of semantic gen- 
eralization, indicate that semantic generaliza- 
tion is an inadequate model for describing 
the schizophrenic deviancy in their interpreta- 
tion of the meaning of words. Instead we 
should seek some other principle to account 
for disordered thought in which similarity 
of meaning is a variable. The present investi- 
gators decided that before attempting to find 
such a principle, it was first necessary to 
develop a measure of the tendency to treat
words of similar meaning as synonymous and
use it to compare schizophrenics and normals.
It also seemed desirable to compare schizo- 
phrenics with patients suffering cortical brain 
damage on such a measure since brain- 
damaged patients also show a disorder of 
thought. 


EXPERIMENT I 


This phenomenon was measured by presenting
subjects with pairs of words, and
asking them if they mean the same thing.
These pairs of words varied in degree of similarity,
which made it possible to plot, as a
function of similarity, the number of pairs
called the same.


Procedure 


Preliminary item development. It was first necessary
to measure degree of similarity between pairs
of words. The investigators constructed 118 items in
the following format:

28. To what degree are CARPET and RUG alike in
meaning?
__1. They are almost exactly alike in meaning.
__2. They are very much alike in meaning.
__3. They are fairly much alike in meaning.
__4. They are only a little alike in meaning.
__5. They are not at all alike in meaning.

These were rated by 100 college students. The 118
pairs of words varied widely in degree of similarity;
and from the distribution of mean rated similarity
48 items were selected which clustered, in groups
of 12 each, around 4 points of rated similarity.

The mean rated similarity for the 4 points
was 1.35, 2.22, 3.10, and 4.01. These points were
treated as ordinal points only, and no attempt was
made to obtain equal intervals by a scaling
procedure.
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Main experimental task. For each of the 48 pairs 
of words, items like the following were constructed: 
High similarity (Point A) item:

14. “Slender” means the same as
A. Slim
B. Sober
C. Neither of the above

Low similarity (Point D) item:

25. “Automobile” means the same as
A. Pail
B. Wagon
C. Neither of the above


Slender and slim had been given a mean similarity
rating of 1.3 on the 5-point scale; automobile and
wagon had been given a mean rating of 3.8.
Each item also has an additional alternative. In
Item 25, the “pail” alternative was intended to have
minimal similarity to “automobile.” It is called here
the “unidentified-error” alternative and was intended
as a measure of “random” response, that is, subjects
might mark the “wagon” alternative for reasons
other than the hypothesized effect. For example,
schizophrenics might be uncooperative or have difficulty
taking the test, and so might mark randomly.
Such random markings should result as often in a
marking of the unidentified-error alternative (like
pail) as in marking of the similar alternative (like
wagon). Therefore, to obtain a relatively pure
measure of the tendency to treat similar words as
synonymous, for each subject the number of markings
of the unidentified-error alternatives was subtracted
from the number of markings of the similar
alternatives at each of the 4 points of similarity.
This yielded, for each subject, a corrected score of
the tendency to treat similar words as synonymous.
There was also one additional experimental control.
Some subjects might not understand the “neither
of the above” alternative, and would be disinclined
to choose it. As a check on this, each subject was
given 12 additional control items, similar in format
to the 48 experimental items, but in which neither
alternative was similar to the named word. For
example, subjects were given an item which asked
if “jury” means the same as “cat, purse, or neither
of the above.” Subjects who chose an alternative
other than “neither of the above” on as many as
[...] of these 12 control items were dropped from the
study on the grounds that they were not taking
the test in a meaningful manner. There were four
schizophrenics so dropped, but no normals or brain-damaged
patients. For the remaining subjects, the
mean numbers of control items for which the “neither
of the above” alternative was not chosen were
schizophrenic, .46; brain damaged, .04; and normal, [...].
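
The scoring rule just described can be sketched in a few lines. This is an illustrative reading only: the response labels ('similar', 'error', 'neither'), the point labels B and C (the text names only Points A and D), and the function names are assumptions introduced here, and the cutoff on control items used to drop subjects is not recoverable from the text, so it is left out.

```python
from collections import Counter

# Point labels are assumed; the text names only Point A (high) and Point D (low).
SIMILARITY_POINTS = ("A", "B", "C", "D")


def corrected_scores(responses):
    """Return similar-minus-error counts at each similarity point.

    responses: iterable of (point, choice) pairs, where choice is
    'similar', 'error' (the unidentified-error alternative), or 'neither'.
    """
    similar = Counter()
    error = Counter()
    for point, choice in responses:
        if choice == "similar":
            similar[point] += 1
        elif choice == "error":
            error[point] += 1
    return {p: similar[p] - error[p] for p in SIMILARITY_POINTS}


def control_item_misses(control_choices):
    """Count control items on which 'neither of the above' was not chosen."""
    return sum(1 for choice in control_choices if choice != "neither")


# A made-up subject, for illustration only (12 items at Point A, 12 at Point D).
example = (
    [("A", "similar")] * 10 + [("A", "neither")] * 2
    + [("D", "similar")] * 3 + [("D", "error")] * 1 + [("D", "neither")] * 8
)
scores = corrected_scores(example)
misses = control_item_misses(["neither"] * 11 + ["cat"])
print(scores, misses)
```

Subtracting the error count at each point, rather than once overall, keeps the correction tied to the same set of 12 items that produced the judgments of sameness.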


Subjects. The subjects were white males who
had no indications of mental deficiency and who
had completed at least the sixth grade. They included
26 schizophrenics, 26 brain-damaged patients, and
26 normal subjects.

The schizophrenic subjects were long-term chronic
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patients at Chicago State Hospital, who were not 
receiving “tranquilizing” drugs, and who had not 
received somatic therapy in the previous 3 months. 

Although no attempt was made to select patients 
by diagnostic subtype, they were found to include 
5 diagnosed as paranoid, 6 hebephrenic, 5 catatonic, 
and 10 without classic subtype. No schizophrenic 
was included for whom there was any suspicion of 
brain damage. None had been admitted more recently 
than a year ago and most of them had spent a large 
part of their lives in the hospital. 

The brain-damaged patients were obtained at 
Hines Veterans Administration Hospital. Only those 
patients were included who were diagnosed as suffer- 
ing from damage to the cortex of the cerebrum and 
also appeared clinically to have some intellectual 
loss. These included 3 brain tumors, 3 head injuries,
5 advanced multiple sclerosis, 1 chronic brain 
syndrome, 10 cerebral vascular disorders, 1 myelo- 
pathy, and 3 of unknown origin. No brain-damaged 
patient was included who appeared to be suffering 
from a severe emotional disorder. 

The normal subjects were ambulatory psy- 
chiatrically normal medical and surgical patients at 
the Illinois Central Hospital in Chicago. The mean
and variance of age of the three groups were schizo- 
phrenic 43.7, 69.1; brain damaged 43.1, 122.1; and 
normals 43.3, 38.3. Mean and variance of education 
were schizophrenic 10.7, 4.2; brain damaged 11.1, 
2.8; and normals 10.7, 5.1. The three groups did not 
differ on mean of either age or education. 

As an estimate of premorbid intellectual function- 
ing, Stanford-Binet Vocabulary scores were available 
for all 26 normal subjects, for 24 of the brain- 
damaged, and 21 of the schizophrenic patients. The 
mean scores were schizophrenic, 20.90; brain dam- 
aged, 19.79; and normal, 22.04. 

Since these scores did not differ significantly as 
indicated by an analysis of variance, despite the 
severe psychopathology of the patients, it appears 
that the premorbid intelligence of the patient groups 
was at least as high, and probably higher than that 
of the normal subjects. The vocabulary measure is 
especially useful because the main experimental task 
is itself a measure of knowledge of words. The 
similarity of the three groups on the vocabulary 
measure enables us to conclude that any difference 
between a patient group and the normal control 
group on the experimental measure indicates not only 
the effects of the psychopathology, but also indicates 
an effect over and above that which might result 
from the lowering of general vocabulary skill by the 
psychopathology. 


Results 


Very few unidentified-error alternatives
were chosen, the mean numbers being schizophrenic,
.08; normal, .19; and brain damaged,
0. This indicates that the subjects were
able to take the task in a meaningful manner,
producing very few “random” responses.
Nevertheless, all statistical computations were
made using the corrected scores, that is, with
the number of unidentified-error responses
subtracted from the number of judgments of
sameness at each of the 4 points of similarity.

Fig. 1. Mean judgments of sameness at 4 points of similarity in Experiment I.

Figure 1 shows the mean number of pairs 
of words judged as synonymous in meaning at 
each of the 4 points of similarity, using this 
corrected score. 

Analysis of variance indicated that the 
three groups differed on total number of pairs 
of words called the same (p < .01). Separate 
t tests showed that the schizophrenics differed 
from each of the other two groups (p < .01 
in each case) but that the normal and brain 
damaged did not differ. 

The slopes for the three groups diverged, as 
shown by a significant Groups × Similarity
interaction (p < .01). Separate analyses of 
variance indicated that the slopes differed 
between the schizophrenics and normals and 
between the schizophrenics and brain-damaged 
patients (p < .01 in each case) but that the 
slopes did not differ between the brain- 
damaged and the normal subjects. 

It will be recalled that on the 12 control
items in which none of the three alternatives
were similar to the stimulus word, the schizophrenic
group chose the “neither of the
above” alternative slightly less than the other
two groups. It might be argued that they had
a similar disinclination on the experimental
items, and that this would tend to inflate the
scores of judgments of sameness. As a check
on this, three schizophrenics were dropped in
order to repeat the analyses for subgroups in
which the schizophrenics and normals were
exactly matched on scores on the 12 control
items. The results were substantially the same
as for the total groups, and all of the p values
remained unchanged.

In summary, these results indicate that
schizophrenics regard words of similar meaning
as synonymous more than do both normal
subjects and brain-damaged patients, and that
their deviancy increases as similarity decreases.
No difference was found between
brain-damaged patients and normal subjects
on either of the two measures.

A theory of schizophrenics’ interpretation of
words. An interpretation of this phenomenon
was suggested by spontaneous comments that
some subjects made concerning the items.
One item asked if “bicycle” means the same
as “wagon.” The schizophrenics often said
things like, “You can ride on a bicycle, and
you can ride in a wagon, so they mean the
same.” Normal subjects would say things
like, “You can ride on both a bicycle and in a
wagon, but a bicycle has two wheels and a
wagon has four, so they don’t really mean the
same.” These responses suggest that the
schizophrenics used a more narrow range of
meanings for mediating their responses. They
do not weigh simultaneously the several different
aspects of meaning in order to answer
appropriately the question at hand, but instead
answer by using a more limited number
of aspects of the meaning. The meaning
responses which they use need not necessarily
be deviant. Instead they merely fail to use
all of the meaning responses that normal
persons use.


A number of writers have suggested that
verbal meanings are responses and that such
responses serve as stimuli which mediate responses
to words. Several of these writers have
further suggested that the relevant meaning
responses are associative ones, and they have
measured similarity of meaning by overlap of
associates (Bousfield, Whitmarsh, & Danick,
1958; Deese, 1962; Noble, 1952). The difficulty
with this approach is that associates
tap only indirectly what we usually think of
as “meaning” so that for some pairs of words
associative overlap yields results which are
not intuitively appealing. For example, the
most popular associate both to “dark” and
to “lamp” is “light” (Russell & Jenkins,
1954). This measure would indicate that
“dark” and “lamp” are highly similar in
meaning. It seems unlikely that these two
words would be rated as similar by judges.
For a further discussion of the inadequacies
of the associative definition of meaning, see
Osgood’s spirited critique (Cofer, 1961, pp. [...]).


Connotative meaning has been more widely
studied by psychologists than denotative and
the instrument for this has usually been the
semantic differential (Osgood, Suci, & Tannenbaum,
1957). However, as these writers
point out, this instrument yields an
incomplete account of meaning, since “many
denotatively distinct concepts may occupy
essentially the same region of our semantic
space, i.e., may have highly similar profiles
. . .” Errors in interpreting denotative
meaning seem central in the phenomena being
studied here.

In the present theory, meaning is viewed
as a response, though not as an associate or an
expression of connotative meaning, but rather as a
direct expression of a portion of the denotation
of a word. Thus a person has, to any one
word, a series of meaning responses, each of
which expresses an aspect of the meaning of
the word. For example, he might respond to
the word “bicycle” with the meaning responses
of, “you ride on it, it has wheels, you
pedal it, it’s metal, etc.” These several meaning
responses to a single word differ in
strength, and there is a similarity in the
hierarchy of meaning responses among people
in our culture; otherwise it would be difficult
to communicate. For example, the meaning
response, “you ride on it,” is probably for
most people high in the hierarchy of meaning
responses to the word “bicycle.” These meaning
responses mediate both the subject’s usage
of the word and the answers that he gives to
test items which ask about the word.

It is suggested here that in schizophrenia 
the strongest meaning responses tend to be 
relied on excessively, and the weaker ones 
tend to have little influence. This results in 
a reduction of the variety of meaning re- 
sponses used by subjects to mediate their 
responses to words, and those meaning re- 
sponses which are used to the exclusion of 
others tend to be the ones which are stronger 
for normal subjects. This might be viewed as 
an uncritical acceptance of a normal response 
bias. 

This principle explains excessive substitu- 
tion of similar-meaninged words for one an- 
other. Among pairs of words which we think 
of as similar, there is frequently, but not 
always, a common shared meaning response. 
Hence, schizophrenics indicated in Experi- 
ment I that “wagon” means the same as 
“bicycle” more often than normals because 
among the strong meaning responses to both 
“wagon” and “bicycle” are “you can ride on 
it,” and “it has wheels.” For the normal 
subjects, other meaning responses occur which 
point up differences, but often not so for the 
schizophrenic who uses uncritically the 
strongest meaning responses, neglecting others. 

This principle yields, in other situations, 
predictions which are the reverse of those 
which would be made from a principle of 
heightened semantic generalization. The latter
principle would always predict that schizo- 
phrenics would treat words as more alike than 
normals. However, the present formulation 
predicts that on a task on which a subject is 
asked to point out similarities, rather than 
identities, between pairs of words of moderate 
to high similarity, schizophrenics would in- 
dicate such similarities less often than the 
normals. This is because a normal person 
decides whether words are similar in meaning 
by considering various meaning responses, 
and the meaning response in which most of 
the similarity resides will sometimes not be in 
the first position in the hierarchy of responses 
ordered by strength. 

It is seen that the present formulation 
predicts that on any given set of word pairs 
schizophrenics more than normals will tend to 
give the same answer to the questions con- 
cerning similarity and sameness of meaning. 
This is because schizophrenics, being limited 
to fewer meaning responses, tend more to 
make sharply dichotomous judgments. 
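
The two predictions above can be illustrated with a toy model. The sketch below is a hypothetical simplification and not a procedure used in this study: each word is given an ordered list of meaning responses (strongest first; the orderings, and the wagon responses beyond the first two, are assumed for illustration), a judge consults only the top `depth` responses, calls two words "the same" when the consulted responses match exactly, and calls them "similar" when any consulted response is shared.

```python
# Toy model of a truncated hierarchy of meaning responses.
# Orderings by strength are assumed for illustration only.
HIERARCHY = {
    "bicycle": ["you ride on it", "it has wheels", "you pedal it", "it is metal"],
    "wagon": ["you ride on it", "it has wheels", "it has four wheels", "you pull it"],
    "pig": ["is used for food", "is fat", "is dirty", "has four legs", "is an animal"],
    "dog": ["is an animal", "is a pet", "has four legs", "is a friend", "likes to bark"],
}


def consulted(word, depth):
    """Return the set of the `depth` strongest meaning responses to a word."""
    return set(HIERARCHY[word][:depth])


def judged_same(a, b, depth):
    """Hypothetical rule: 'same' if the consulted responses are identical."""
    return consulted(a, depth) == consulted(b, depth)


def judged_similar(a, b, depth):
    """Hypothetical rule: 'similar' if any consulted response is shared."""
    return bool(consulted(a, depth) & consulted(b, depth))


NORMAL_DEPTH = 5      # weighs all available responses
RESTRICTED_DEPTH = 1  # relies on the strongest response only

for pair in [("bicycle", "wagon"), ("pig", "dog")]:
    for label, depth in [("normal", NORMAL_DEPTH), ("restricted", RESTRICTED_DEPTH)]:
        print(pair, label,
              "same:", judged_same(*pair, depth),
              "similar:", judged_similar(*pair, depth))
```

Under these assumptions the restricted judge calls bicycle and wagon the same but misses the similarity of pig and dog, and gives one answer to both questions for each pair, which is the dichotomous pattern the formulation predicts.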


EXPERIMENT II 
Procedure 


Preliminary item development. Pairs of words 
were ordered on degree of similarity by the same 
rating procedure as in Experiment I. Each of 104
college students rated 332 pairs of words on a scale 
of similarity from 1 to 5. From the distribution of 
mean rated similarity, 80 items were selected which 
clustered in groups of 20 each around 4 points of 
rated similarity. The 80 items were divided into two 
sets, each with 10 items at each point. The two sets 
of word pairs, called here Sets 1 and 2, were closely 
equated on mean, mean variance, and variance of 
the means of rated similarity of the 10 items at each 
point. The means of the ratings for the 4 points were
1.4, 2.2, 3.2, and 4.2 for each set.

Experimental task. Two parallel forms were con- 
structed, Forms X and Y, each of which contained 
both sets of word pairs. Form X presented the 
word pairs of Set 1 in items calling for the judgment 
of similarity in meaning and the word pairs of Set 2 
in items calling for the judgment of sameness in 
meaning—for Form Y this was reversed, with Set 1 
word pairs presented in items calling for selection of 
identities, and Set 2 word pairs in items calling for 
selection of similarities. The items calling for judg- 
ments of sameness were identical in format to 
those of Experiment I. Items calling for judgments 
of similarity were of the form: 


“Automobile” is similar in meaning to
A. Pail
B. Wagon
C. Neither of the above

Each subject received one of the two forms, and
was tested on two occasions, a week apart, receiving
on one occasion only one of the two sets of items.
The order of the two sets within each form was
counterbalanced across subjects. The two forms
were used equally often. Thus each subject judged
all the words, half on the “similar” instructions and
the other half on the “same” instructions. This made
it possible to compare responses to the two kinds of
item wording without contamination by differences
in word pairs or by transfer effects.
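
The counterbalancing described above amounts to a 2 x 2 layout of forms by order of sets. The following sketch enumerates the four cells; the mapping of sets to instructions is taken from the text, while the function names and the simple rotation used to assign subjects are assumptions made for illustration.

```python
from itertools import product

# Which instruction each form attaches to each set of word pairs (from the text).
FORMS = {
    "X": {"Set 1": "similar", "Set 2": "same"},
    "Y": {"Set 1": "same", "Set 2": "similar"},
}
# Order in which the two sets are given: (session 1, session 2).
SET_ORDERS = [("Set 1", "Set 2"), ("Set 2", "Set 1")]


def schedule(form, set_order):
    """Return [(session, set, instruction), ...] for one subject."""
    return [(session, s, FORMS[form][s])
            for session, s in enumerate(set_order, start=1)]


# The four cells of the design: 2 forms x 2 orders of sets.
CELLS = [schedule(form, order) for form, order in product(FORMS, SET_ORDERS)]


def assign(subject_index):
    """Hypothetical rotation of subjects through the four cells."""
    return CELLS[subject_index % len(CELLS)]


for cell in CELLS:
    print(cell)
```

Every cell contains both sets and both instructions, so each subject judges every word pair exactly once, half under each wording.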

Each test form of 80 experimental items was also 
accompanied by 20 additional control items, for 
which neither alternative was similar to the named 
word. Subjects who chose an alternative other 
than “neither of the above” on as many as 5 of these 
20 control items were dropped from the study on the 
grounds that they were not taking the test in a 
meaningful manner. There were 19 schizophrenics
so dropped, but no normals.
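
The exclusion rule can be written out as a short check. This is a minimal sketch: the threshold comes from the text, but the coding of answers as letters, with "C" standing for "Neither of the above" as in the sample item, is an assumption.

```python
NEITHER = "C"            # "Neither of the above" (letter assumed from the sample item)
EXCLUSION_THRESHOLD = 5  # "as many as 5 of these 20 control items"


def control_errors(control_answers):
    """Count control items on which an alternative other than 'neither' was chosen."""
    return sum(1 for answer in control_answers if answer != NEITHER)


def is_excluded(control_answers):
    """Apply the exclusion rule described in the text."""
    return control_errors(control_answers) >= EXCLUSION_THRESHOLD


# Two hypothetical subjects, each answering the 20 control items.
careful = ["C"] * 18 + ["A", "B"]
careless = ["C"] * 15 + ["A"] * 5
print(is_excluded(careful), is_excluded(careless))  # False True
```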

Subjects. The schizophrenics were again a chronic 
sample, and were obtained from Eastern State 
Hospital, Lexington, Kentucky, and Kentucky State 
Hospital in Danville. All had been admitted at least 
6 months previously, and most had spent many years 
in the hospital. They were all white males between 
the ages of 32 and 60, who showed no indication 
of mental deficiency or brain damage, and who had 
completed the fifth grade, and had not received 
electric shock therapy for the previous 3 months.
No attempt was made to select by diagnostic subtype. 

At both hospitals most of the patients were
receiving phenothiazine therapy, but at the second
hospital the patients receiving such drugs were
removed from medication for purposes of this research.
An attempt was made to test every patient
in both hospitals who met the criteria listed above, 
and since the patients at Kentucky State Hospital 
were to be removed from medication, they were 
first given a screening test (a simple sorting task 
unlike the experimental instruments) in order to 
determine if they would participate in testing. 

From a larger group of patients at Eastern State 
Hospital who met the criteria, 17 were found who 
completed the testing. 

Of 80 patients at Kentucky State who met the
criteria, 58 took the screening test in what appeared
to be a meaningful manner. Of these, 43 were
receiving tranquilizers (one or more phenothiazines).
These 43 patients were placed on placebo for 4 weeks
before being tested for the experiment. However,
only 7 of these successfully completed the experiment.
The remaining 36 were either unable or unwilling to
take the test. No patient had to be omitted because
the hospital management judged him unsuitable for
continued withholding of drugs. However, testing
was more successful with the 15 patients who initially
were not receiving tranquilizers. Thirteen of these
took the test and are included in this report. Thus
there were 37 schizophrenics in all.
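
The patient counts given above can be checked for internal consistency with a few lines of arithmetic. The variable names are illustrative; the numbers are those reported in the text.

```python
# Counts as reported in the text.
eastern_state_completed = 17

kentucky_met_criteria = 80
kentucky_passed_screening = 58
on_tranquilizers = 43
not_on_tranquilizers = 15
completed_after_placebo = 7
failed_after_placebo = 36
completed_without_drugs = 13

# Internal consistency checks on the Kentucky State sample.
assert on_tranquilizers + not_on_tranquilizers == kentucky_passed_screening
assert completed_after_placebo + failed_after_placebo == on_tranquilizers
assert kentucky_passed_screening <= kentucky_met_criteria

total = eastern_state_completed + completed_after_placebo + completed_without_drugs
print(total)  # 37
```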

It was found that the patients from the two
hospitals performed in a strikingly similar manner
on the main experimental instrument, despite the
fact that one group was receiving tranquilizing
drugs and the other not. Therefore, they were
combined for comparison with the normal subjects.

The normal subjects were each paid $10.00 to
participate in this and other studies. Twenty-six
of them were obtained through the State of Kentucky
Employment Service and 10 from among
maintenance workers at the University of Kentucky.
Although the subjects from the Employment Service
were currently unemployed, no one was included
who was not recorded as having held a job for at
least 6 months during the previous year. A number
of them were currently out of work during the
winter months, but normally worked steadily in the
building trades during warmer weather.

The 36 members of the normal group included 8 
carpenters, 4 truck drivers, 2 laborers, 5 electricians, 
2 bricklayers, 3 salesmen, 6 plumbers, 2 postal
clerks, 1 policeman, 1 store manager, and 2 foremen. 

The mean and variance of age were schizophrenic
47.7, 55.3; normal 40.2, 81.7. Similar values for years
of schooling completed were schizophrenic 10.2, 13.7;
normal 11.0, 8.1. As an estimate of current intellectual
functioning the Vocabulary subtest of the
Stanford-Binet was available for 33 of the schizophrenic
patients and all of the normal subjects. The
mean and variance for raw Vocabulary score were
schizophrenic 17.6, 36.4; normal 22.2, 19.8. The
groups did not differ significantly on education but
did on both age and Vocabulary score (p < .001 in
each case). The difference of 7.5 years of age is not
thought to be an important variable for a task of
this kind. Vocabulary score might be expected to
be related to the measures in question but the lower
score of the patients very likely reflects the patients’
psychopathology rather than lower premorbid intelligence.
Therefore, instead of matching the groups
on these variables, additional analyses were done to
determine the effects of age and Vocabulary score
on the dependent variables.
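
The reported group differences can be roughly checked from the summary statistics alone. The text does not say which test was used, so the sketch below substitutes Welch's t statistic as an assumption. It also assumes that the reported variances are sample variances and that the group sizes are 37 and 36 (33 patients for Vocabulary).

```python
from math import sqrt


def welch_t(mean1, var1, n1, mean2, var2, n2):
    """Welch's t statistic for two independent samples, from summary statistics."""
    return (mean1 - mean2) / sqrt(var1 / n1 + var2 / n2)


# (mean, variance, n) for schizophrenics and normals, as reported in the text.
AGE = ((47.7, 55.3, 37), (40.2, 81.7, 36))
SCHOOLING = ((10.2, 13.7, 37), (11.0, 8.1, 36))
VOCABULARY = ((17.6, 36.4, 33), (22.2, 19.8, 36))  # scores available for 33 patients

for name, (schiz, normal) in [("age", AGE), ("schooling", SCHOOLING),
                              ("vocabulary", VOCABULARY)]:
    print(name, round(welch_t(*schiz, *normal), 2))
```

Under these assumptions the statistic comes out near 3.9 for age, near -1.0 for schooling, and near -3.6 for Vocabulary, which agrees with the reported pattern of a nonsignificant difference in education and significant differences on the other two variables.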


Results


As in Experiment I, the score used for 
number of pairs judged the same (or similar) 
by each subject was corrected for “random” 
error, that is, the number of predicted 
alternatives marked minus the number of 
“unidentified-error” alternatives marked. 
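
The correction for “random” error can be sketched as follows. The encoding of items and responses is hypothetical: each item is assumed to have one predicted alternative (the word rated as similar to the named word) and one unrelated alternative whose choice counts as an unidentified error.

```python
def corrected_score(responses, items):
    """Predicted alternatives marked minus unrelated ("error") alternatives marked.

    responses: dict item_id -> chosen letter ("A", "B", or "C" for neither)
    items:     dict item_id -> {"predicted": letter, "error": letter}
    """
    predicted = sum(1 for i, key in items.items()
                    if responses.get(i) == key["predicted"])
    errors = sum(1 for i, key in items.items()
                 if responses.get(i) == key["error"])
    return predicted - errors


# Hypothetical three-item example.
ITEMS = {
    1: {"predicted": "B", "error": "A"},
    2: {"predicted": "A", "error": "B"},
    3: {"predicted": "B", "error": "A"},
}
RESPONSES = {1: "B", 2: "B", 3: "C"}
print(corrected_score(RESPONSES, ITEMS))  # 1 predicted - 1 error = 0
```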

Figure 2 shows the means of the number 
of pairs of words judged by the schizophrenic 
and normal subjects as the same in meaning, 
and as similar in meaning, at each of the 4 
points of rated similarity.

The curves for judgments of sameness ap- 
pear similar to those obtained in Experi- 
ment I, except that at the A point, which is 
the point of highest similarity, the schizo- 
phrenics marked fewer pairs than the normals 
as meaning the same (z= 3.10, p< .01). 
This can probably be attributed to a greater 
degree of disorganization in the patients used 
in Experiment II, with a resultant greater 
variability of response. The data were ex- 
tremely skewed, so nonparametric analyses 
were used throughout. 

Fig. 2. Mean judgments of sameness and of similarity at 4 points of similarity in Experiment II.
(Curves: schizophrenic and normal subjects under the “similar” and “same” instructions; abscissa: rated similarity.)

As in Experiment I, the schizophrenics exceeded
the normals on the number of pairs
of words called the same for the 4 points
combined (z = 2.40, p < .01), as measured by
a one-tailed Mann-Whitney test. The difference
in slopes of the curves was determined
by computing for each subject the difference
between the number of pairs called the same
at the A point and at the D point, and then
comparing schizophrenics and normals on this
difference score. The slopes of the curves
for the normal and schizophrenic responses
were found to differ significantly (z = 4.81,
p < .001) as rated similarity decreased, with
a flatter slope for the schizophrenics.
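
The slope comparison can be sketched in two steps: a per-subject difference score (Point A minus Point D), then a Mann-Whitney comparison of the two groups on that score. The version below uses the large-sample normal approximation with midranks for ties and omits the tie and continuity corrections; whether the original analysis applied them is not stated. The data are invented for illustration.

```python
from math import sqrt


def midranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def mann_whitney_z(x, y):
    """Normal-approximation z for the Mann-Whitney U test (no tie or continuity correction)."""
    n1, n2 = len(x), len(y)
    ranks = midranks(list(x) + list(y))
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2  # U statistic for sample x
    mean_u = n1 * n2 / 2
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    return (u1 - mean_u) / sd_u


def slope_scores(subjects):
    """Per-subject difference: pairs called the same at Point A minus at Point D."""
    return [s["A"] - s["D"] for s in subjects]


# Hypothetical scores, for illustration only (not the study's data).
schizophrenics = [{"A": 7, "D": 5}, {"A": 6, "D": 5}, {"A": 8, "D": 6}, {"A": 5, "D": 4}]
normals = [{"A": 9, "D": 1}, {"A": 8, "D": 0}, {"A": 9, "D": 2}, {"A": 7, "D": 1}]

z = mann_whitney_z(slope_scores(schizophrenics), slope_scores(normals))
print(round(z, 2))  # negative: the hypothetical patients have the flatter slope
```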

On the second instruction, in which the 
subject was asked to indicate similarity rather 
than sameness of meaning, the theory offered 
in the present paper predicted that schizo- 
phrenics would mark fewer pairs of words as 
similar in meaning than the normals. The dif- 
ference found was in the direction hypothe- 
sized, but it fell short of significance (z = 1.55, 
p < .07). The direction of this difference was 
opposite to that which would be predicted 
by a hypothesis of heightened semantic 
generalization, but consistent with the present 
theory. 

As seen in Figure 2, the curves converge 
with decreasing similarity. The slopes of the 
curves were compared by the same method as 
described above, and were again found to be 
significantly different (z = 2.30, p < .02). 

A measure of meaning responses. The 
theory offered here states that the schizo- 
phrenic deviancy in the use of words arises 
from excessive reliance on the strongest mean- 
ing responses. This yields the hypothesis that 
those word pairs on which schizophrenics 
exceed normal subjects in judgments of sameness
are more often those for which the
strongest meaning responses are identical for 
the two words. This can be tested most simply 
by selecting out those word pairs for which 
the one meaning response of highest strength 
is identical for the two members of a pair.
Therefore procedures were devised to obtain 
the most common meaning responses, and to 
measure their relative strength, for each word 
in the 80 word pairs used in Experiment II. 
The method used was a new one, although it 
has something in common both with Under- 
wood and Richardson’s (1956) measure of 
the hierarchy of sensory impression responses 
to verbal stimuli, and with Flavell and 
Johnson’s (1961) measure of the number of 
different similarities given for word pairs.

After pilot testing, a list of the 160 words 
was presented to judges, preceded by the 
following instruction: 

For each word below, first write very briefly,
preferably in one or two words, what the thing is
or what it’s like. Then write two other very brief
statements about what it is or what it is like.

For example, for elephant someone might write

(1) animal (2) big (3) has trunk

Please be careful not to give associations. For
example, tiger might be an association for elephant
but it is not a statement of what an elephant is
or what an elephant is like.

You may find that for some words the most
appropriate statements of what it is or what it’s like
are ones that you have used already for earlier
words. In such cases you can use these statements
again. There is no need to write something new
unless another statement is more appropriate.


The judges were 100 students in an intro- 
ductory psychology course. 

Because of the length of the task, it was
divided into two forms, each half of the judges 
receiving one half of the words. The two 
words which had been used in a pair in the 
experimental task were presented to the same 
judge but were widely separated on the list. 

The experimenter tallied the meaning responses
given for each word, combining those
which were essentially the same except for 
minor variations in phrasing. The five mean- 
ing responses most frequently given for each 
word were selected for further study. These 
sets of meaning responses were then ranked 
as to their importance by other judges. The 
experimenter presented the judges with a list 
of the 160 words with each word followed 
by the five most frequent meaning responses 
listed in random order, and asked them to 
number the meaning responses as to “their 
importance for telling the meaning of the 
word or for describing what it is.” The judges 
were again 100 students in an introductory 
psychology course, and again, each judge was 
given only half of the items.

The number of common meaning responses
shared by both words of a pair varied from
0 to 5. There were only five word pairs with
no meaning response in common. As expected,
the high similarity word pairs (Point A) had
more meaning responses in common than the
low similarity word pairs (Point D). However,
the principal interest here was not in the
sheer number of shared meaning responses or
even in their absolute strength, but rather
in whether the same meaning response is
strongest for both members of a word pair.
For example, on the “pig-dog” word pair
the five meaning responses for pig were “is
used for food,” “is fat,” “is dirty,” “has four
legs,” “is an animal.” The five for dog were
“is an animal,” “is a pet,” “has four legs,”
“is a friend,” “likes to bark.” It is seen that
only two meaning responses are shared, that
is, “is an animal” and “has four legs.” If a
judge should rank the same one of these two
as first in importance for both pig and dog,
he was considered to have judged a shared
meaning response to be in first position for
the two words.
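
The pig-dog example can be worked through directly. In this sketch the two lists of meaning responses are taken from the text, while the function names and the two judges' rankings are hypothetical.

```python
PIG = ["is used for food", "is fat", "is dirty", "has four legs", "is an animal"]
DOG = ["is an animal", "is a pet", "has four legs", "is a friend", "likes to bark"]


def shared_responses(responses_a, responses_b):
    """Meaning responses common to both words, in the order listed for the first."""
    return [r for r in responses_a if r in responses_b]


def shared_first(ranking_a, ranking_b):
    """True if one judge ranked the same response first for both words."""
    return ranking_a[0] == ranking_b[0]


print(shared_responses(PIG, DOG))  # ['has four legs', 'is an animal']

# Hypothetical rankings by two judges (most important response first).
judge_1 = (["is an animal", "is used for food", "has four legs", "is fat", "is dirty"],
           ["is an animal", "is a pet", "is a friend", "has four legs", "likes to bark"])
judge_2 = (["is used for food", "is an animal", "is fat", "has four legs", "is dirty"],
           ["is a pet", "is an animal", "is a friend", "likes to bark", "has four legs"])
print(shared_first(*judge_1), shared_first(*judge_2))  # True False
```

Counting the judges for whom `shared_first` holds gives, for each word pair, the tally on which the item classification rests.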

On many of the word pairs a meaning
response to one of the two words was the
other member of the pair itself. This occurrence
implies shared meaning responses of
all levels of strength so that such words are
inappropriate for testing a hypothesis concerning
overresponsiveness to the high strength
meaning responses. Therefore, word pairs for
which this occurred for as many as 8 out of
50 judges were dropped from further analyses.
This was the case for 42 pairs, which left 38
word pairs for testing the hypothesis. These
38 word pairs were dichotomized as to the
number of judges who ranked the same shared
meaning response first for both members of a
pair. This yielded 19 word pairs for which 16
or more judges had done so (called here
Type I items) and 19 word pairs for which
13 or fewer had done this (called here Type
II items). The 19 Type I items included 3
items at Point B, 8 at C, and 8 at D, and
had a mean rated similarity of 3.4. The 19
Type II items included 7 at C, and 12 at D,
with a mean rated similarity of 3.8. It is seen
that our measure has only a slight relationship
to rated similarity. It would appear that
a Type II word pair which has the same
rated similarity as a Type I word pair must
have a larger number of shared meaning
responses of lower strength, since its overall
similarity is the same despite the absence
of high strength commonality.
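
The item counts and mean similarities reported above can be approximately reproduced. The sketch assumes that every item sits exactly at the mean rating of its similarity point (2.2, 3.2, and 4.2 for Points B, C, and D), whereas the items only cluster around those values, so agreement with the reported means is expected to be approximate.

```python
# Approximate mean rating at each similarity point used by the two item types.
POINT_MEANS = {"B": 2.2, "C": 3.2, "D": 4.2}

# Number of items of each type at each point, as reported in the text.
TYPE_I = {"B": 3, "C": 8, "D": 8}
TYPE_II = {"C": 7, "D": 12}


def mean_similarity(counts):
    """Weighted mean of the point means, weighted by the number of items at each point."""
    total_items = sum(counts.values())
    return sum(POINT_MEANS[p] * n for p, n in counts.items()) / total_items


# 80 pairs minus the 42 dropped leaves 38, split into two sets of 19.
assert 80 - 42 == 38
assert sum(TYPE_I.values()) == 19 and sum(TYPE_II.values()) == 19

print(round(mean_similarity(TYPE_I), 2), round(mean_similarity(TYPE_II), 2))
```

The approximations fall close to the reported means of 3.4 and 3.8.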

No prediction is made by the theory concerning
the absolute number of pairs called
the same by the schizophrenics and normals
on the Type I items and Type II items. The
absolute number would correspond instead to
rated similarity. The prediction is rather that
schizophrenics will show a greater differential
preference, than normals, for regarding the
Type I word pairs as the same in meaning.

TABLE 1

Mean Number of Word Pairs Judged as
Synonymous for Type I
and Type II Items

Group            Type I    Type II    I − II
Schizophrenics    4.40      3.50        .90
Normals           1.90      2.43       −.53

The items corresponding to these two sub- 
sets of word pairs were then treated as sub- 
tests, and scores on number of pairs called 
the same were computed for schizophrenics 
and normals. Table 1 shows these data. As 
seen there the schizophrenics scored higher 
on Type I than Type II items, but the 
normals showed the opposite direction of 
difference. The score used as a measure of 
differential response was the number of pairs 
called the same for Type I items minus 
number of pairs called the same for Type II 
items. On this score the schizophrenics 
were significantly different from the normals 
(z = 3.38, p<.001) as determined by a 
one-tailed Mann-Whitney U test. 
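
The difference column of Table 1 follows from the two columns of means, as the short check below shows. The dictionary layout is illustrative; the values are those of the table.

```python
TABLE_1 = {
    "Schizophrenics": {"Type I": 4.40, "Type II": 3.50},
    "Normals": {"Type I": 1.90, "Type II": 2.43},
}


def differential(row):
    """Type I minus Type II mean number of pairs called the same."""
    return round(row["Type I"] - row["Type II"], 2)


for group, row in TABLE_1.items():
    print(group, differential(row))
```

The significance test itself was run on the per-subject difference scores, which cannot be recovered from these group means.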

As seen in Figure 2, the excessive tendency 
of schizophrenics to say that word pairs have 
the same meaning varies with similarity. 
However, it would appear unlikely that the 
difference between the two sets of items in 
number of pairs called the same could be 
attributable to the difference in mean rated 
similarity because the curves are nearly 
parallel between the two similarity points 
corresponding to the mean values of the sets 
of items. However, as a check on this, sub- 
sets of Type I and Type II items equated 
on similarity were obtained by dropping the 
three extreme items from each group. The 
resultant mean rated similarity was 3.67 for 
both subsets. The analyses comparing Type I 
and Type II items were repeated using these 
16-item subsets, and the results were highly 
similar to those for the 19-item sets of items. 
On the difference score of number of Type I 
items minus number of Type II items called 
the same, the schizophrenics again exceeded 
the normal subjects (z = 2.49, p < .01). 

These results confirm our hypothesis that
the words that schizophrenics treat as meaning
the same more than normals are more
often those for which a shared meaning
response is strongest for both words.

Effects of age and vocabulary level. Because 
the schizophrenics were 7.5 years older than 
the normal subjects, it was necessary to 
determine if age might be related to the 
dependent variables. This relationship was 
investigated only for the normal subjects, 
because for schizophrenics the relationship of 
age to length of illness might produce an 
artifactual relationship of age to severity of 
symptoms. The correlation was computed for 
the normal subjects between age and number 
of pairs called the same (r = .05), and be- 
tween age and the score of number of Type I 
items called the same minus number of Type
II items called the same (r = −.22). Neither
value approached significance, and the larger 
of the two values (r = —.22) was in the 
direction opposite to that which would ex- 
plain the main experimental findings as an 
artifact of age. 

It also was necessary to determine whether 
the main experimental effects could be at- 
tributed to the lower Vocabulary scores of 
the schizophrenics. This possibility was es- 
pecially important since the experimental task 
was itself a variety of vocabulary test. Vocab- 
ulary scores for normal subjects correlated 
—.18 with the number of pairs called the 
same. This nonsignificant correlation indicates 
a slight tendency for brighter subjects to 
judge fewer word pairs as meaning the same. 
Although the correlation did not approach 
significance, it was in the direction of explain- 
ing the higher scores of schizophrenics on this 
measure by their lower Vocabulary scores. 
Therefore, an additional analysis was done 
to check on this possibility. Subgroups of 
normal and schizophrenic subjects were 
matched on total Vocabulary score (27 nor- 
mals and 24 schizophrenics). The schizo- 
phrenic subgroup exceeded the normal sub- 
group on number of pairs called the same 
(z = 2.34, p<.01), just as in the main 
analysis. 

The correlation of Vocabulary score with 
the score of difference between number of 
Type I and Type II items called the same 
was .16 which indicates a slight nonsignificant 
trend toward brighter subjects showing the 
greater differential response toward the two
kinds of items. The direction of this difference 
is opposite to that which would be necessary 
to interpret the greater differential response 
of the schizophrenics as due to a lower 
vocabulary skill. On the subgroups matched 
on Vocabulary score, the schizophrenics again 
exceeded the normals on this difference score 
(z = 2.22, p < .02).

It is clear, therefore, that the findings obtained
here are not an artifact of difference
in age, and that they cannot be attributed to
the schizophrenics’ drop in vocabulary skill.

Discussion 


The present theory and data permit a more
precise description of a schizophrenic language
disturbance that has often been described by
other writers in general terms. The data support
the theory that schizophrenics have a
steeper gradient of strength of meaning
responses to words.

Several writers have described features of 
schizophrenic verbalization that apparently 
can be subsumed, at least in part, under this 
phenomenon. Cameron’s (1938) “metonymic
distortion” is a clear example, since it refers
to inappropriate substitution of words for
one another.

The present formulation also has much in
common with Bleuler’s (1950) description of
the disorder in terms of a breaking of associative
threads. By associative threads, Bleuler
meant connections between one word or
thought and another and hence included
aspects of denotative meaning. Bleuler, however,
regarded the selection of associative
threads for cutting or retention as a haphazard
nonlawful process. The present formulation
has additional predictive power over
the formulations of both Cameron and
Bleuler in that it specifies the kind of meaning
response which is not used and the kind
that continues to be used to guide verbal
behavior.

The present formulation sheds light on
what von Domarus (1951) and Arieti (1955)
have described as a disturbance in syllogistic
reasoning, and which Arieti terms “von
Domarus’ principle.” Both of these writers
have stated that a central feature of schizophrenic
thinking disorder is a distortion of
logic which consists of the tendency to conclude
identity of subjects on the basis of
identical qualities or predicates. For example,
von Domarus and Arieti both state that
given the premises:


Stags are swift, 
Certain Indians are swift 


schizophrenics will conclude that “certain
Indians are stags,” while normal people will
not. Arieti offers convincing clinical examples
of this kind of error. However, Gottesman and
Chapman (1960) compared schizophrenics
and normals on the conclusions that they
draw from syllogisms presented in a printed
test, and found close similarity between
schizophrenics and normals in their tendency
toward this kind of error. The findings of the
present study indicate that those words which
schizophrenics treat as identical on the basis
of common qualities more than normals are
more often those for which the common quality
in question is an important aspect of the
meaning of the word. If we may infer by
analogy, it seems probable that schizophrenics
also equate objects which have a common
quality more often when the common quality
in question is an important feature of the
object. In the example above, the present
theory would predict the von Domarus error
only if one of the strongest meaning responses
to both stags and Indians should be “swiftness.”
In the Gottesman and Chapman study
the common qualities were not of this kind,
and hence, the lack of difference between
schizophrenics and normals is only to be
expected. It seems likely that differences of
the sort described by Arieti could be demonstrated
in a printed syllogism test in which
the syllogisms describe objects of which the
patient has knowledge, and the attributes are
of great prominence. However, if items with
these special characteristics are necessary to
demonstrate the von Domarus principle, it
would appear that the more basic disturbance
is not an error in syllogistic reasoning, but is
instead a steepening of the gradient of
strength of meaning responses.

As noted above, this same phenomenon
of schizophrenics’ treating words of similar
meaning as synonymous has led some investigators
to hypothesize that schizophrenics show
an abnormally high and flat gradient of
semantic generalization, but that the empirical 
evidence for this has been uniformly negative. 
Again the present theory resolves this paradox 
and offers a principle which subsumes clinical 
phenomena which would appear contradictory 
when viewed in terms of the semantic gen- 
eralization paradigm. The present theory also 
yields a prediction of the kinds of words on 
which schizophrenics should show semantic 
generalization over that of normal subjects in 
tasks which are usually thought to measure it. 
These would be words of low similarity which 
have shared meaning responses of highest 
strength for both words. (It should occur less 
readily with words of high similarity because 
both schizophrenics and normals should re- 
gard them as equivalent in meaning.) This 
prediction could be tested in any of a variety 
of tasks such as a rote learning task, or in a 
design in which a conditioned response is 
learned to one word and then is tested for 
generalization to a different word of similar 
meaning. 

Predictions from this theory similarly 
follow for any design which tests for transfer 
effects with verbal materials. The prediction 
in each case would be that transfer effects of 
schizophrenics rise sharply above those of 
normal subjects with words of low similarity 
when the words share strong common mean- 
ing responses. It also follows, of course, 
that investigators who compare schizophrenics 
and normals on transfer produced by other 
variables should control for this one. 

The logic of our theory and experimental 
design has leaned heavily on a new approach 
to the measurement of meaning. Since our 
measure taps denotative meaning it yields in- 
formation other than that obtainable from 
the semantic differential (Osgood et al., 
1957). Moreover, as discussed above, it seems 
to provide a more direct approach to denota- 
tive meaning than is provided by associative 
methods. 

Although indices of similarity of meaning 
were not obtained by this method in the 
present study, they could be readily obtained 
from the degree of overlap of the ordered 
meaning responses, by much the same sta- 
tistical procedures that Garskof and Houston 
(1963) recently used to obtain an index of 
“verbal relatedness” from the overlap of
ordered verbal associates. Such an index from 
our task should be useful for research on 
semantic similarity. It appears that generali- 
zation of a response trained to a word has 
some relationship to overlap of associates 
(Bousfield et al., 1958) and there is a long 
research tradition relating paired-associate 
learning to associative connection (Horton 
& Kjeldergaard, 1961). Likewise verbal 
generalization in classical conditioning is 
related to meaning as defined by the semantic 
differential (Baxter, 1962; Dicken, 1958). 
Might an additional contribution to verbal 
generalization be made by overlap of denota- 
tive meaning as defined by the present task? 
One might expect this, if, as in the present 
study, subjects’ responses to words are
mediated by meaning responses. We would, 
of course, expect this measure of similarity 
of meaning to have a positive correlation 
with the other two, but we would not expect 
the contribution of any one of the measures 
to verbal generalization to be completely 
accounted for by another of the measures. 
This measure could also be used to deter- 
mine if denotative meaning can be ac- 
quired through classical conditioning pro- 
cedures as Staats and Staats (1957) and 
Staats, Staats, and Heard (1959) have 
claimed for connotative meaning, as measured 
by the semantic differential. It should simi- 
larly be useful for investigating a variety of 
issues concerning denotative meaning. 
BLOOD PRESSURE REACTIVITY AND RECALL 

The purpose of this research was to test the general hypothesis that Ss who 
recall their relations with their parents as having been stressful have higher 
blood pressure reactivity than do Ss who recall their parents in more positive 
terms. 2 groups of Ss were combined: (a) those who had a casual high (> 139 
mm.Hg.) or low (< 110 mm.Hg.) systolic blood pressure reading taken during 
a mass physical examination (N = 83), and (b) those with normal pressure 
(N = 24). Recall of parents was measured by a semantic differential parent 
image form and by questionnaire. Blood pressure reactivity was measured 
by degree of elevation in response to the cold pressor test. Multiple correlations 
provide support for the hypothesis with respect to recall of relations 
with mother, but not for recall of relations with father. 


Within recent years blood pressure reactivity 
has proven to be an indicator of 
“emotional” responsivity, for example, in 
situations of fear (Schachter, 1957), anger 
(Funkenstein, King, & Drolette, 1957), and 
aggressivity (Hokanson, 1961), and a useful 
variable for the study of personality dynamics. 
But despite this variety of studies, 
to date no research has been done to test the 
hypothesis that degree of vascular responsivity 
may be, at least in part, associated with 
early childhood experience. 

A review of research on blood pressure 
(McGinn, Harburg, Julius, & McLeod, 1964) 
indicates extensive empirical support for the 
following statements: 


1. Persons who show high responsivity and 
lability of blood pressure are more likely to 
develop high blood pressure at a later date 
than those who are not highly labile or 
responsive. 


2. Both persons with high blood pressure 
and those diagnosed as “neurotic” react more 
to, and recover more slowly from, stress 
experiences than do low or normal blood 
pressure persons or those with “normal” 
personalities. 

3. High blood pressure persons more frequently 
demonstrate personality characteristics 
considered “neurotic” than do persons 
with normal blood pressure. 


Excerpted from a doctoral dissertation by the 
senior author. Many thanks are extended to Robert 
[...] to Sibley W. Hoobler, MD, of the Department 
of Internal Medicine of the University of 
Michigan [...] 


These findings have led a number of 
researchers to hypothesize that hyperrespon- 
sivity to stress is a causal mechanism for the 
development of high blood pressure, and that 
this hyperresponsivity is a function of not 
having learned to handle stress in childhood. 
The relationship between “neuroticism” and 
hyperresponsivity fits logically into such a 
model as “neurotic” persons are often defined 
as possessing psychologically “inadequate” 
means of handling stress. It is possible that 
the same childhood conditions that lead to the 
development of “neurotic maladjustment” 
also result in a physiological impairment to 
systems which control reactivity levels. Such 
an argument has been proposed by Malmo 
(1957) who cites evidence to indicate that 
“continuous overarousal may result in im- 
pairment of central inhibitory mechanisms 
[p. 286]” that control physiological activity 
levels. 

This present study was designed to test the 
general hypothesis that chronic stress in 
parent-child relations leads to psychophysio- 
logical response patterns which are associated 
with higher blood pressure reactivity in young 
male adults. Stressful relations between parent 
and child were assumed to exist where the 
offspring, as an adult, recalled his parents as 
tending to have been punitive, irritating, and 
unloving while the subject was a child. Simi- 
larly, it was assumed that families in which 
parents frequently disagreed with each other, 
and in which the mother was dominant over 
the father, would create more stress for the 
child than if these conditions were minimal. 

Finally, it was assumed that the offspring’s 
perception of his parents’ attitudes and behaviors 
would more proximately influence his 
own personality development than parents’ 
reports of how they treated him. 


METHOD 
Subjects 


The 107 white, male subjects for this study con- 
sisted of two groups. The first group of 83 were 
incoming students registering for their first semester 
at the University of Michigan in September 1961. 
A single blood pressure reading was taken by one 
of five trained examiners from roughly every third 
student (in a sitting position) in a line moving 
through the mass procedures of a physical 
examination. Out of about 800 students whose 
blood pressure was determined by standard tech- 
niques, 50 subjects were selected because their systolic 
blood pressures exceeded 139 millimeters of Hg., and 
50 because their systolic pressure was less than 110 
millimeters of Hg. There were no differences in 
diastolic levels between the two groups. Of the 100 
subjects thus selected, 83 completed all phases of 
the study as paid volunteers. When these 83 subjects 
were given a second casual blood pressure reading 
several weeks later on first entering the physician’s 
office, the once “bimodal” distribution of systolic 
levels became normalized due to a predictable regres- 
sion toward the means as found by others, for 
example, Thacker (1940). Nevertheless we also 
expected that enough individuals would maintain 
these extremes of normal ranges to allow the maximal 
distribution of blood pressure to serve in analysis. 
About 33% of the 83 subjects did remain over 140 
millimeters of Hg. at their second reading; 25% 
remained below 100 millimeters of Hg. and 42% 
changed toward the normal range. 

In order to increase the size of the population 
and to further increase the normality of the blood 
pressure distribution, 24 more subjects were obtained 
from a pool of introductory psychology students 
whose systolic and diastolic blood pressures, as 
expected, were all within the same normal ranges. 
Subsequent analysis showed that for both groups 
(N = 83, N = 24) only negligible correlations existed 
between a casual reading taken in the physician’s 
office and blood pressure reactivity—the dependent 
variable in this study. Nor for the first group 
(N = 83) was there any correlation between their 
initial systolic readings taken on the registration 
line and reactivity measures taken later. 
The average age of the total group of 107 subjects 
was 19.3 years; 77% were between the ages of 18 
and 21. Ninety-three percent were from homes where 
the parents had never been separated or divorced 
and were still living together. 


Recall of Parent-Child Relations 


Two weeks after selection and, in all cases, before 
being tested by the research physician, subjects 
were administered a Family Questionnaire given 
during a single test session. This form developed by 
Harburg (1962) had two parts. The first part elicited 
background information and responses to a series of 
Likert-scaled items about subjects’ parents when 
subject was a child. Being a child was defined as 
from earliest memories until about 14 years old. The 
items included for analysis in this report are: fre- 
quency of physical punishment by father and 
mother; amount of affection demonstrated toward 
the son by each parent; the son’s view of the 
dominance of the father or mother in the family 
when he was a child.3 

The second part of this form consisted of the 
Parent Image Differential (PID) which has been 
described in detail elsewhere (Harburg, 1962; 
McGinn, 1962)4 and is therefore only briefly de- 
scribed here. This form is an adaptation of the 
semantic differential technique devised by Osgood, 
Suci, and Tannenbaum (1957). The subject is asked 
to read a concept at the top of the page, and rate 
its meaning on a number of scales below the concept 


3 The punishment items were: 


1. How frequently did (parent) punish you physi- 
cally? (6-point scale) 

2. How hard were you ever hit (regardless of 
how often)? (6-point scale) 


The demonstration of affection item was: 


1. How often did your parents show affection to 
you when you were a child? (6-point scale) 


The index of dominance was the sum of responses 
to these questions: 


1. Which parent would you say was actually the 
head of the family when you were a child? 

2. If your father wanted your mother to do 
something that she really didn’t want to do, what 
is your estimate of the chance that she would do it? 

3. If your mother wanted your father to do 
something that he really didn’t want to do, what is 
your estimate of the chance that he would do it? 

4. Which parent would you estimate as usually 
having the final word in decisions concerning the 
family, when you were a child? 

5. (If parents disagreed about family matters) 
Which parent usually got his or her way in the 
final decision? 


4 The Parent Image Differential was developed 
jointly with Gerald P. Ginsburg, now at the 
University of Nevada, Reno. 
on the page. The six concepts used in this study 
were: 


How My Father Treated Me When I was a 
Child 
How My Mother Treated Me When I was a 
Child 
How My Father Taught Me When I was a Child 
How My Mother Taught Me When I was a 
Child 
How My Father Disciplined Me When I was 
a Child 
How My Mother Disciplined Me When I was 
a Child 


A sample of the 15 scales included below each 
concept is: 


It should be mentioned that in this study cooperation 
from subjects was such that there was almost 
no missing data on any item in the whole 
questionnaire. 

The scale responses for all 107 subjects to each 
of the concepts were factor analyzed by varimax 
rotation, and yielded a set of 12 factors in all, each 
factor pertaining to a given concept. Those scales 
with factor loadings of .40 or more and not loaded 
higher on any other factors were selected as being 
salient. The factor score for each individual was 
computed by simply adding the scores of the salient 
scales for the given factor. Most of the factor scores 
were normally distributed as were the individual 
scale scores. But some scales, for example, 
“warm-cold” with the concept How My (Parent) Treated 
Me, were skewed toward the socially desirable 
endpoint, i.e., “warm.” The factor score made up of 
such scales was also thereby skewed. Nevertheless, 
data obtained from a group of college females, 
selected for high and low acquiescence response sets, 
indicates that neither of the major “affection” or 
[...] factors in the PID was significantly 
correlated with standard measures of either a “social 
desirability” set or an acquiescence response set.5 

In the following description of the factors the 
[...] of language compels the use of extreme 
adjectives which mark the ends of the scales. 
Nevertheless, each factor score is a continuous, normally 
distributed variable, subject to the skewness of 
certain scales such as affection. These endpoint 
adjectives give the direction of Pearsonian 
correlations which were used in this analysis to facilitate 
the multiple regression computer program. 
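
The scoring rule described above (salient scales picked by loading, then summed) can be sketched as follows. This is only an illustration under stated assumptions: the loading matrix and ratings are invented, the function names are hypothetical, absolute loadings are compared, and no reversal of negatively loaded scales is attempted, since the text does not say how those cases were handled.

```python
import numpy as np

def salient_scales(loadings, threshold=0.40):
    """For each factor, return the indices of its salient scales.

    A scale counts as salient for a factor when its absolute loading on
    that factor is at least `threshold` and no other factor has a higher
    absolute loading for the same scale (ties go to the first factor).
    """
    abs_load = np.abs(np.asarray(loadings, dtype=float))
    best_factor = abs_load.argmax(axis=1)   # highest-loading factor per scale
    best_value = abs_load.max(axis=1)       # size of that highest loading
    return [
        np.flatnonzero((best_factor == f) & (best_value >= threshold))
        for f in range(abs_load.shape[1])
    ]

def factor_scores(ratings, loadings, threshold=0.40):
    """Unweighted factor scores: the sum of each subject's salient-scale ratings."""
    ratings = np.asarray(ratings, dtype=float)
    return np.column_stack([
        ratings[:, idx].sum(axis=1)
        for idx in salient_scales(loadings, threshold)
    ])

# Hypothetical data: 4 scales, 2 factors, 2 subjects.
loadings = [[0.70, 0.10], [0.45, 0.50], [0.30, 0.20], [0.05, 0.62]]
ratings = [[5, 3, 4, 2], [1, 6, 2, 7]]
scores = factor_scores(ratings, loadings)
print(scores)
```

In this toy case the third scale loads below .40 everywhere and so contributes to no factor score, which is the behavior the selection rule implies.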

Disregarding concept, the factors for father were: 


1. Potency—father was described as “stern,” 
[...] “severe”; or “mild,” “soft,” and 
“lenient.” This factor essentially gives a 
degree-of-severity index. 

2. Support—this described a relationship between 
father and son which varied along a dimension 
of warm and skillful encouragement. Salient 
scales were “consistent-inconsistent” and “skillful- 
awkward.” 

3. Inclusion—this described the extent to which 
father and son were jointly involved in activities, 
for example, “rarely did things together with me- 
often did things together with me.” 

4. Irritability—this was defined by such scales 
as “easy to irritate him-hard to irritate him,” and 
“impatient with me-patient with me.” 

5. Intropunitiveness—subject described his reac- 
tion to discipline as blaming himself, or as blaming 
father, feeling guilty or feeling resentment. 

6. Demand style—this was so labeled because 
the salient scales suggested a continuum of 
rationality-irrationality in father’s demands for 
behavioral conformity, for example, “usually 
ordered me-usually persuaded me.” 


The potency, irritability, and intropunitiveness 
factors for mother contained almost identical scales 
as those for father, but only the intropunitiveness 
factor scores for father and mother were significantly 
correlated (r=.40). The other factors for mother 
were: 


1. Affection—this was close to the father support 
factor, but the security of the environment pro- 
vided by mother seemed based on love while that 
given by father was based upon the consistency 
of his behavior. However, mother affection was 
significantly (r = .33) related to father support. 

2. Justice—this was the consistency dimension 
like the father support factor, and was just sig- 
nificantly (r=.21) related to that factor. Its de- 
fining scales were “just-unjust” and “inconsistent- 
consistent.” 

3. Tolerance—this factor represented the degree 
to which subject reported his mother as allowing 
or encouraging independent behavior. It had no 
clear equivalent among the father factors. 


Reliability and validity of the PID. The concurrent 
validity of the PID factors is shown by their 
correlations with the questionnaire items: The 
father potency factor score was correlated with the 
answers to the item about the frequency of father’s 
punishment (r=.52) but not to the frequency of 
mother’s punishment, nor to ratings of father’s 
affection or mother’s affection. The same result holds 
for the mother potency factor. Similarly, the father 
support factor was correlated with the question 
about father’s affection (r=.52) but not with 
mother’s affection; the same result holds for the 
mother affection factor. 

TABLE 1 

TEST-RETEST RELIABILITIES FOR PID FACTORS 
AFTER 11-MONTH INTERIM (N = 26) 

PID factors               r 
Father 
  Potency                .65 
  Support                .17 
  Intropunitiveness      .76 
  Demand style           .52 
Mother 
  Potency                .17 
  Affection              .64 
  Justice                [illegible] 
  Irritability           .48 
  Intropunitiveness      .66 

Note. Three of the 12 factors are not presented as data for 
only two concepts were collected. All Pearsonian correlations 
> .50 are significant beyond the .01 level, df = 24. 


Test-retest reliability was measured when 26 of the 
107 subjects took four of the PID concepts twice. 
For 24 of these subjects the second administration 
occurred more than 11 months after the first; for 
the other 2 the administrations were 6 months apart. 
Factor scores for the factors obtained in the initial 
analysis were computed and correlated to yield 
test-retest reliability coefficients. These reliabilities are 
presented in Table 1. Only one of the coefficients 
was below .50; the median for the others was about 
.70. In an as yet unpublished report (Ginsburg, 
McGinn, & Harburg, 1964) it has been found that 
successive samples from populations of male (and 
female) college students and a Mexican group of 
males, controlled on social class, produced signifi- 
cantly similar PID factor structures, testing with 
the Kaiser (1960) factor similarity program. In 
another study the same scales were used by both 
of the parents of 83 of the subjects to describe “How 
I Believe I (Treated/Disciplined) My Son When He 
Was a Child”; here again significantly similar factor 
structures emerged from parents’ reports and their 
sons’ reports. 
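
The note to Table 1, that correlations above .50 are significant beyond the .01 level with df = 24, can be checked with the usual conversion of r to t. The sketch below assumes a two-tailed test (the text does not say) and hard-codes the tabled critical value of t for df = 24; the function name is hypothetical.

```python
import math

def t_from_r(r, n):
    """t statistic for testing a Pearson r against zero, with df = n - 2."""
    df = n - 2
    return r * math.sqrt(df / (1.0 - r * r))

# Tabled two-tailed .01 critical value of t for df = 24 (an assumption that
# the note refers to a two-tailed test).
T_CRIT_01_DF24 = 2.797

for r in (0.48, 0.50, 0.76):
    t = t_from_r(r, 26)
    print(f"r = {r:.2f}  t = {t:.2f}  beyond .01: {t > T_CRIT_01_DF24}")
```

Under these assumptions a coefficient of .50 just clears the criterion while .48 does not, which agrees with the cutoff given in the table note.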

Because the PID purports to measure perceptions 
which subjects had of their parents during child- 
hood, and because there is no independent way to 
measure perceptual content other than self-report, 
it is impossible to directly support the validity of 
the instrument. However, if it is possible to demon- 
strate that the perceptions measured are relatively 
stable, one can argue that the perceptions originated 
in childhood and are therefore “valid” for the 
purposes of this research. Ausubel (1958) states 
that the only way to measure validity of perceptual 
instruments is by stability over time, asserting that 
perceptions formed in childhood should continue 
to exist through adolescence and thus be repeatedly 
measurable. 


Blood Pressure Reactivity 


The full procedure and the list of other blood pres- 
sure measures taken on the subjects are described in 
McGinn (1962). This paper discusses only the results 
obtained with the cold pressor test. Each subject 
was given an hour examination by the research 
physician. The subject reclined unclothed under a 
sheet on a bed in the room provided by the Uni- 
versity of Michigan Public Health Service. During 
the last 5 minutes of the first 20-minute rest period, 
three blood pressure readings were taken by standard 
auscultatory method. The mean of these measures 
is the base-line resting average. (During this 
20-minute resting time the average level of blood 
pressure decreased for some subjects, but remained 
the same or even increased for most subjects. As a 
group most subjects seemed to relax under these 
conditions.) After the resting measures, the 
physician performed the cold pressor test. The subject’s 
hand was placed in a small bucket filled with ice 
cubes and water at a temperature of about 2 degrees 
centigrade. Subject’s hand remained submerged to 
the wrist for 1 minute. Blood pressure readings 
were taken 30 and 60 seconds after immersion, the 
latter immediately before removal of the hand. The 
difference in levels between the highest of the 
cold pressor readings and the resting average is the 
degree of cold pressor reactivity. Details of this 
procedure and a demonstration of its reliability are 
presented by Windesheim, Roth, and Hines (1955); 
it is a standard measure in medical research. No 
subjects showed a decline in blood pressure as a 
result of the test, and most showed an increase in 
both systolic and diastolic levels. There was no 
relation between resting level of blood pressure and 
degree of reactivity. Other studies have shown that 
anticipation of pain, and psychological traits such 
as anxiety levels, have been associated with increases 
in blood pressure levels during the cold pressor 
tests (Malmo & Shagass, 1952; Thacker, 1940). 
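
The reactivity score defined above (highest cold pressor reading minus the mean of the three resting readings) amounts to a one-line computation. The sketch below uses invented systolic readings and a hypothetical function name; it illustrates the definition and is not the authors' own procedure.

```python
from statistics import mean

def cold_pressor_reactivity(resting_readings, cold_readings):
    """Highest cold pressor reading minus the resting base-line average."""
    baseline = mean(resting_readings)      # mean of the three resting readings
    return max(cold_readings) - baseline   # readings at 30 and 60 seconds

# Hypothetical systolic readings in mm. Hg.
resting = [118, 120, 122]
cold = [134, 141]
reactivity = cold_pressor_reactivity(resting, cold)
print(reactivity)
```

The same function applies to diastolic readings, so each subject receives one systolic and one diastolic reactivity score.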


RESULTS 


Table 2 presents the correlations between 
the PID factor scores and systolic and 
diastolic blood pressure reactivity to the cold 
pressor test. Only 1 of the 12 correlations for 
father was significant, that between diastolic 
reactivity and father inclusion. Subjects who 
reported that their father less frequently 
included them in his activities tended to have 
higher diastolic reactivity. Eight of the 12 
correlations for mother were significant. 
Subjects who had higher systolic reactivity also 
reported their mothers tended to be less mild, 
soft, and lenient than mothers of subjects with 
lower systolic reactivity. Similarly, higher 
systolic reactors reported their mothers as 
tending to be less warm, concerned, and 
encouraging; less just and consistent; showing 
lower tolerance for independent behavior 
and making subject feel resentment 
when punished. (Systolic and diastolic 
reactivity were significantly correlated: r = .50, 
p < .01.) 


TABLE 2 

PRODUCT-MOMENT CORRELATIONS OF COLD PRESSOR 
REACTIVITY WITH PID FACTOR SCORES (N = 107) 

                                Cold pressor reactivity 
PID factors(a)                  Systolic      Diastolic 
Father 
  Potency                        +.04          +.02 
  Support                        -.05          -.07 
  Inclusion                      -.06          [illegible] 
  Irritability                    .00          +.01 
  Intropunitiveness              -.12          -.07 
  Demand style                   +.12          +.04 
Mother 
  Potency                         .18*         [illegible] 
  Affection                      -.21*         -.24* 
  Justice                        [illegible]   [illegible] 
  Tolerance for independence     -.06          -.22* 
  Irritability                   +.01          -.08 
  Intropunitiveness              -.21*         -.24* 

a [...] from low to high. 
* p < .01. 

Table 3 presents the correlations between 
blood pressure reactivity and the pertinent 
items from the Family Questionnaire. Systolic 
reactivity was significantly negatively related 
to the amount of affection demonstrated by 
mother. Subjects with higher diastolic reactivity 
reported having received more frequent 
physical punishment from their mother, and 
perceived that she was more dominant over 
their father than did subjects with lower 
diastolic reactions. 


TABLE 3 

PRODUCT-MOMENT CORRELATIONS OF COLD PRESSOR 
REACTIVITY WITH FAMILY QUESTIONNAIRE 
ITEMS (N = 107) 

                                        Cold pressor reactivity 
Family questionnaire items(a)           Systolic    Diastolic 
Father 
  Frequency of physical punishment        .12         .09 
  Intensity of physical punishment       -.13        -.01 
  Amount of affection demonstrated       -.03        -.05 
Mother 
  Frequency of physical punishment        .16         .33* 
  Intensity of physical punishment       -.07         .07 
  Amount of affection demonstrated       -.21*       -.18 
Dominant Parent Index(b)                  .18*        .22* 
Frequency of parental disagreement        .12         .07 

a [...] are from low to high. 
b High score = mother dominant. 
* p < .01. 


TABLE 4 

MULTIPLE CORRELATION (R) AND b COEFFICIENTS USING 
PID SEMANTIC FACTORS AND FAMILY QUESTIONNAIRE 
ITEMS AS THE INDEPENDENT VARIABLES AND 
SYSTOLIC COLD PRESSOR REACTIVITY 
AS THE DEPENDENT VARIABLE 
(N = 107) 

Variable                                          R              b 
1. Mother demonstration of affection 
   (less often showed affection)                 .21           -2.20 
2. Mother intropunitiveness 
   (made me feel resentment)                     .27           -0.30 
3. Mother tolerance for independence 
   (suppressor variable)                         .32            0.36 
4. Father intensity of punishment (mild)         .35            1.76 
5. Father frequency of punishment (more)         .41           -3.06 
6. Mother dominant over father                   .48           -0.80 
7. Father demand style 
   (always explained why he punished)            .51            0.24 
8. Mother justice 
   (less just, less consistent)                  [illegible]**  -0.36 

Constant term = 9.06 
** p < .01. 


Tables 4 and 5 present the PID factors 
and Family Questionnaire items which loaded 
in a multiple correlation equation predicting 
to systolic and diastolic reactivity, respectively.6 
In Table 4 we see that subjects with 


6 The program used for this analysis is a stepwise 
regression procedure which selects one independent 
variable at a time and enters it into the regression 
equation. The selection of the particular independent 
variable is based on the amount of increase it causes 
in the multiple correlation. This program, developed 
by F. W. Westervelt at the University of Michigan 
Computing Center, requires the user to specify a 
minimum F level for an independent variable to be 
entered in the equation. For the purposes of this 
study, the F level was set at .00 in order that all 
26 independent variables would be used. We are 
reporting all independent variables in the order of 
their appearance in the equation; each entered the 
prediction equation at less than the .10 level. The 
remaining variables which entered the equation 
greater than this level are not reported. Suppressor 
variables are identified (following Guilford, 1956) 
as those which had simple correlations with the 
dependent variable which were negligible and/or 
with a sign opposite to that which they entered 
the prediction equation. The b coefficients presented 
are those from the final equation. Our thanks to 
TABLE 5 

MULTIPLE CORRELATION (R) AND b COEFFICIENTS USING 
PID SEMANTIC FACTORS AND FAMILY QUESTIONNAIRE 
ITEMS AS THE INDEPENDENT VARIABLES AND 
DIASTOLIC COLD PRESSOR REACTIVITY 
AS DEPENDENT VARIABLE 
(N = 107) 

Variable                                          R         b 
1. Mother punishment (frequently)                .33      -2.66 
2. Mother potency (less mild)                    .39      -0.18 
3. Mother intropunitiveness 
   (made me feel resentment)                     .42      -0.46 
4. Mother irritability 
   (suppressor variable)                         .45       0.58 
5. Father demand style 
   (suppressor variable)                         .46       0.40 
6. Father inclusion (rarely did 
   things together with me)                      .49**    -0.38 

Constant term = 39.63 
** p < .01. 


high systolic reactivity to the cold pressor 
test were those who described their mother as 
tending to: less often show affection; make 
subject feel resentment; be dominant 
over their father; and be less just and con- 
sistent. Father was described by these sub- 
jects as punishing frequently but mildly, and 
always explaining why he punished. The 
degree to which mother was tolerant of inde- 
pendent behavior was a suppressor variable. 
Subjects with high diastolic reactivity (Table 
5) described mother as tending to: punish 
more frequently; be less mild, soft, and 
lenient; and make subject feel resentment. 
Father was described as infrequently doing 
things with subject. Father demand style and 
mother irritability entered the equation as 
suppressor variables. Both of the Rs are 
significant at the .01 level. 
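
The stepwise procedure described in Footnote 6 enters one independent variable at a time, choosing at each step the variable that most increases the multiple correlation. A minimal sketch of that idea follows. It is not the Michigan Computing Center program: the data are simulated, the function names are hypothetical, and the F-level entry test and the identification of suppressor variables are omitted.

```python
import numpy as np

def multiple_r(X, y):
    """Multiple correlation between y and its least-squares prediction from X."""
    A = np.column_stack([np.ones(len(y)), X])       # add an intercept column
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    ss_total = ((y - y.mean()) ** 2).sum()
    r_squared = 1.0 - (resid ** 2).sum() / ss_total
    return float(np.sqrt(max(r_squared, 0.0)))

def forward_stepwise(X, y):
    """Return (variable index, cumulative R) in order of entry."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    remaining = list(range(X.shape[1]))
    chosen, history = [], []
    while remaining:
        # Enter the variable giving the largest multiple R at this step.
        best = max(remaining, key=lambda j: multiple_r(X[:, chosen + [j]], y))
        chosen.append(best)
        remaining.remove(best)
        history.append((best, multiple_r(X[:, chosen], y)))
    return history

# Simulated data: y depends strongly on column 1 and weakly on column 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 3))
y = 3.0 * X[:, 1] + 0.5 * X[:, 0]
steps = forward_stepwise(X, y)
for index, r in steps:
    print(index, round(r, 3))
```

The cumulative R column produced this way has the same form as the R columns of Tables 4 and 5, where each entry is the multiple correlation after the listed variable has entered.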


DISCUSSION 


The assumptions used to guide this research 
were generated from a consideration of what 
kinds of parent-child relationships lead to 
“neurotic” behaviors. Because extant theory 
and research does not specify the relation- 
ship between parent-child relations and 
physiological response patterns, and because 


James Wigle, mathematician, and Lindsey Crooks, 
research assistant, for their help in the statistical 
analysis. 
research has shown a relationship between 
“neuroticism” and hyperresponsivity, it was 
hoped that such a strategy would pay off in 
leads for future research. The data suggest 
that this decision was a good one, at least 
with respect to variables describing mother- 
child relations. 

Assuming that “adjustment” would be 
reflected in the physiological response patterns 
of the subject, and that persons with poor 
“adjustment” react more to stress, it was 
expected that young male adults who recalled 
their parents as having tended to demonstrate 
little affection, or tending toward frequent 
physical punishment, or as being inconsistent 
in their treatment of them, or as irritable or 
arbitrary in their relationship with them, 
would be more likely to have higher blood 
pressure reactivity than subjects who did not 
describe their parents as having had such 
tendencies. As the data show, these expectations 
were largely borne out for mother, both 
in the simple and the multiple correlations. 
With respect to recall of father behavior and 
responsivity, however, there are few relationships 
in the data which fit the model. With 
the exception of the father inclusion variable 
(in which low interaction between father and 
son was associated with high reactivity) positive 
aspects of father behavior are associated 
with hyperresponsivity. There are few such 
relationships. 

Another set of variables was intended to 
measure the presence of a male model for 
the subjects to imitate, and it was assumed 
that the absence of such a model in the father 
would lead to inappropriate behavior on the 
part of the offspring. Inappropriate behaviors 
can be assumed to generate more stress for 
the child and thus lead to overresponsivity. 
It was expected, therefore, that subjects who 
characterized their mother as more dominant 
than their father and who felt they did not 
share activities with their father would be 
more responsive than subjects who saw their 
father as dominant or parents as equally 
dominant, and who recalled that they were 
included in activities with their father. 
These expectations were borne out, there 
being a relationship between responsivity and 
dominance of the mother on one hand, and 
low father inclusion on the other. 




The results seem consistent with those obtained 
by researchers in other areas. For example, 
poor premorbid schizophrenics have 
been reported to have punitive mothers who 
are dominant over their husbands (Farina, 
1960; Garmezy, Clarke, & Stockner, 1961). 
Funkenstein et al. (1957) found that high- 
anxiety subjects more often reported unaf- 
fectionate and domineering mothers, and 
fathers who played a minor role in the 
authority structure of the family. It should 
be noted that here too the role of the father 
is not clearly defined. 

To support the general hypothesis pro- 
posed, that stress in parent-child relations 
leads to higher blood pressure reactivity in 
adulthood, it is necessary to show that the 
measures of parent-child relations are actually 
tapping relations which existed during child- 
hood. We have already stated that it is not 
possible to do this directly with adult sub- 
jects, and that at this stage the best measure 
of validity is a demonstration of stability of 
recall of parents over time. But it could also 
be argued that “neurotic” subjects who are 
more reactive to stress are also more negative 
in their recall of parents. The present data do 
not permit us to discount this possibility. It is 
also possible that subjects in this study who 
were highly responsive were highly responsive 
as children, and provoked stressful relations 
with their parents. Furthermore, it is possible 
that not all children are negatively affected 
by stressful parents or that some children 
are highly reactive to stress without having 
experienced a stressing childhood. 
In closing it must be noted that the results 
in this study were obtained from a white, 
college male, middle-class sample, in which 
extremes of reactivity to physical stress and 
maltreatment by parents are perhaps not well 
represented. On the other hand, such a background 
might well increase sensitivity to stress 
from interpersonal and physical sources. 
Consider, for example, the effects of norms of 
[...] for others, high protection from 
[...] pain, and the psychological burden 
of modern-day college life. Finally, it would 
appear that the cold pressor test may be a 
useful addition to the armamentarium of 
psychologists who measure stress. 
ALTERATIONS IN SELF-CONCEPT UNDER HYPNOSIS 


EARL D. MARKWELL, Jr. 
Humboldt State College 


This study was designed to help answer 2 questions: (a) Do attitudes toward 
the self change under hypnosis? (b) What are the correlates in the hypnotic 
process and situation of such alteration? Measures of self-favorability, self- 
acceptance, and self-criticalness both in the waking and hypnotic state were 
obtained from 44 volunteers via Gough’s Adjective Check List. Analysis of 
variance results of those capable of amnesia when compared with those 
responding differently indicated that such attitudes did significantly change. 
Significant correlations were found between contradictory hypnotic and waking 
ACL answers and measures both of hypnotic depth and relationship to the 
hypnotist. MMPI profile differences were reported for those whose attitudes 
became more self-accepting as compared with increased self-criticalness. 


Psychologists have for many years concerned 
themselves with the concept of the self in their 
research and clinical practice (Allport, 1943, 
1955; Bertocci, 1945; Lowe, 1961; Raimy, 1943; 
Rogers, 1951; Rogers & Dymond, 1954; Snygg 
& Combs, 1959). Hypnotic techniques have found 
a place in experimental investigations of the self 
(Brenman & Gill, 1959; Dorcus, 1956; Erickson, 
1955; Farberow & Sarbin, 1950; Klemperer, 
1954, 1955). This literature reveals basic differ- 
ences of opinion about the effect of hypnosis on 
alterations of attitudes toward the self. White 
(1941) states 


hypnotism is one of the few experimental techniques 
applicable to human beings whereby it is possible 
to produce major changes in the organization of 
behavior. Without discomfort or danger to the subject,
provided certain precautions have been taken,
it is possible to effect an extensive alteration in those
patterns of experience which constitute the self
. . . [p. 477].


Statements such as this have stimulated a few 
experimental psychologists to devote their atten- 
tion to such questions as: Do attitudes toward 
the self change under the influence of hypnosis? 
And, if there is such change, what are the cor- 
relates in the hypnotic process and hypnotized 
subject of such alteration? 

Farberow and Sarbin (1950) in their clinical 
study of the role of the self in hypnotic age 
regression say that 


apparently a “core” personality (self) is revealed 
which persists recognizably . . . in the various hyp- 
notic situations [p. 305]. 


Evidence for this statement comes from their
qualitative evaluation of Rorschach, Draw-A-Person,
and handwriting data for six subjects
who were regressed to 18-, 13-, 6-, and 3-year
age levels. Comparison of the findings with
subjects who simulated “regression” to 6 years
of age constituted their only controls. Sarbin
(1950) conjectures theoretically that favorable
motivation for change under hypnosis depends on
the degree of participation or congruence of the
subject’s self-concept and the role of the hypnotic
subject as structured by the hypnotist. Brenman
and Gill (1959; Brenman, Gill, & Hacker, 19
believe from their clinical experience that alterations
of the ego state do occur as a result of
hypnosis which allow for the circumvention or
weakening of defense mechanisms and pêl
therapeutic restructuring. Hypnosis alters the
“toning down, inhibiting, or checking” functions
of the self or ego permitting four kinds of possibilities
of change: alterations of body image,
changes in mode of thought, releases of affect,
and alterations of the control and channelization


of a case of anxiety hysteria. Bellak (1
postulates an ego-psychological theory of hypnosis
based on similar beliefs. Leuba (1941) in
connection with his work in conditioning experiments
interpreted his data to show that hypnosis
could be used to limit an experimental subject’s
attention, to eliminate the effects of extraneous
stimuli, and to help control variables (thi
self-initiated stimuli partially connected with
self-conception) within the subject’s own behavior.
Conn (1949) found that hypnosis permitted
a creative, dynamic experience to develop
in which the patient more adequately rti
what self-acceptance really was and in which the
trance was useful in confronting the experiencing
portion of the self with that part of the
personality excluded from consciousness. He saw
hypnosis as a unifying interpersonal experience
which he labeled hypnosynthesis. Kline (1955,
1958) gives great emphasis to the role of the
self in determining the response of either a subject
or a patient to hypnosis.


. . . tremendous differences . . . exist in relation to
the orientation of the self and the emerging physical
and emotional attitudes about the self that are part
of the hypnotic state and may be expressed only as
the hypnotic state is extended in time and explored
in terms of the subjects’ own reaction to it. Failure
to deal with this aspect of the hypnotic subject or
patient’s response to himself and to the values that
develop in relation to the self are serious omissions
in the evaluation and understanding of the meaning
of the hypnotic state to an individual, whether he
be subject or patient [Kline, 1958, p. 85].


Kline’s (1955) definition of hypnosis makes ego 
and ego functioning central to understanding its 
nature. Klemperer (1955) made use of the Verbal 
Self-Portrait Test to show that hypnoanalytic 
patients could reveal their psychic traits, conflicts,
and unconscious motives through spontaneous
self-portraits. Asking the patient “If you
were an accomplished artist, how would you
paint yourself?” she contrasts their responses to
the “as if” situation of this test with spontaneous
self-concepts manifested in regression and revivification
techniques. Support for her interpretations
of the meaningfulness of such material
comes from hypnoanalytic case information.

Antagonistic conceptions emerge from a review
of these studies, some suggesting that the self
changes and others that it may not. The evidence
indicates that most of the investigators do believe
that certain aspects of the self or ego do
change with hypnosis. The present study attempts
to determine the influence of hypnosis on self-concept
as revealed in modification of attitudes
of self-favorability, self-acceptance, and self-criticism
by experimental and statistically controlled
methods.


METHOD 
Measures


Gough’s (1953) Adjective Check List, hereafter
abbreviated the ACL, was selected as the instrument
to collect data on self-attitudes. This list is composed
of 300 adjectives arranged alphabetically from
“absent-minded” to “zany,” and each individual is
instructed to signify by checking those he considers
self-descriptive. It was adapted for use here by
having the subjects give a verbal response during
the waking and hypnotic state rather than
asking them to check in writing those adjectives
which were self-descriptive. This adaptation had
been standardized and tested for reliability in earlier
research (Woolington & Markwell, 1962). The ACL
results allowed the computation of self-accepting,
self-critical, and self-favorable scores derived from
ratios previously established in the development of
the instrument.

In view of prior experimental data suggesting 
that the interpersonal relationship between hypnotist 
and subject was significant to any change in self- 
attitude (Kline, 1955, 1958; Sarbin, 1950), a measure 
of the subject’s relationship to the experimenter was 
obtained by adapting a scale derived from Q sort 
procedures (Fiedler, 1950). Fiedler’s work indicated 
an experimental method by which one might measure 
a subject’s agreement or disagreement with con- 
ceptions of the “ideal” therapeutic relationship as 
structured from judgments made by both professional 
psychotherapists and by “lay” people. Scoring the 
responses of each of the present experimental and 
control subjects against this theoretical ideal made it 
possible to derive some measure of their relation- 
ship to the hypnotist in quantitative terms. In 
adapting this scale for the present purpose the 
word therapist was replaced with hypnotist and the 
agreement or disagreement with the ideal direction 
was measured by assigning scores from 7 to 1 for 
the items of the scale most characteristic to least 
characteristic of an ideal therapeutic relationship. 

Previous study (Woolington & Markwell, 1962) 
indicated a tendency for subjects given a suggestion 
of amnesia to show alterations in self-attitude. Per- 
sonality factors may help explain a subject's ability 
to so respond (Weitzenhoffer, 1953). It was decided 
that the Minnesota Multiphasic Personality Inven- 
tory (MMPI) was the most suitable instrument to 
measure some of the possible differences in person- 
ality characteristics of the subjects of this experiment 
and would be helpful in understanding how person- 
ality differences might relate to changes in self- 
attitude. This instrument is in common clinical and 
experimental use and the reader unfamiliar with 
it should consult the manual and voluminous litera- 
ture available (Hathaway & McKinley, 1951). 


Subjects 


The experimental subjects consisted of 20 volunteers
from undergraduate psychology classes. All
were selected for study from a larger group of
volunteers because they were capable of entering a
relatively deep level of hypnosis, and so as to maintain an
equal representation of males and females. Data from
the ACLs collected on these experimental subjects
were compared with similar measures collected on a
control group of 24 volunteers equated for age, sex,
and college status (presumed psychological naivete).
All experimental and control subjects were volunteers 
for hypnotic experiments and can be assumed to 
vary from unselected college classmates in the way 
volunteers significantly differ from nonvolunteers 
for hypnotic experiments (Martin & Marcuse, 1957). 


Procedure


All experimental and control subjects were seen 
individually on two separate occasions 1 week apart. 
When the subjects arrived for the first session they 
were given the following instructions: 


I am going to read you a list of adjectives. Please 
listen to them carefully and signify by saying 
“yes” or “no” whether or not you consider each 
adjective self-descriptive. Don’t worry about 
duplications, contradictions, and so forth. Work 
quickly and don’t spend much time on any one 
adjective. Try to be frank, and check those ad- 
jectives which describe you as you really are, and 
not as you would like to be. 


After taking the ACL at the start of the first 
session the experimental and control subjects were 
induced into as deep a hypnotic state as it was 
possible for them to attain. Induction procedures 
were standardized by quoting in exact detail the 
directions for the eye-fixation technique given by 
Weitzenhoffer and Hilgard (1959, pp. 13-18, Form 
A). The depth of hypnosis was determined through 
the use of the Davis and Husband (1931) scale, by 
assigning a quantitative score of 1 point for each 
suggestion executed successfully from the following 
sequence of suggestions patterned after those given 
by Wolberg (1948):


1. Fluttering of eyelids 

2. Heaviness in extremities 

3. Drowsiness 

4. Closing of eyes 

5. Catalepsy of eyelids 

6. Catalepsy of limbs (arm) 

7. Automatic movements (hand rotation) 

8. Disturbances in cutaneous sensations (heat and 
cold) 

9. Partial analgesia (glove anesthesia) 

10. Ability to talk in the trance without awakening 

11. Fantasy and dream induction (theater visual- 
ization) 

12. Simple posthypnotic suggestion (posthypnotic 
blinking) 

13. Hallucinations (auditory hallucination of bell 
ringing) 

14. Ability to open eyes without affecting trance 

15. Positive posthypnotic hallucination (auditory) 

16. Negative posthypnotic hallucination (auditory) 


This method was developed from procedures reported 
by Barry, MacKinnon, and Murray (1931) and 
Rosenzweig and Sarason (1942). The control group 
was then released for 1 week. 
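The depth score described above is a simple count: 1 point for each of the 16 suggestions the subject carried out. A minimal sketch in Python, assuming each suggestion is recorded as passed or failed (the function name and the sample record are illustrative and do not come from the original procedure):

```python
# The 16 suggestions listed above, in the order they were given.
SUGGESTIONS = [
    "Fluttering of eyelids",
    "Heaviness in extremities",
    "Drowsiness",
    "Closing of eyes",
    "Catalepsy of eyelids",
    "Catalepsy of limbs (arm)",
    "Automatic movements (hand rotation)",
    "Disturbances in cutaneous sensations (heat and cold)",
    "Partial analgesia (glove anesthesia)",
    "Ability to talk in the trance without awakening",
    "Fantasy and dream induction (theater visualization)",
    "Simple posthypnotic suggestion (posthypnotic blinking)",
    "Hallucinations (auditory hallucination of bell ringing)",
    "Ability to open eyes without affecting trance",
    "Positive posthypnotic hallucination (auditory)",
    "Negative posthypnotic hallucination (auditory)",
]


def depth_score(passed):
    """Return 1 point for each suggestion executed successfully."""
    if len(passed) != len(SUGGESTIONS):
        raise ValueError("expected one pass/fail flag per suggestion")
    return sum(1 for ok in passed if ok)


# Hypothetical subject who carried out the first 11 suggestions only.
example_record = [True] * 11 + [False] * 5
example_score = depth_score(example_record)
print(example_score)
```

Scores on this sketch therefore range from 0 to 16, with higher values indicating a deeper trance.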

The experimental subjects were awakened and
the presence of an amnesic state checked in a
manner suggested by Weitzenhoffer (1957) according
to the presence of at least three of the five following
criteria:

1. Relative unawareness and altered perception
of the passage of time noted just prior to entering
and upon awakening from hypnosis. Upon being
awakened from the trance the subjects were asked
to estimate the number of minutes they had been
under hypnosis. This estimate was compared with
the actual recorded time value and a judgment of
a significant difference noted if the estimated value
varied 10 or more minutes from the “real” value.

2. A reported feeling of detachment and of being 
an observer to the behavior elicited in the tests 
for depth of trance given above. This was judged 
from the subject’s spontaneous remarks about the 
trance experience and from answers given to the 
question, “Describe in as much detail as you can 
without strain what you felt like during the trance?” 
Although of necessity somewhat subjective the 
presence of such feelings was fairly easy to score 
from the clear-cut nature of the comments of the 
subjects without further detailed probing. 


3. Fuzzy, or only partial recall, for events in hypnosis.
Amnesia was judged to be present from both
the quality and quantity of the recall of events
during the trance at the time of the posttrance
interrogation of the subject. Each subject’s amnesia,
for this purpose, was tested by asking him to recall
as many of the tests (experiences) specifically requested
of him as the trance was gradually deepened
or increased. Quantitatively amnesia was deemed to
be present if the subject could only recall six or
fewer of the experiences used to test the depth of
the trance as noted above. The quality of the
clarity or vagueness of his recall of these “tests” was
additionally noted.

4. Inability to verbalize on recovery certain letters
of the alphabet to which suggestions of amnesia had
been attached. Just prior to awakening, the subjects
were given the following instruction:


In a moment I am going to say five letters to you 
slowly at the rate of one a second. You will be 
able to hear them without strain and to remember 
them easily while you are relaxed. When I wake 
you up in a few minutes you probably will 
have great difficulty in remembering the letters I 
have told you. In fact, you will find it to be so
much of an effort to recall these letters that you 
will have no wish to do so. The letters are 
Beha EE a Peed apie A 


Amnesia was judged to be present when after
awakening the subject was unable to recall the
letters when told to recall the five letters given to
him during the trance.

5. Amnesia for posthypnotic suggestions given
during the trance. Instructions patterned after those
given by Weitzenhoffer and Hilgard (1959, pp. 23-
24, Form A) were used to establish this test of
amnesia. The exact instructions were as follows:


Remain deeply relaxed and pay close attention to
what I am going to tell you next. In a moment I
shall begin counting from one to five. You will
gradually wake up. By the time I reach “five”
you will be fully alert, in your normal state of
wakefulness. After you open your eyes, you will
feel fine. You will have no headache or other
aftereffects. A little time after you have awakened
I shall tap my pencil on the table. When I do,
you will get up from this chair and move to the
chair that is straight ahead of you, and sit in it.
You will do this, but forget that I told you to
do so. Ready, now: 1 . . . 2 . . . 3 . . . 4 . . . 5


Amnesia was judged present when the subject was
unable to recall the posthypnotic suggestion as given
during the trance.
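The decision rule for the amnesic state (at least three of the five criteria present) can be sketched as follows. This is an illustrative Python sketch only: the function and argument names are assumptions, and only the quantitative parts of Criteria 1 and 3 are coded, since the qualitative judgments described above cannot be reduced to a threshold.

```python
def amnesia_present(estimated_minutes, actual_minutes, felt_detached,
                    depth_tests_recalled, recalled_letters,
                    recalled_posthypnotic_suggestion):
    """Apply the three-of-five rule to one subject's posttrance record."""
    criteria = [
        # 1. Time estimate differs from the recorded time by 10 minutes or more.
        abs(estimated_minutes - actual_minutes) >= 10,
        # 2. Reported feeling of detachment (judged from the subject's remarks).
        felt_detached,
        # 3. Six or fewer of the depth-of-trance tests recalled.
        depth_tests_recalled <= 6,
        # 4. Unable to recall the five letters given during the trance.
        not recalled_letters,
        # 5. Unable to recall the posthypnotic suggestion.
        not recalled_posthypnotic_suggestion,
    ]
    return sum(criteria) >= 3


# Hypothetical subject: misjudges time by 15 minutes, reports detachment,
# recalls 9 tests, recalls the letters, forgets the posthypnotic suggestion.
example_decision = amnesia_present(20, 35, True, 9, True, False)
print(example_decision)
```

In this hypothetical record Criteria 1, 2, and 5 are met, so the subject would be classed as amnesic.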
t with the experimental subjects was then 
ntil 1 week later. 
One week later each subject in both the control
and experimental groups was rehypnotized and
administered the ACL while under as deep a level
of hypnosis as possible using the same instructions.
Upon awakening the ACL was readministered a
third time, and each subject was given the
Ideal Therapeutic Relationship Scale. Instructions
for the Ideal Therapeutic Relationship Scale were


The papers in front of you contain a number of
statements expressing the way in which a hypnotic
subject may feel toward a hypnotist. Would you
please indicate the way you feel about the relationship
you have just had with the hypnotist. Please
be as frank as possible since your answers will
constitute one of the important aspects of the

from any further participation in the
experiment, the experimental subjects were asked to
complete the abbreviated group booklet form of the
MMPI (Cottle, 1953). The instructions for this are
contained on the face-sheet of the booklet form and
are standardized for this purpose.

Table 1 summarizes the above procedures used in
this study.


TABLE 1

SUMMARY OF PROCEDURES USED WITH CONTROL AND
EXPERIMENTAL SUBJECTS

Session                  Experimental (N = 20)a              Control (N = 24)b
First                    ACL                                 ACL
                         Induction via eye fixation          Induction via eye fixation
                         Davis-Husband scaling               Davis-Husband scaling
                         Spontaneous amnesia indicated
                           by Weitzenhoffer criteria
Second (1 week later)    Second induction via eye fixation   Second induction via eye fixation
                         ACL: Under hypnosis                 ACL: Under hypnosis
                         ACL: Awake                          ACL: Awake
                         Ideal Therapeutic Relationship      Ideal Therapeutic Relationship
                           Scale (Fiedler)                     Scale (Fiedler)
                         MMPI

a . . . of spontaneous amnesia.
b Equated for age, sex, and college status.


TABLE 2

MEANS FOR GROUP, TEST SESSION, AND ADJECTIVE
CHECK LIST SCORES (SQUARE ROOT
TRANSFORMATION SCORES)

Condition                        SS        N     M value
Control group                  1298.57    216    6.01
Experimental group             1112.01    180    6.18
Initial test session            787.38    132    5.96
Hypnotic test session           834.02    132    6.32
Retest test session             789.18    132    5.98
Self-favorability ACL score    1147.09    132    8.69
Self-acceptance ACL score       803.85    132    6.09
Self-criticalness ACL score     459.64    132    3.48
RESULTS 


The reliability of the adaptation of the ACL
for the present experimental situation was established
by computing test-retest correlation coefficients
for the control and experimental groups.
Values varied from .72 to .93, and after the correlations
were transformed to standard scores no
significant difference at the .05 level was found
between experimental and control groups for the
three scoring categories (see Table 2). In agreement
with earlier work (Woolington & Markwell,
1962) test-retest influences were equal for both
groups and the adaptation of the ACL was
deemed sufficiently reliable for use in the present
study.

A Design VI analysis of variance technique
(Lindquist, 1956) using standard score data indicated
that attitudes toward the self did change
under hypnosis (see Figure 1). Statistically significant
differences beyond the .01 level were
found between the results for the initial, hypnotic,
and posttest sessions (see Figure 2 and Table 3).
The interaction between test sessions and
groups was also significant, p < .01. No significant
differences were found between the groups
or between score categories alone, or in the
interaction between scores and groups. The
interaction between test sessions and scores
approached significance (> 5.0%). The triple-order
interaction was significant at more than
the .01 level.


TABLE 3

MEANS FOR GROUP, TEST SESSION, AND ADJECTIVE
CHECK LIST SCORE INTERACTIONS (SQUARE
ROOT TRANSFORMATION SCORES)

Interaction and condition        SS       N    M value
Group × test session
  Control
    Initial                    433.92    72    6.03
    Hypnotic                   431.43    72    5.99
    Retest                     433.22    72    6.02
  Experimental
    Initial                    353.46    60    5.89
    Hypnotic                   402.59    60    6.71
    Retest                     355.96    60    5.93
Group × ACL score
  Control
    Self-favorability          625.74    72    8.69
    Self-acceptance            434.08    72    6.03
    Self-criticalness          238.75    72    3.32
  Experimental
    Self-favorability          521.35    60    8.69
    Self-acceptance            369.77    60    6.16
    Self-criticalness          220.89    60    3.68
Test session × ACL score
  Initial
    Self-favorability          377.87    44    8.59
    Self-acceptance            262.42    44    5.96
    Self-criticalness          147.09    44    3.34
  Hypnotic
    Self-favorability          381.71    44    8.68
    Self-acceptance            276.63    44    6.29
    Self-criticalness          175.68    44    3.99
  Retest
    Self-favorability          387.51    44    8.81
    Self-acceptance            264.80    44    6.02
    Self-criticalness          136.87    44    3.11


Fig. 1. Analysis of variance results between groups, test sessions, and ACL scores. (Under groups: C = control, E = experimental; under test sessions: I = initial, H = hypnotic, R = retest; under scores: SF = self-favorability, SA = self-acceptance, SC = self-criticalness.)

Fig. 2. Analysis of variance results for interactions. (See Figure 1 for definition of symbols.)

A further test of the significance of the differences
found in the analysis of variance was
made possible by performing t tests of the
means for the testing sessions. A t value of 2.51
was found between the initial and hypnotic
session means and a t value of 2.63 was found
between the hypnotic and posthypnotic session
means. These values are both significant, p < .05,
df = 42. A t value of .74 between the initial and
posthypnotic session means was not significant
even at the .10 level.

Pearson correlation coefficients between the
number of contradictory ACL answers given
under hypnotic and under waking conditions and
hypnotic trance depth scores support the findings
of differences in self-attitude as a function of
depth. The correlation of .61 for the 44 subjects
of the control and experimental groups combined
was significant. When the two groups were
analyzed separately the experimental group
showed a higher correlation, .82, than the control
group, .48, but the differences were not significant
when z transformations were performed. This
result is supported in part by the findings of the
tendency for such measures of depth scores to
correlate with changes in self-attitude reported
in earlier research (Woolington & Markwell,
1962).

Part of the effect of the subject’s relationship
to the hypnotist-experimenter on the above findings
was checked by correlating the subjects’
conception of their relationship to the hypnotist, 
given by their Ideal Therapeutic Relationship 
score, against both the number of hypnotic- 
waking contradictory ACL answers and their 
hypnotic depth scores. The Pearson coefficient 
of .55 between such Ideal Therapeutic Relation- 
ship scores and the number of contradictory ACL 
answers the subject gave under hypnosis when 
compared to his waking answers was significant, 
p<.01. These results indicate that the more 
the subject is “favorable” to a “helping” relation- 
ship with the hypnotic experimenter the more 
likely some alteration in such a measure of self- 
concept will occur. In addition, the correlation 
of .79 between the subject’s Ideal Therapeutic 
Relationship scores and their hypnotic depth 
scores was highly significant, p< .01. These 
findings are in agreement with considerable evi- 
dence that hypnotic trance depth is a function 
of the relationship conceived by the subject as 
existing between the experimenter and himself. 
These results also suggest that the role con- 
ceptions of the subject may play a significant 
part in creating the results found in the analysis 
of variance. That is, the more the subjects per- 
ceived their relationship to the hypnotist to be 
identical to that of an ideal therapeutic one the 
more they manifested a difference in their 
waking-hypnotic attitudes towards themselves, 
and the deeper they appeared to be capable of 
entering the trance state. Certain of their sponta- 
neous remarks supported this interpretation of 
the results. 

The MMPI data were helpful in providing a
partial answer to the question of why a given
subject’s contradictory ACL answers in the
hypnotic as compared to the waking state seemed
to vary in a specific direction (that is, to become
more self-accepting or self-critical in individual
cases while under hypnosis). The MMPI
profiles of those men and women who became
more self-critical under hypnosis can be seen to
differ from those whose attitudes became more
self-accepting (see Figures 3 and 4). The profiles
of those who became more self-accepting
compare very favorably with the known results
from a random sample of college men and women
measured by previous investigators (Cottle,
1953). The profiles of those who became more
self-critical, however, depart to a considerable
extent, generally in a direction suggesting that
they more often experience emotional distress,
even though technically their group scores remain
within the usual thirtieth to seventieth percentile
levels considered as the “normal” range.
As a group the hypnotically self-critical subjects
were more defensive and anxious with tendencies
to “neurotic-like” symptom complaints.

Fig. 3. MMPI profile comparisons of males showing different self-attitudes under hypnotic-amnesia state.

Fig. 4. MMPI profile comparisons of females showing


EFFECT OF CONTINUOUS RESPONDING ON THE ORDER EFFECT
IN PERSONALITY IMPRESSION FORMATION 1


RALPH H. STEWART 2
University of Portland


The effects of stimulus sequence length and mode of responding in personality
impression formation were studied. Sets of personality trait adjectives of 3
different lengths served as stimuli. In the final mode of responding, the
impression was given only after all adjectives were presented. In the continuous
mode, the cumulative impression was given after each successive adjective.
There was a primacy effect for the final mode of responding, and a recency
effect for the continuous mode. This recency effect, however, was due primarily
to 1 of the 2 replications. Strikingly similar order effects for all 3 sequence
lengths were obtained. 3 theoretical positions which consider the effect of mode
of responding were discussed. A linear operator model seemed most in accord
with the data.


1 The data for this study were collected at the
University of California, Los Angeles. The author
wishes to thank Norman H. Anderson for suggesting
this problem, and for his continued advice and
encouragement.

2 Now at Cedars-Sinai Medical Center, Los
Angeles, California.


The pro-con versus con-pro paradigm has seen
considerable use in attempts to elucidate the
role of variables governing the relative importance
of earlier and later information in impression
formation and opinion change (e.g., Anderson,
1959; Hovland, 1957; Luchins, 1958; Miller
& Campbell, 1959). As yet, however, no clear
picture seems to have emerged. One factor that
has varied among the previous studies is the
amount of information, or length of the stimulus
sequence. However, the effect of this variable
cannot be assessed because of the different types
of stimuli used in the various studies.

The work of Asch (1946) has called attention 
to the use of personality trait adjectives, which 
constitute a large body of relatively homogeneous
stimuli. Anderson and Barrios (1961) used
such stimuli in sets of equal length and obtained
a primacy effect. One purpose of the present 
experiment was to utilize these same stimuli to 
study the effect of different sequence lengths on 
the primacy effect. 

All impression formation studies using 
sequences of more than two stimuli have meas- 
ured the impression at the end of the sequence. 
The primacy effect obtained in such studies could 
be, in part, a function of this final mode of 
responding. An alternative mode is continuous 
responding, in which the accumulating impres- 
sion is measured successively after each stimulus 
presentation.

There are different hypotheses on the possible 
effect of such continuous responding. Asch’s 
(1946) “directed impression” hypothesis states 
that information obtained early influences the 
meaning of later information. From this, a
primacy effect is predicted regardless of the 
response mode. An attention hypothesis (Ander- 
son & Hubert, in press) suggests that the primacy 
effect is obtained for the final mode of respond- 
ing because the subject attends less to later 
information. By responding continuously, how- 
ever, the subject must pay equal attention to 
the later information. The mathematically derived 
linear operator model of Anderson and Hovland 
(1957) also predicts recency for the continuous 
mode of responding, at least assuming constant 
model parameters.
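Why a linear operator model with constant parameters predicts recency under continuous responding can be seen in a short simulation. The update rule used below (each new impression moves a fixed fraction c of the way from the current impression toward the value of the incoming adjective) is the usual form of a linear operator and is stated here as an assumption, since the equation is not given in this article; the scale values, starting point, and c are likewise hypothetical.

```python
def linear_operator(sequence, start=4.5, c=0.3):
    """Return the impression after each stimulus under a constant-c update."""
    impression = start
    history = []
    for value in sequence:
        # Move a fixed proportion c of the distance toward the new stimulus.
        impression = impression + c * (value - impression)
        history.append(impression)
    return history


H, L = 7.0, 2.0  # hypothetical scale values for favorable and unfavorable adjectives

hl_curve = linear_operator([H, H, L, L])  # favorable adjectives first
lh_curve = linear_operator([L, L, H, H])  # unfavorable adjectives first

# Order effect taken in the direction LH minus HL: positive means recency.
model_order_effect = lh_curve[-1] - hl_curve[-1]
print(model_order_effect > 0)
```

With a constant c, the k-th of n stimuli carries weight c(1 - c)^(n - k), so later adjectives always weigh more than earlier ones and the final LH impression exceeds the final HL impression.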

Thus, the present study was designed to study 
the influence of different lengths of communica- 
tions on the primacy effect and the effect of mode 
of responding on the final impression.


METHOD 
Subjects 


Subjects were 64 male and female volunteers who 
were fulfilling a class requirement in introductory 
psychology. Two identical replications of the experi- 
ment were run at different times of the school year. 
Within each replication, subjects were randomly 
assigned to the experimental conditions. 


Stimuli 


The stimuli were obtained in the manner described 
by Anderson and Barrios (1961). Personality trait 
adjectives of high (H) and low (L) rated likable- 
ness were used to construct the sets. The first half 
of an HL set consisted of favorable, or H, adjectives 
and the second half consisted of unfavorable, or L, 
adjectives. Similarly, the first and second halves of 
an LH set consisted of L and H adjectives, respectively.
Random (R) sets consisted of L and H
adjectives in unsystematic order. These were interspersed
among the HL and LH sets in order to
reduce the possible development of a tendency to
respond to the HL and LH patterning.


Procedure 


Each subject was told that he would be read a
number of sets of adjectives, each set describing a
different person with each adjective in a set given
by someone who knew the described person well.
The subject was told to respond verbally, rating the
person on an 8-point scale of likableness which
ranged from 1, highly unfavorable, to 8, highly
favorable. This scale was typed on a card to which
the subject could refer during the experiment.
There were two modes of responding. The final
(F) mode was identical to that described by
Anderson and Barrios (1961) in which the subject
responded only after all adjectives in a set were read.
For the continuous (C) mode, the subject responded
after each additional adjective of a given set. The
instructions emphasized that each successive response
should be based on all the information accumulated
up to that point and, as a consequence, the subject
should change his response when this seemed called
for. In both response modes, it was emphasized that
there was no right or wrong, that the subject’s own
impression was what mattered, and that he should
simply give his prevailing impression each time. The
adjectives were read at the rate of one every 3
seconds.


Design 


Sets of 4, 6, and 8 adjectives were constructed
randomly with restrictions imposed by the type of
set and by the requirement that no adjective occur
twice in a given set. Thus, there were nine types of
set: HL4, LH4, HL6, LH6, HL8, LH8, R4, R6,
and R8. A typical HL8 set was: perfective, skillful,
devoted, talented, domineering, aloof, malicious,
faultfinding. Four main blocks of 18 sets each were
formed such that each block contained two sets of
each of the above nine types. Two blocks of nine
sets each were also used for preliminary practice.

Each subject responded to all sets of each of the
four main blocks. Alternate blocks for each subject
were given in the two response modes, C and F.
Thus, each subject served in both response modes.
Moreover, since each main block contained two sets
of each type, each subject judged sets of each of
lengths 4, 6, and 8.

Half the subjects received the four main blocks
in the order CFCF; the other half received them
in the order FCFC. In addition, the main blocks
of sets, as well as the practice blocks, were counterbalanced
for serial position over trials and for mode
of responding. Subjects were told before each block
whether the C or F mode of response would be used.
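The bookkeeping of this design can be checked with a short sketch. The Python below only counts set types per block and per response mode; it does not reproduce the random construction of the adjective sets or the counterbalancing of serial position, and all names in it are illustrative.

```python
from collections import Counter
from itertools import product

# Nine set types: three presentation orders crossed with three set lengths.
SET_TYPES = [f"{order}{length}"
             for order, length in product(("HL", "LH", "R"), (4, 6, 8))]


def main_block():
    """One main block: two sets of each of the nine types (18 sets)."""
    return [set_type for set_type in SET_TYPES for _ in range(2)]


def session(mode_order):
    """Four main blocks given in alternating modes, e.g. 'CFCF' or 'FCFC'."""
    return [(mode, main_block()) for mode in mode_order]


def sets_per_mode_and_type(mode_order):
    counts = Counter()
    for mode, block in session(mode_order):
        for set_type in block:
            counts[(mode, set_type)] += 1
    return counts


cfcf_counts = sets_per_mode_and_type("CFCF")
print(len(main_block()), cfcf_counts[("C", "HL8")], cfcf_counts[("F", "LH4")])
```

Each subject thus meets four sets of every type under each response mode, whichever of the two block orders is used.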


RESULTS 


The data for Condition C portray the course
of development of the impression over successive
adjectives within a set. These data are shown
in Figure 1, in which each point is the mean
impression averaged over all subjects and blocks.
The course of development of the impression is
seen most clearly in the 8 adjective sets. Thus,
the HL8 curve is initially and increasingly favorable
as the 4 initial H adjectives are presented;
it then gradually declines and becomes less
favorable as the 4 successive L adjectives are
presented. The LH8 curve shows the opposite
trend. The curves for the 4 and 6 adjective sets
show the same characteristics.

Note that the two curves for each set length
cross over at the last adjective of the set. This
graphically represents a recency effect: in the
HL sets, the L adjectives have a greater influence
than the H adjectives; similarly, in the LH
sets, the H adjectives have a greater influence
than the L adjectives.
In order to analyze the effects of order of
presentation, each subject was given a score
equal to the difference between the response
to LH and HL sets under a given condition.
For Condition C, of course, this order effect
is based on the terminal response to the
set. Each subject received six such order-effect
scores since there were six conditions that
resulted from the use of two modes of responding
(C and F) and three lengths of set (4, 6, and
8 adjectives). Each of these six order-effect scores
for each subject was based on the difference
between four LH sets and four HL sets. The
difference was taken in the direction, LH-HL,
so that a positive score means a recency effect.

The means of the order-effect scores are presented
in Table 1. [...] factorial design in which two
factors correspond to repeated measures (Type VI
design; Lindquist, 1953).3 Mode, replication and
their interaction were significant, with F ratios of
15.35, 5.19, and 4.52, respectively, each with
df = 1/60. No other effects approached significance.


Fig. 1. Impression development in Condition C for sets of three different lengths. (Ordinate: mean impression; abscissa: successive adjectives within sets; separate curves for HL and LH presentation orders.)
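
The order-effect score defined above can be illustrated with a minimal sketch. The function name and the ratings below are hypothetical and chosen only for illustration; the rating scale is not taken from the experiment. The score is the mean response to the four LH sets minus the mean response to the four HL sets, so that a positive value indicates recency.

```python
from statistics import mean


def order_effect(lh_responses, hl_responses):
    """Order-effect score in the direction LH - HL.

    A positive score means a recency effect, a negative score a
    primacy effect.
    """
    return mean(lh_responses) - mean(hl_responses)


# Hypothetical responses of one subject to four LH and four HL sets
# under a single condition (illustrative values only).
lh_responses = [5.5, 6.0, 5.0, 5.5]
hl_responses = [4.5, 5.0, 4.0, 4.5]

score = order_effect(lh_responses, hl_responses)
print(score)
```

Under Condition C the inputs would be the terminal responses to each set; under Condition F they would be the single responses given after each set.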

The significant interaction requires separate 
consideration of the F and C conditions. More- 
over, it is necessary to test directly for signifi- 
cant primacy and recency effects, rather than 
simply for differences between conditions. These 
tests are of the null hypothesis that the true 
mean is zero. This more detailed elucidation of 
the data will now be taken up. 

For Condition F, Table 1 shows a primacy 
effect in both replications, and both are signifi- 
cant (Replication I, F = 10.79; Replication II, 
F = 4.15). Although the magnitude of the primacy
effect was somewhat smaller in the second
replication, the difference between the two 
replications was not significant. 

For Condition C, Table 1 shows a slight, non- 
significant primacy effect for Replication I 
(F = .10), and a sizable, significant recency effect 
for Replication II (F = 9.71). The overall recency
effect of .434 scale points is significantly
greater than zero, but the difference between the 
two replications is significant. This difference is 
also the locus of the interaction between mode 
and replication noted above. 

Finally, it is interesting to note that the order 
effect was about the same for the sets of various 


3 The complete analysis of variance tables have
been filed with the American Documentation Institute.
Order Document No. 8184 from ADI Auxiliary
Publications Project, Photoduplication Service,
Library of Congress, Washington, D. C. 20540. Remit
in advance $1.25 for microfilm or $1.25 for photocopies
and make checks payable to: Chief, Photoduplication
Service, Library of Congress.
TABLE 1


MEAN IMPRESSION ORDER EFFECT PER SET AS A
FUNCTION OF REPLICATIONS AND MODE
OF RESPONDING


Response              Replication
mode              I         II        M
Continuous      -.096      .963      .434
Final           -.465     -.289     -.378
M               -.281      .337      .028


Note.—Entries are differences between response to HL and 
LH presentation orders. A negative score means a primacy 
effect; a positive score means a recency effect. 


lengths. For Condition C, this can be seen in 
Figure 1 in which the recency effect is the dif- 
ference between the LH and the HL terminal im- 
pressions for each set length. For Condition F, the 
primacy effect was essentially constant over set 
length. Although it is difficult to accept a null
hypothesis, this result is based on a respectable 
N and on the smallest of the within-subjects 
error terms. The standard error for comparing 
the mean order effect for sets of different 
lengths is .052. 
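
As an arithmetic check, the marginal means of Table 1 can be recomputed from its four cell entries. The sketch below assumes equal weighting of the cells and allows for rounding of the published three-decimal values; the variable names are illustrative only.

```python
# Cell means from Table 1 (order effect, LH - HL).
cells = {
    "Continuous": {"I": -0.096, "II": 0.963},
    "Final": {"I": -0.465, "II": -0.289},
}

# Marginal means as printed in Table 1.
reported_rows = {"Continuous": 0.434, "Final": -0.378}
reported_cols = {"I": -0.281, "II": 0.337}
reported_grand = 0.028

# Recompute the margins, assuming each cell carries equal weight.
row_means = {mode: sum(v.values()) / len(v) for mode, v in cells.items()}
col_means = {
    rep: sum(cells[mode][rep] for mode in cells) / len(cells)
    for rep in ("I", "II")
}
grand_mean = sum(row_means.values()) / len(row_means)

TOLERANCE = 0.0015  # allows for rounding in the printed table

for mode, value in row_means.items():
    print(mode, round(value, 4), abs(value - reported_rows[mode]) < TOLERANCE)
for rep, value in col_means.items():
    print(rep, round(value, 4), abs(value - reported_cols[rep]) < TOLERANCE)
print("grand", round(grand_mean, 4))
```

The recomputed margins agree with the printed ones to within rounding, which supports the reading of the table entries given above.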


Discussion 


The primacy effect for the present Condition F 
is consistent with previous work. Both Asch 
(1946) and Anderson and Barrios (1961) had 
their subjects respond only after all adjectives 
had been read, and both obtained a primacy 
effect. In contrast, the present Condition C, in 
which judgments were made after each successive 
adjective of a set, yields a recency effect.

A reasonable explanation of the difference be- 
tween Conditions C and F is in terms of the 
attention hypothesis mentioned in the introduc- 
tion. Thus, it is supposed that in Condition C 
the subject pays equal attention to each adjective 
in a set since a response is required each time 
an adjective in a set is presented. In Condition F, 
however, it is supposed that the subject pays 
progressively less attention to the later adjectives 
of a given set. 

It should be clear that this attention hypothe- 
sis predicts a primacy effect in Condition F. If 
the subject pays progressively less attention to 
the later adjectives in a set, then the later adjec- 
tives will carry less weight in the final impression 
than will the earlier adjectives. If the earlier 
adjectives carry the greater weight, then a pri- 
macy (first impression) effect must of course 
be expected. 

At first glance, it might seem that the attention
hypothesis should predict neither primacy nor
recency in Condition C rather than the recency
effect that was obtained. However, a mathematical
analysis of this problem (Anderson, 1959;
Anderson & Hovland, 1957) shows that if the
“susceptibility” (which is here interpreted in
terms of attention) of the subject does not
decrease over successive adjectives within a set,
then a recency effect must be expected.
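
The argument can be illustrated numerically with a minimal sketch of a proportional-change (linear operator) rule, in which each adjective moves the impression a fraction of the way toward that adjective's scale value. The update rule as written here, the scale values of +1 and -1, the starting impression of 0, and the particular weights are all assumptions made for illustration; they are not taken from the cited analyses.

```python
def linear_operator(values, weights, start=0.0):
    """Return the impression after each successive adjective.

    Each adjective moves the impression a fraction `weight` of the way
    toward that adjective's scale value.
    """
    response = start
    curve = []
    for value, weight in zip(values, weights):
        response = response + weight * (value - response)
        curve.append(response)
    return curve


def order_effect(set_length, weights, high=1.0, low=-1.0):
    """Terminal LH impression minus terminal HL impression."""
    half = set_length // 2
    hl_set = [high] * half + [low] * half
    lh_set = [low] * half + [high] * half
    return linear_operator(lh_set, weights)[-1] - linear_operator(hl_set, weights)[-1]


# Constant susceptibility: every adjective receives the same weight.
constant_weights = [0.3] * 8
# Declining susceptibility: progressively less weight on later adjectives.
declining_weights = [0.5 * 0.6 ** k for k in range(8)]

for n in (4, 6, 8):
    print(n, order_effect(n, constant_weights), order_effect(n, declining_weights))
```

With constant weights the order effect is positive (recency) at every set length, because each new adjective discounts all earlier ones; with sufficiently declining weights it is negative (primacy). The same function applied to a run of H adjectives gives a negatively accelerated growth curve.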

The present results are inconsistent with Asch’s 
(1946) interpretation of his primacy effect in 
terms of “directed impression” since this formu- 
lation would predict that a primacy effect should 
be obtained in Condition C as well as in Condi- 
tion F. However, with the addition of the attention
hypothesis, both the primacy in Condition F
and the recency in Condition C are consistent 
with the linear operator model approach of 
Anderson and Hovland. 

Additional evidence in favor of the linear 
operator model may be obtained from its pre- 
dictions as to curve shape for Condition C. According
to this formulation (Anderson, 1959) the
response curves should be simple growth curves 
over a sequence of H adjectives, and simple 
decay curves over a sequence of L adjectives. 
Inspection of the curves of Figure 1 indicates 
that this is indeed the case. 

It should be noted, however, that in one
respect the generality of the above results and
discussion is limited by the replication interaction.
The primacy effect under Condition F is
essentially the same for both replications and is
consistent with the previous research (Anderson
& Barrios, 1961; Asch, 1946). In both replications
also, there is greater primacy under
Condition F than C which can be handled by
the attention hypothesis. The difficulty in
interpretation arises because under Condition C the
recency effect is entirely due to Replication II,
the order effect being essentially zero for
Replication I.

It is difficult to explain this discrepancy in
Condition C since there were no apparent
changes in procedure between the two replications
except that they were run at different times of the
year. In the pilot work, it was found that, due
to ambiguity in the instructions, some subjects
were responding in terms of the last presented
adjective alone. To check whether this had
occurred in the main experiment, and perhaps
differentially in Replication II, six curves similar
to those of Figure 1 were plotted for each
individual subject. Except for a few deviant
subjects, the shapes of these individual curves were
similar to the average data of Figure 1. This
indicates that the subjects were in fact responding
in terms of the accumulating information. It
[...], of course, that the curve shapes of
Figure 1 are not artifacts of the averaging [...].

[...] this limitation, the linear operator
interpretation is favored by the present [...].
Replication II clearly supports its [...]
and opposes the Asch predictions, [...]
Replication I is somewhat opposed to [...]
would predict. Moreover, the curve
shapes of Figure 1 are consistent with the
model formulation. The results thus tentatively
favor the model prediction but further
work along these lines will be necessary to
[...] the issue.


NEED ACHIEVEMENT AND THE RESUMPTION OF
INCOMPLETED TASKS


BERNARD WEINER
University of Michigan


Ss classified according to strength of achievement motivation were given 20
tasks and were interrupted at ½ the tasks before completion. ½ the Ss were
then given interpolated success experiences, and ½ were given interpolated
failures. Ss classified as high in n Achievement spontaneously tended to resume
the interrupted tasks following failure but not following success; Ss classified
as low in n Achievement tended to resume the tasks following interpolated
success but not after failure. The results support previous findings in the area.
The behavioral models as presently formulated cannot account for these [...]

related goals may have substitute value, that is, 
may result in a decrement in the strength of 
the persisting tendency to strive for the original 
goal.

The starting point for the experimental investigation
of substitution was Ovsiankina’s (1928)
demonstration that there is a strong tendency 
to resume tasks interrupted before completion. 
Lissner (1933) showed that there is a decrease
in the strength of the tendency to resume
previously interrupted tasks following the attainment
of related goals; he also demonstrated
that the decrease in the tendency to resume is a
positive function of the degree of difficulty of 
the substitute action. Mahler (1933) found that 
the degree of reality of the substitute activity 
is also an important determinant of substitute 
value, and Henle (1944) demonstrated that 
substitute value is a positive function of the 
valence of the substitute task. In these studies 
the behavioral criterion for substitute value 
was a decrease in the amount of spontaneous 
resumption of previously interrupted tasks. 
Other investigators have suggested that the 
attainment of goals similar to a desired goal 
will lead to an increase rather than a decrease 
in the resultant tendency to strive for that goal. 
Murray (1954) found a greater frequency of 
aggressive responses toward significant authority 
figures as a patient progressed through psychotherapy.
He employed Miller’s (1944) approach-avoidance
conflict model to explain these results.
Murray suggested that during therapy sessions 
hostile responses directed toward individuals 
similar to a disliked authority figure were not 
punished. This was represented conceptually as 
a lowering of the avoidance gradient, which 
resulted in a subsequent increase in the num- 
ber of aggressive responses made toward the 
originally feared authority figure.

The apparent contradiction between these 
studies suggests that the enhancing or decre- 
mental effect of goal attainment may be related 
to the relative strength of the approach and 
avoidance tendencies to strive for the original 
goal. It is postulated that in situations where 
greater approach than avoidance motivation is 
aroused, attainment of goals similar to a desired 
goal will lead to a decrease in the resultant 
motivation to approach the original goal. Thus 
goal attainment will have substitute value. 
However, where the stimulus situation elicits 
greater avoidance than approach motivation, at- 
tainment of goals similar to a desired goal will 
increase the tendency to strive for the original 
goal. In this situation goal attainment will have 
instigating rather than substitutive properties. 

Previous research (Atkinson, 1957) has indi- 
cated that individuals classified as high in need 
for achievement (n Ach) have a greater tend- 
ency to approach than avoid achievement- 
oriented activities, while individuals classified 
as low in n Ach are likely to be anxious about 
failure and avoid problem-solving tasks. It is
hypothesized therefore that in situations where
achievement motivation is aroused, individuals
high in n Ach will not spontaneously resume
previously interrupted tasks following success
experiences (goal attainment), but will exhibit
resumption following interpolated failures.
Conversely, individuals classified as low in n Ach
will tend to resume previously interrupted tasks 
following interpolated success experiences, but 
not following interpolated failures. 


PROCEDURE 


Subjects were 30 male undergraduates at the
University of Michigan. All were enrolled in the
introductory psychology course and were required to
participate in psychological experiments.

Subjects participated in two experimental sessions.
During the first hour they were administered a The- 
matic Apperception Test (TAT), Picture Series 2, 33, 
52, 24 (Atkinson, 1958). This was scored for n Ach 
according to the method of content analysis
developed by McClelland, Atkinson, Clark, and Lowell
(1953). Interscorer reliability for n Ach was r = .88.

During the second hour subjects found three book- 
lets placed on their desks. Two of these were identi- 
cal “Zeigarnik” booklets. The booklets contained 20 
different simple puzzle tasks, for example, connecting 
dots, anagrams, etc. There were two forms of every 
puzzle, a long and a short form. Long puzzles
ordinarily cannot be completed within the allotted time
period, while short puzzles can be completed. Four
random sequences of puzzles were selected so each
booklet contained 10 tasks of the long form and 10
of the short form.


The third booklet on the subjects’ desks contained 
a series of geometrical designs. The task required 
tracing over the design without lifting the pencil
from the paper or retracing a line. There were two 
different booklet forms, one containing 10 soluble 
puzzles (success condition) and one containing 3 
soluble and 7 insoluble puzzles (failure condition). 

Subjects were first given the Zeigarnik booklet.
Seventy-five seconds were allowed for each task.
The condition of administration was ego involving
(Atkinson, 1953). Subjects then were given the
“tracing” booklet. Before starting this task they
were told:


You may find some of these puzzles difficult. Previ- 
ous norms indicate that the average college student 
can complete only about half of the puzzles, or 5
of the 10.


Subjects were allowed to work on the puzzles for 10
minutes. At the end of this time the experimenter
had subjects count the number of correctly completed
puzzles and write that number on their booklet.
The experimenter then said:


There is one further task to do this hour, but I
need a few minutes to organize the material. In
the past individuals have requested to look
at the first test booklet, so I had the secretary
make some extra copies and I have put them on

* The author wishes to thank Howard Eggeth for
his assistance in the reliability check. 


TABLE 1


RELATION OF TASK RESUMPTION TO ACHIEVEMENT
MOTIVATION AND SUCCESS OR FAILURE CONDITION


                  Motive classification
                  High            Low
                  Resume          Resume
Condition       Yes    No       Yes    No
Success          0      7        7      1
Failure          3      2        2      3


your desks. While I put together this material you 


RESULTS


Twenty-five subjects arrived for both experimental
hours. All subjects in the success condition
correctly completed at least nine of the
designs, while subjects in the failure condition
completed no more than three of the designs.
Results indicate that subjects high (above the
median) in n Ach more frequently resumed
interrupted tasks following interpolated failure
as opposed to interpolated success experiences
(p = .045, one-tailed, Fisher exact test). Subjects
low in n Ach more frequently resumed
tasks following success rather than failure
experiences (p = .07, one-tailed, Fisher exact test; see
Table 1). There is a significant Conditions × n
Ach interaction (χ² = 6.00, p < .025).
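
The one-tailed Fisher exact probability reported for the high n Ach group can be reproduced with a short sketch. It assumes the high n Ach cells of Table 1 read 0 and 7 (success: resume yes, no) and 3 and 2 (failure: resume yes, no); the failure "yes" count of 3 is an assumed reading, chosen because it makes the cell counts total the 25 subjects who attended both hours. The function name is illustrative.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """One-tailed Fisher exact probability for the 2 x 2 table

        a  b
        c  d

    summing the hypergeometric probabilities of all tables with the
    same margins whose upper-left cell is a or smaller.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    smallest = max(0, col1 - row2)  # smallest upper-left cell the margins allow
    numerator = sum(
        comb(row1, k) * comb(row2, col1 - k) for k in range(smallest, a + 1)
    )
    return numerator / comb(total, col1)


# High n Ach group (assumed cell counts, see above):
# success: 0 resumed, 7 did not; failure: 3 resumed, 2 did not.
p_high = fisher_one_tailed(0, 7, 3, 2)
print(round(p_high, 3))
```

With these counts the probability is 10/220, which rounds to the reported .045.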


DISCUSSION


Prior studies frequently reveal interaction effects
between individual differences in anxiety
and n Ach and various experimental conditions.
Atkinson (1953) found subjects classified as
high in n Ach recall more incompleted tasks
than subjects low in n Ach when the conditions
are ego involving, but not when the conditions
are relaxed. Lucas (1952) found that following
continual failure experiences subjects classified
as high in anxiety suffered decrements in level
of performance, while subjects low in anxiety
improved their level of performance. Child and
Whiting (1950) found that subjects with histories
of success reported they more frequently
[...] following attainment of a difficult goal,
whereas subjects with histories of failure who
[...] to encounter frustration reported they
more frequently strive for new goals following
goal attainment. The above investigations used
recall, level of performance, and instigation of 
new activities as dependent variables. This study 
replicates and extends these results, employing 
task resumption as the dependent variable. 

As indicated by Atkinson (1953) the subjects 
used by Lewin and his associates were frequently 
“volunteers.” Research cited by Atkinson (1953) 
indicates volunteer subjects are more likely to 
be highly motivated to achieve success. It is 
expected that under these conditions attainment 
of goals related to a desired goal will have 
substitute value. 

Murray (1954), on the other hand, was 
working with clinical patients. It is likely that 
for these individuals the fear of making an ag- 
gressive response toward a significant authority 
figure exceeded their aggressive approach tend- 
ency. It is expected that under these conditions 
attainment of goals related to a desired goal will 
increase the tendency to strive for the original 
goal. 

These interactions indicate that behavioral 
models proposed to account for these data will 
have to include both individual difference com- 
ponents and determinants which capture the dif- 
ferential reactions which these individuals exhibit 
under success and failure conditions. As Atkinson
(1960) and Child (1954) have indicated, the 
drive model used by Spence and his associates 
(Spence, 1958) has not been able to account 
for the increment in performance which sub- 
jects low in anxiety exhibit under stressful 
(failure) situations. The model for achievement- 
oriented behavior proposed by Atkinson (1957) 
can handle data which reveal an increment in 
performance under ego-involving situations for 
individuals low in anxiety, but it does not ac- 
count for the increment in the tendency to 
approach achievement-related goals which high- 
anxiety individuals exhibit following success 
experiences. A model which can adequately ac- 
count for the interaction effects reviewed above 
remains to be formulated.
AGGRESSION AND IDENTIFICATION


DANIEL M. OGILVIE 


Harvard University 


The following hypothesis was presented in an attempt to account for the 
phenomenon of strengthened ties between an aggressor and his target: Ag- 
gression expressed towards an ambivalent target is a means of incorporating 
characteristics which the target has and the aggressor wants. This hypothesis 
was tested in a small group setting by allowing Ss to express aggression 
towards a nonaggressive stooge who had been monetarily rewarded for
performing well on a difficult predictive task. Control groups received the
same treatment except for the fact that their aggression was reduced by 
having the stooge agree with their complaints. Instances of verbal and physical 
behavior copied from the stooge were observed and recorded and the results 
supported the hypothesis. That is, aggressive Ss copied the stooge’s behavior 
more often than did nonaggressive Ss. A 2nd hypothesis which stated that 
the more intense the aggressive act, the sooner the act of identification will 
occur was also supported. These findings were then related to Burton and 
Whiting’s status-envy hypothesis of identification.


Aggressive behavior on the part of an individ- 
ual is generally viewed as a disruptive act when 
directed towards another individual. However, it 
has been found that aggressive, disruptive acts 
sometimes correlate with strengthened ties be- 
tween the aggressor and his victim. For instance, 
Walter Miller found that most of the aggressive 
acts of members of a delinquent gang were ex- 
pressed towards their own members. Miller, 
Geertz, and Cutter (1961) state that aggression 
“served as an effective device for producing and 
maintaining group cohesion [p. 296].” Green 
(1933) in her study of preschool children,
reported that quarreling correlates with friendship.
In a study of B-29 crews, Torrance (1955) found
that “effective” crews perceived less harmony,
more discord, friendship, and pleasure in projective
group pictures than did “ineffective” crews.
Finally, Pepitone and Reichling’s (1955) data
show that groups in which members expressed
more hostility towards an instigator rated the
instigator more favorably than did groups which
had withheld their hostility.

To account for this phenomenon, the following
hypothesis is offered: Aggression expressed
towards an ambivalent target is a means of
incorporating characteristics which the target has
and the aggressor wants.1 In other words,
aggression, under certain circumstances, leads to
identification with the aggressed upon object.
Inherent in this hypothesis is a definite time
ordering of events; that is, aggression not only
correlates with cohesiveness, understanding (or, as
the author proposes, identification) but
aggression leads to identification.2
This hypothesis was derived mainly from
Freudian and neo-Freudian thought. Erikson
(1951) lays heavy emphasis on the importance
of early childhood experience in establishing the
necessary groundwork for later identifications.
At the earliest stage of development, the child’s
mouth is the focus of a general mode of approach
to life, namely, the incorporative approach. That
is, by feeding, the child satisfies his most basic
need, his nutritional need. Freudian theory
implies that the satisfactions obtained in this early
stage are not entirely forgotten even though
much of this active oral behavior is labeled
“aggressive” after the child has passed his infant
years. Possibly the most direct expression of this
stage by adults is seen in the totemism of some
primitive societies. Freud (1938) contended that
the totem was a substitute for the father and
that killing and eating the totem resulted in,
among other things, the appropriation of the
father’s properties. This “appropriation” is partly
brought about by fantasy, the excessive valuation
of one’s own wishes. The process of fantasy
is usually thought of as occurring in the minds
of children but is certainly not absent in the
minds of adults. Adult fantasy can be described
psychoanalytically as a reactivation of the primary
process when the ego does not produce
satisfactory results.

These considerations lead to the conclusion
that when a person feels a need to become like
another, aggressive behavior (with the aid
of fantasy) may occur in a regressive attempt to
[...] that person. Finally, the target of aggression
is ambivalent in the sense that the attacker
admires his target’s position, but hates him for
holding that position.


1 This hypothesis was developed through the author’s
[...] with Theodore M. Mills presently
at Yale University. The experiment to be described
was encouraged and criticized by Philip E. Slater
presently at Brandeis University.

2 It should be noted that this sequence of identification
following aggression is not necessarily supported
by the above-mentioned studies. Usually, the
assumption in studies of this nature is that cohesiveness
precedes aggression. However, in much of this
work, observations which would determine the
sequence were not made.

This line of reasoning, based on sorely un- 
tested theory, cannot be given much weight 
without a solid fund of empirical evidence. In 
an attempt to partially fill this gap, the following 
experiment was designed to test the above- 
mentioned hypothesis. 


METHOD 
Subjects 


Thirty-six college freshmen were divided into nine
groups (four experimental groups and five control
groups) with four subjects in each group. Subjects
were obtained through a college student employment
office and were paid $1.50 for the session, which
lasted one hour. The fifth member of the group was
a stooge, the same stooge for each group.


Procedure 


Subjects were seated at desks arranged in an L 
shape in an experimental room equipped with a one- 
way mirror. Three subjects sat along the long side 
of the L and the other subject plus the stooge sat on 
the short side of the L. Subjects were informed that 
the session was being taped and that they were being 
observed through the one-way mirror. 

When the subjects were seated, they were given 
the following instructions: 


This is an experiment concerning individual and 
group predictions. The general hypothesis is that 
group predictions should be better than individual 
predictions. Although experiments of this nature 
have been conducted before, I feel that there are 
certain aspects which have not been sufficiently 
covered. 

Today we have a subject named Martin who 
will be the focus of your attention. Before I give 
you your instructions as to your task, I want you 
to read aloud the parts of Martin’s life history 
which I have given you. You should pay close at- 
tention to this information because it may be use- 
ful to you. 


Subject 1 then read Card 1, Subject 2 read Card 2,
etc. The information the subjects read was fictitious.
The bulk of it had been taken from the case of
Grope used by Murray (1947) to illustrate what he
calls an “Icarus Complex.” The rest of the history
was made up by the author. An example of the type
of information provided on these cards is:

Since Martin had gone to nursery school when 
he was three, he did not have to attend kinder- 
garten. He has few memories of his early years at 
primary school, except that he showed talent for 
drawing in the second grade and at home he read 
and reread the Oz books. . . .


After the subjects finished reading these cards, 
they were instructed as follows: 


The information you have just read was given
to us when Martin was attending Harvard as an 
undergraduate. He was 20 years old. When he was 
26 he was given a Value Profile Test. He had just 
returned from service in the armed forces. Before 
he took the test, Martin was instructed that it 
was not a test of intelligence or ability; that it 
was a way for him to explore and describe his own 
particular values and beliefs. He was told to an- 
swer according to the way he felt. 

Martin scored each item on a 0-100 basis, 50 
being completely neutral. A low number, such as 
20, would indicate a strong disagreement with an 
item. If he slightly agreed with another item, he 
might put down 52 or 55, and so forth. In other 
words, a number on either side of 50 would indi- 
cate various strengths of agreement or disagree- 
ment depending upon the direction and distance of 
his answer from 50. 

Your task, then, will be to predict Martin’s re- 
sponse to several items that were on the value test. 
To give you some practice, in front of each of you 
there is a card with an item on it. Beginning with 
Subject Number 1, read the item aloud and read 
the response given by Martin. 


The following is one of the “practice” items which 
the subject read: “The best measure of a man’s true 
worth is the quality of his friendships.” Martin’s re- 
sponse: 37. The items used in the experiment had 
been taken from the Bales-Couch Value Profile Test 
which had been administered to a graduate student 
whose responses, upon his own admission, were often 
inconsistent. The usual “strongly disagree . . . disagree
. . . strongly agree” scale had been transposed into
the 0-100 scale described above. The 20 items selected
for the experiment were those which had been found 
to be the most difficult to predict accurately. 

After the subjects had read the practice items, 
they were told:


Now that you have had some practice, we are 
about to begin. But first, to add a little more inter- 
est to the game, there will be some rewards in- 
volved. On each of the first 10 items, you are to 
predict Martin’s response individually. The one 
who comes closest to the actual score will receive 
a quarter. It is possible, then, for one of you to 
earn an additional $2.50, if you come closer than 
the rest on each of the 10 items, 

The last 5 items will be predicted by the group. 
That is, the group will have to agree on one an- 
swer for each item. However, this time the one 
who has received the most quarters on the first 10 
items will not participate. Each group member will 
receive a quarter if you come within 3 points either 
way of Martin’s answer. If you miss, the winner 
of the first 10 items (who will not be participat- 
ing) will receive a quarter. 


Subjects were each given a sheet of paper contain- 
ing the 15 items which they were to predict. Each 
subject was also provided with a pencil and 10 slips 
of paper on which he would write his predictions.
Subjects were then instructed to read Item 1, write
their prediction on a slip of paper, and to place the 
paper face down in front of them. The experimenter 
collected the predictions and placed them on the 
blackboard. The experimenter instructed Subject 1
to pick up Item 1 which was on a card in front of 
him and read Martin’s response on the reverse side 
of the card. The experimenter placed this response 
on the board and gave $.25 to the subject who had 
made the closest prediction. Two minutes were al- 
lowed for discussion. The same procedure was fol- 
lowed through Item 10, excepting that the subjects 
took turns reading Martin’s responses, that is, Sub- 
ject 2 read the response on Item 2, etc. 

After the subjects had predicted Martin’s response 
for Item 10 and Martin’s response had been read, the 
experimenter totaled the number of winning predic- 
tions by each subject and announced who had made 
the closest predictions the most number of times. As
the subjects had been instructed, the winner would
not participate in the group predictions of Items
11-15, and would replace the experimenter as moderator.
His duties included: placing the group’s predictions
and Martin’s responses on the blackboard, timing
the subjects so that the experiment would not run
over the hour limit, and distributing the quarters.
The subjects were reminded by the experimenter that
if their predictions were within 3 points of Martin’s
response, each group member would receive a quarter.
If they failed, the new moderator would receive
a quarter.


Stooge 


The stooge was a college sophomore who stood 5
feet, 8 inches, weighed 135 pounds, had short blond
hair, and a youthful face. Since the stooge knew the
correct responses, his predictions were more accurate
than the subjects’ on an average of 7 out of 10 times.
Therefore, in every group, by virtue of his accuracy,
the stooge was the “winner” and became moderator,
not participating in the group’s decisions.


Stooge as Frustrator 


Central to the experimental design was placing the
stooge in the position of ambivalent frustrator. This
was attempted by allowing the stooge to win most
of the quarters, give brilliant reasons for his answers,
share his reasoning only a limited number of times,
become moderator, and by allowing him to win more
quarters through the group’s poor predictions to
Items 11-15. (Only one group out of the nine pre-
dicted any of the last five items with sufficient ac-
curacy to allow for the distribution of quarters
among them; this happened on only one item.)


Stooge as Aggressive Target


The opportunity for aggression was amply pro-
vided for by the stooge by limiting the subjects’ time
in Items 13-15 and by making honest but meaning-
less suggestions for the subjects’ benefit such as, “If
you’re disagreeing, why don’t you take a median?”


Identification Measures 


The stooge exhibited three behavioral character-
istics consistently throughout the nine sessions. These
characteristics were: playing with a pencil by hold-
ing it in a horizontal position by placing his thumbs
and forefingers on it and moving them back and
forth slowly rotating the pencil a half circle each
time; prefixing statements with “My God,” for ex-
ample, “My God, that’s interesting.”; and sitting in
a reclining position so that his head touched the back
of his chair. An instance of identification was scored
when the subject performed any of these three be-
haviors.


Experimental and Control Groups 


The stooge’s behavior was reasonably the same in
both experimental and control groups until time 
came for the subjects to make their prediction for 
Item 13. At this point, the stooge absorbed aggres- 
sive attacks by the subjects in the experimental 
groups by remaining silent during the aggressive out- 
bursts; but at the same time continuing to remind 
them of their time responsibilities to the experi- 
menter. In the control groups, the stooge agreed and 
sympathized with the aggressor’s statements—thus 
reducing the aggressive outflow. To insure us that 
the stooge’s behavior was the same in all sessions, 
the following device was included in the design. A 
slip of paper on which either X (for experimental) 
or C (for control) was written was placed face down 
on the runner of the blackboard before each session. 
When the stooge placed the subject’s prediction for 
Item 12 on the board, he turned the slip over and
viewed the letter. The X or C was chosen in such a 
manner that the experimenter himself did not know 
which letter was on the paper. 


Data Collection 


While the experimenter was moderator (Items 1-
10), an observer was recording any instances of the
stooge’s three behavioral characteristics exhibited by
the subjects and any aggressive statements by the
subjects. These recordings were made on a Bales-
Gerbrands Interaction Recorder. After the stooge had
“won” his position as moderator, the experimenter
and the observer then made independent recordings
of the subjects’ statements and behavioral charac-
teristics. Each session was recorded on a tape recorder.

After the sessions were completed, the data were
gathered by synchronizing the paper tape from the
Bales Recorder with the recording tape and extract- 
ing the aggressive statements which were made prior 
to any instance of identification. Since the paper 
tape moved at a rate of 1 foot per minute, the dis- 
tance between the aggressive act and the indication 
of identification (if identification occurred) was 
measured with a ruler in inches. 
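
The tape-speed reasoning above can be sketched as a small conversion. This is only an illustration: the function name and the example distance are hypothetical, and the only figure taken from the text is the tape speed of 1 foot (12 inches) per minute, which makes each inch of tape equal to 5 seconds.

```python
# Tape speed stated in the text: 1 foot of paper tape per minute.
TAPE_INCHES_PER_MINUTE = 12.0


def inches_to_seconds(inches: float) -> float:
    """Convert a ruler distance on the paper tape to elapsed seconds."""
    if inches < 0:
        raise ValueError("distance on the tape cannot be negative")
    return inches / TAPE_INCHES_PER_MINUTE * 60.0


# Hypothetical example: 6 inches between an aggressive act and an
# instance of identification corresponds to 30 seconds.
print(inches_to_seconds(6.0))  # 30.0
```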


Apologies 

At the end of each session, appropriate amends
were made by informing the subjects that the stooge
was a stooge and that the task information was
fictitious. Subjects agreed not to discuss the experi-
ment with their classmates until all groups had met.


RESULTS 


Table 1 shows the breakdown of instances of 
identification in the experimental and control 
groups. Comparing the total acts of identification 
in the experimental groups with acts of identifi- 
cation in the control groups, we find that there 
were 16 instances in the former groups compared 
with 5 in the latter. This difference was in the 
predicted direction and statistically cannot be 
attributed to chance. 

Disregarding the distinction between experi- 
mental and control groups, 20 of the 21 instances 
of identification followed aggressive statements 
and only subjects who were aggressive exhibited 
the behavioral characteristics used as our identifi- 
cation measures. All but one instance of identi- 
fication appeared after Item 12 had been 
predicted. 

TABLE 1
MEASURES OF IDENTIFICATION

Group                                “My God”  Pencil  Leaning  Total  M  Variance
Experimental (N = 16, four groups)       5       8        3       16   4     2
Control (N = 20, five groups)            1       2        2        5   1     3

Note: t = 4.18, df = 7, p = .01, two-tailed.


Finally the data suggested a second hypothesis
which stated: the more intense the aggressive
statement, the sooner the act of identification
will occur. This was tested in the following
manner. Three judges independently rated each
aggressive statement preceding an act of identifi-
cation on a 1-3 scale (1 = very intense, 2 =
moderately intense, 3 = slightly intense). The
three ratings for each statement were totaled
and formed a scale from 3 to 9. This intensity
scale was then correlated with the number of
inches measured between the aggressive state-
ment and the instance of identification. These
measurements were taken from the Bales Re-
corder paper tape. The correlation was .60 and
the hypothesis was supported at the .03 level of
significance.
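
The intensity scale and its correlation with tape distance can be sketched as follows. Several things here are assumptions: the text does not say which correlation coefficient was used, so a Pearson product-moment correlation is assumed, and the ratings and distances below are hypothetical values chosen only to make the sketch runnable. Because 1 means very intense, a low total marks an intense statement, so a positive correlation between the total and the distance is the direction that supports the hypothesis.

```python
import math


def intensity_total(ratings):
    """Sum three judges' 1-3 ratings (1 = very intense) into a 3-9 score."""
    if len(ratings) != 3 or any(r not in (1, 2, 3) for r in ratings):
        raise ValueError("expected three ratings, each 1, 2, or 3")
    return sum(ratings)


def pearson_r(xs, ys):
    """Pearson product-moment correlation (assumed; the text names no coefficient)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data: one tuple of judge ratings per aggressive statement,
# and the inches of tape between that statement and the identification act.
judge_ratings = [(1, 1, 1), (1, 2, 1), (2, 2, 2), (3, 2, 3), (3, 3, 3)]
inches = [2.0, 5.0, 4.0, 9.0, 8.0]

totals = [intensity_total(r) for r in judge_ratings]
r = pearson_r(totals, inches)
print(totals, r > 0)
```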


Discussion 


The results of the experiment thus support the 
general correlation between the expression of 
aggression and identification reported at the 
beginning of this paper. More important, the 
temporal order of events (aggression followed by 
identification with the aggressed upon object) has 
been demonstrated. It should be noted that 
implicit in this hypothesis is the belief that the 
mechanism of identification through aggression is 
motivated by an unconscious desire to become 
the object. However, the possibility exists that 
some form of identification occurs after every 
aggressive act independent of the nature of the 
target. This theoretically important distinction 
was not tested. 

This theory of aggression has several interest- 
ing implications, practical and theoretical, in such 
fields as child development, psychoanalysis, and 
other areas of social relations. For example, it 
appears to be complementary to the “status- 
envy” hypothesis of identification proposed by 
Burton and Whiting (1961). This hypothesis 
states: 


If there is a status that has a privileged access to a 
desired resource, the incumbent or occupant of such 
a status will be envied by anyone whose status does 
not permit him the control of the right to use the 
resources. Status envy is, then, a motivational com- 
ponent of status disability and such motivation leads 
to learning by identification [p. 86]. 


They suggest that the process of learning by 
identification consists of the covert practice of 
the envied status. This practice of the envied 
role through fantasy and play is obviously a long 
and gradual process and is not very productive 
when a person is motivated by an immediate 
need to “become” another person. In such situa- 
tions (like the situation created in the experi- 
ment presented above), it is proposed that the 
person regresses to a more primitive process of 
identification—identification through aggression. 
Recognition should be given to the possibility 
that identification could have been the result of 
the target’s social power or task competence
alone. That is, the target’s power and compe- 
tence may have angered the subjects and, at the 
same time, led them to adopt his characteristics. 
(Support for this interpretation can be found in
Mussen & Distler, 1959, and in Bandura, Ross,
& Ross, 1963.) However, subjects in the control
groups had sufficient time to perceive the target’s
superior task ability and, for a time (Items 11
and 12), were also subject to his power. It was
not until aggression was allowed (experimental
groups) or complaints were agreed with (control
groups) that the above differences in identifica-
tion were observed. This finding suggests that
aggression was more instrumental in producing
identification than social power theories imply.
Needless to say, extension or revisions of the 
hypothesis presented in this paper will require 
further experimentation. Particular attention 
should be directed to investigations of conditions 
precedent to activation of this mechanism of 
identification and to individual differences in 
the degree of utilization of the mechanism. Of 
course, any research on the subject of man's 
destructive impulses should be helpful in clarify-
ing issues raised (and implied) in this paper.
A recent experiment (Walster & Festinger, 1962) supports Hovland, Janis, and
Kelley’s (1953) suggestion that a “desire to influence” on the part of a com-
municator will decrease his effectiveness by making him appear relatively
untrustworthy. In the present paper we suggest that this result may be limited
to a covert rather than an overt expression of intent to influence. Our major
hypothesis was that an overt, frankly stated desire to influence would actually
enhance the effectiveness of an attractive communicator. The results confirmed
this hypothesis: When the communicator was attractive he was more effective
if he announced his intention to persuade; when the communicator was
unattractive his stated intention to persuade had no apparent influence on his
effectiveness.


If the members of an audience think that a
communicator wants to change their opinions,
how will this affect their actual opinion change?
Hovland, Janis, and Kelley (1953) have sug-
gested that the influence of the communicator
will be reduced because he will be regarded as
untrustworthy. They state “when a person is
perceived as having a definite intention to per-
suade others, the likelihood is increased that he
will be perceived as having something to gain
and hence, as less worthy of trust [p. 23].”

This suggestion was verified in a recent experi-
ment by Walster and Festinger (1962) who
showed that a communication was more effective
if the members of the audience felt that the
communicator was unaware of their presence.
Walster and Festinger concluded that, in the
aware condition, the speaker was less effective
because he was seen as intending to exert influ-
ence and therefore, may have aroused suspicions
of possible ulterior motives.

Although this interpretation is reasonable, one
should exercise caution in generalizing from it.
Specifically, it should be pointed out that in
the Walster-Festinger experiment any perceived
desire to influence was indirect. The reduced
opinion change may not have been due
to a “desire to influence” alone but, rather, due
to an inference drawn by the audience that the
communicator, because of a covert desire to influence,
may have been overstating his position in a
somewhat dishonest fashion. Thus, the
audience may have resisted because of a reluc-
tance to be “conned” or “taken in.” It is our
contention that, under certain conditions, a 
frankly stated desire to influence may actually 
enhance the effectiveness of a communicator. 

Suppose the communicator were a very at- 
tractive person—a person whom the members 
of an audience would like very much to please. 
It seems reasonable to suggest that such a person 
would be more effective if he expressed a desire 
to influence. If people think that someone whom 
they like very much wants them to do something, 
they may be motivated to do it, even if the 
admired person might never know about it. Thus, 
a very attractive source may be more effective 
in changing opinions if he openly and honestly 
informs the audience that he wants to change 
their opinions. 

What if the communicator is unattractive? If 
the audience does not like the communicator, 
they will not be prone to please him by allowing 
their opinions to be changed according to his 
desire. Thus, a stated desire to influence will not 
increase the effectiveness of such a communicator. 
In fact, if anything, an unattractive communi- 
cator may lead the audience toward active dis- 
affiliation; hence, a stated desire to influence, in 
this case, might actually decrease the effectiveness 
of the communication. 


EXPERIMENT I 

Method 

The essential features of the design were as fol- 
lows: Opinions of male subjects were measured 
before and after they received a communication from 
a female communicator. Identical communications 
were presented by the same communicator under 
four different conditions: attractive persuade, at- 
tractive nonpersuade, unattractive persuade, and 
unattractive nonpersuade. In the attractive conditions 
the female communicator was made up to look 
physically attractive. In the unattractive conditions
she was made up to look physically unattractive. 
In the persuade conditions the communicator volun- 
teered to give the communication and stated before- 
hand that she would very much like to influence 
the views of others. In the nonpersuade conditions
she did not volunteer and stated that she was not 
at all interested in influencing the views of others. 

The subjects were male students in introductory 
psychology. They were required to participate in 
the experiment as part of their regular course work. 
The subjects were randomly assigned to the four 
experimental conditions. They were run in groups,
20-30 subjects in each group. A few women students 
from the same class were also scheduled at each 
session so that the female confederate would not 
appear conspicuous. The experiment was run on 2 
successive days, two groups on each day. 

More than 2 months prior to the experimental 
sessions the subjects answered a disguised preques- 
tionnaire. This questionnaire was administered during 
their regular class period and consisted of 42 Likert- 
type items dealing with 10 different issues of rele- 
vance to college students. There were 7 alternatives 
for each item ranging from strongly agree to strongly 
disagree. Five of the items were concerned with the 
issue of general versus specialized education, for 
example, “Every college student should receive a 
broad general education.” “Students should not be 
forced to take courses to make them well rounded.” 

When the subjects arrived at the experimental 
session, the experimenter explained that he was 
conducting a study to find out how students feel 
about the issue of general versus specialized educa- 
tion. He said that he was going to have them answer 
a questionnaire on this topic and went on to explain 
that the results of the questionnaire were much more 
valid if respondents were allowed to think about 
the questions beforehand. He told them that in 
order to familiarize everyone with the questions he 
was going to have one person in the group give his 
answers to the questionnaire aloud, in front of 
everyone, before distributing it. 

In the persuade conditions the experimenter asked 
for a volunteer to answer the questionnaire aloud. 
The female confederate volunteered and was chosen 
by the experimenter. In the nonpersuade conditions 
the experimenter said he had had great difficulty in
getting volunteers in the past, and so he was going 
to pick someone himself. He then chose the confeder- 
ate, who always sat near the front of the room. 
She appeared very reluctant to participate but finally 
agreed to do so. 

In the attractive conditions the confederate was 
made up to look extremely attractive. She wore
chic, tight-fitting clothing; her hair was modishly 
coiffured; she wore becoming makeup. In the un- 
attractive condition the same confederate was made 
up to look repulsive. She wore loose, ugly, ill-fitting 
clothing; her hair was messy; her makeup was con- 
spicuously absent; the trace of a mustache was 
etched on her upper lip; her complexion was oily 
and unwholesome looking. 

When the confederate responded to the question- 
naire in front of the group, she gave previously 
prepared answers. In all conditions she made identical 
responses to the first four questions which dealt
with interest in the issue, amount of thought about
the issue, desire to get more information about it,
and desire to know the views of others. For the fifth
question, “How much would you like to influence
the views of others on this issue?” her responses
varied in accord with the different conditions. In
the persuade conditions she answered “very much,”
the most positive of the five alternatives. In the
nonpersuade conditions she answered “not at all,”
the least positive of the alternatives.

The remainder of the questionnaire consisted of 
10 items concerning general education—the 5 items 
that were given in the prequestionnaire and 5 new 
items similar in content. In addition to indicating 
her degree of agreement with each item on the 
7-point scale, the confederate was also asked to state 
her reasons in a few sentences. In all conditions she
gave extremely pro-general-education responses to
each item and made identical remarks about her
reasons.

After the confederate had given her answers, the 
questionnaires were administered to the subjects.
When they were completed, the experimenter told the
subjects that since the questionnaire did not take
the full hour, the remaining time was to be devoted
to another study. He then introduced a second
experimenter and left the room. 

The second experimenter explained that he was 
interested in finding out how much people agree 
in their ratings of someone after they had just 
seen him briefly. He asked the subjects to rate the 
experimenter who had left, and informed them that 
their rating would be kept confidential. He dis- 
tributed forms on which were listed 28 different 
personality characteristics. The subjects indicated 
how well each characteristic applied to the person
rated by circling a number on a scale from 0
(extremely inappropriate) to 20 (extremely appro-
priate). The purpose of having the subjects rate
the first experimenter was to provide a plausible
context in which they could be asked to rate the
female confederate. When they had finished rating
the first experimenter, the second experimenter went
over to the confederate and told her she could leave.
He then told the subjects that he would also like to
find out how much people agreed in their ratings
of a female and asked them to rate the confederate
on a similar rating form.

After the subjects had completed their ratings
of the female confederate, the experimenter asked
them to turn over their papers and describe their
reactions to the experiment in a few sentences.
Those subjects whose comments indicated suspicion
of the procedure were eliminated before the results
were analyzed. In all, 12 subjects were eliminated,
no more than 4 from any one experimental condition.
Finally, before dismissing the subjects, the experi-
menter cautioned them not to discuss the experiment
with other students. He explained that if they did,
future results would be biased.


Results 


Before presenting the data on opinion change,
it is appropriate to consider the evidence con-
cerning the success of the experimental manipu-
lation of the perception of the communicator.
The subjects’ ratings of the confederate’s charac-
teristics provide strong evidence that she was
perceived as more attractive in the attractive
conditions than in the unattractive conditions.
The mean rating for the characteristic attractive
was [...] for the two attractive conditions com-
bined; it was 13.2 for the two unattractive condi-
tions combined. The difference is sizable and is
significant beyond the .001 level.2 The ratings
for charming, fashionable, neat, and romantic
were significantly greater (p < .001) in the
attractive conditions than in the unattractive
conditions. In the only other significant differ-
ences between attractive and unattractive condi-
tions, the confederate was rated more affectionate
(p < [...]) and more superficial (p < .05) in the
attractive conditions.

The evidence concerning the effectiveness of
the manipulation of the perception of the source’s
desire to influence is much less direct. The con-
federate was rated as significantly less timid
(p < [...]), less modest (p < .01), less anxious
(p < [...]), more conceited (p < .01), more affec-
tionate (p < .05), and more cynical (p < .05) in
the persuade conditions than in the nonpersuade
conditions. The differences for timid, modest,
anxious, and conceited can be considered evi-
dence that the subjects did notice whether the
confederate volunteered to give her answers in
[...] the other characteristics.

There is one other bit of evidence concerning
the difference between the persuade and nonpersuade
conditions. On the item in the general education
questionnaire concerning desire to influence
the views of others, subjects in the
persuade conditions indicated they were more
interested in influencing others than did sub-
jects in the nonpersuade conditions (p < .05).
Hearing someone say that she wanted to influ-
ence others seems to have made the subjects
in the persuade conditions feel more free to
admit their own desire to influence others.

2 Significance was tested by analysis of
variance unless otherwise specified.


TABLE 1
MEANS FOR THE MEASURES OF FAVORABILITY TO GENERAL EDUCATION

                                           Experiment I                            Experiment II
Condition                  N   Prequestionnaire(a)  Postquestionnaire(b)  Adjusted post  Postquestionnaire(c)   N
Attractive persuade        24         24.3                 42.1               41.8              59.5            38
Attractive nonpersuade     18         23.8                 36.9               37.0              55.6            19
Unattractive persuade      26         23.8                 38.0               38.1              56.8            19
Unattractive nonpersuade   29         23.8                 37.7               37.8              56.6            22

Note: The higher the score, the more favorable to general education.
(a) Based on 5 items, 7 alternatives per item.
(b) Based on 10 items, 7 alternatives per item.
(c) Based on 10 items, 9 alternatives per item.

In order to determine the effect of the experi- 
mental conditions on opinions, measures of 
favorability to general education were calculated 
from the subjects’ responses on the pre- and 
postquestionnaires. Scores from 1 to 7 were 
assigned to the 7 degrees of agreement or dis- 
agreement with each item concerning general 
education (the more pro general education the 
response, the higher the score). These scores 
were then summed for the 5 items in the pre- 
questionnaire and the 10 items in the post- 
questionnaire. Means of the four different experi- 
mental conditions for the pre- and postquestion- 
naires, along with the Ns on which they are
based, are presented in Table 1, under the head- 
ing Experiment I. The means for the prequestion- 
naire were not significantly different in the 
various conditions. However, since the attractive 
persuade condition was slightly higher than the 
others on the prequestionnaire, the scores for 
the postquestionnaire were adjusted in terms of 
the scores for the prequestionnaire by means of 
analysis of covariance. The adjusted post means 
are also presented in Table 1. 
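
The scoring rule described above can be sketched as follows. This is a minimal illustration under stated assumptions: the text says only that the more pro-general-education response received the higher score, so the response coding (1 = strongly disagree up to 7 = strongly agree), the reverse-keying of items worded against general education, and the function and parameter names are all hypothetical.

```python
def favorability(responses, reverse_keyed=(), n_alternatives=7):
    """Sum item scores so that higher totals favor general education.

    responses: agreement ratings, 1 (strongly disagree) to n_alternatives
        (strongly agree); this coding is an assumption.
    reverse_keyed: indexes of items worded against general education,
        whose ratings are flipped before summing (also an assumption).
    """
    total = 0
    for index, rating in enumerate(responses):
        if not 1 <= rating <= n_alternatives:
            raise ValueError("rating outside the response scale")
        if index in reverse_keyed:
            rating = n_alternatives + 1 - rating
        total += rating
    return total


# Hypothetical prequestionnaire answers on 5 items; the second item is
# worded against general education, so strong disagreement (1) scores 7.
print(favorability([7, 1, 6, 5, 4], reverse_keyed={1}))  # 29
```

With 5 items and 7 alternatives the total ranges from 5 to 35; with 10 items it ranges from 10 to 70, or from 10 to 90 when 9 alternatives are offered.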

Our hypothesis leads us to expect that the 
adjusted post mean for the attractive persuade 
condition would be greater than the adjusted 
post mean for the attractive nonpersuade 
condition. Inspection of Table 1 reveals that 
this is the case. However, the difference between 
the means does not reach the conventional level 
of significance (t= 1.45, p=.12, two-tailed). 

In addition, we predicted that a desire to influ- 
ence would not be an asset (and might be a 
liability) if the communicator were unattractive. 
Comparing the adjusted post mean for the un- 
attractive persuade condition with the adjusted 
post mean for the unattractive nonpersuade 
condition reveals no apparent difference; the
means are virtually identical when the com- 
municator is unattractive. 

None of the other possible comparisons ap- 
proaches significance. If the data are viewed in 
terms of the effect of the attractiveness of the 
communicator, the adjusted post mean for the 
attractive persuade condition is slightly greater 
than that for the unattractive persuade condi- 
tion; this difference does not approach signifi- 
cance. Similarly, the adjusted post means for
the two nonpersuade conditions are virtually 
identical. 

Because the results of the experiment were 
provocative but not conclusive, the same experi- 
ment was repeated several months later at a 
different university. 


EXPERIMENT II 


The procedure in Experiment II was identical to 
that used in Experiment I with the following excep- 
tions: 


1. The prequestionnaire was eliminated. 

2. Two groups of subjects were run under each 
of the four experimental conditions, 10-20 subjects 
in each group. 

3. A different girl acted as the confederate in 
Experiment II, and different persons played the 
roles of the two experimenters. 

4. In an attempt to strengthen the manipulation 
of the perception of the source’s desire to influence, 
another question was inserted at the beginning of 
the general education interview with the accomplice, 
namely, “To what extent would you like to get other 
students to agree with you on this issue?” In the 
persuade conditions the confederate answered “very 
much”; in the nonpersuade conditions she answered 
“not at all.” 

5. In an attempt to increase the sensitivity of the 
opinion questionnaire, two additional alternatives 
were included for each of the items about general 
education, making 9 possible degrees of agreement 
or disagreement for each item. 

6. In an attempt to increase the impact of the 
confederate’s opinion, the subjects were asked to 
make their responses to each item immediately after 
they heard the confederate’s response to that item. 
In other respects the procedure for Experiment II 
was identical to that described for Experiment I. 


On the basis of the comments written at the end 
of the experiment, 15 subjects were eliminated from 
Experiment II because of suspicion about the pro- 
cedure. No more than 6 subjects were eliminated 
from any one condition. 

The ratings of the confederate provide evidence
that the manipulation of attractiveness was even 
more successful in Experiment II. The mean rating 
for the characteristic attractive was 16.3 in the two 
attractive conditions combined; it was 7.1 in the 
two unattractive conditions combined (p <.001). 
Again the ratings for charming, fashionable, neat,
romantic, and affectionate were significantly greater
(p < .001) in the attractive conditions than in
the unattractive conditions. In addition, the con-
federate was rated more likable (p < .001), less
unpleasant (p < .01), less annoying (p < .01), and
less emotional (p < .05) in the attractive conditions.
The difference for superficial was not significant in
Experiment II.

As in Experiment I, the confederate was rated
as significantly less modest (p < .05) and more con-
ceited (p < .01) in the persuade conditions than in
the nonpersuade conditions. The ratings were also
significantly higher in the persuade conditions for
superficial (p < .01), emotional (p < .01), candid
(p < .01), and frank (p < .05). There were no sig-
nificant differences in the ratings for timid, anxious,
affectionate, or cynical. As before, subjects in the
persuade conditions said they were more interested
in influencing the views of others (p < .001) and
also, on the new question, said they were more
interested in getting other students to agree with
them (p < .001).

A measure of favorability to general education 
was calculated from responses to the questionnaire 
about general education in the same way as in 
Experiment I. The means of the two different groups 
in the same condition were highly similar in all four 
cases and so they were combined. The combined 
means for the four different experimental conditions 
are presented in Table 1, along with the corre- 
sponding Ns.

It can be seen from Table 1 that the pattern of
the results in Experiment II is highly similar to that
of Experiment I. The mean for the attractive per-
suade condition is higher than the mean for the
attractive nonpersuade condition. The two unat-
tractive conditions have almost identical means,
the unattractive persuade condition being slightly
higher. As in Experiment I, the difference between
the attractive persuade and attractive nonpersuade
conditions does not reach significance (t = 1.20).
However, when the p value for the difference from
Experiment I is combined with the value from
Experiment II by Stouffer’s method (Mosteller &
Bush, 1954), the resulting (two-tailed) p is .06.
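
The combination of the two experiments can be sketched with Stouffer's method, in which the standard normal deviates from independent tests are summed and divided by the square root of their number. The sketch rests on assumptions: the two reported t values (1.45 and 1.20) are treated as approximately standard normal, which is reasonable only for fairly large samples, and the two experiments are weighted equally. The authors may have converted their p values more exactly, and the function name is hypothetical.

```python
import math


def stouffer_two_tailed(z_values):
    """Combine independent z values with equal weights (Stouffer's method).

    Returns the combined z and its two-tailed p under the standard normal.
    """
    combined_z = sum(z_values) / math.sqrt(len(z_values))
    # For a standard normal Z, P(|Z| > z) = erfc(z / sqrt(2)).
    p_two_tailed = math.erfc(abs(combined_z) / math.sqrt(2.0))
    return combined_z, p_two_tailed


# t values from Experiments I and II, treated as approximate z values.
z, p = stouffer_two_tailed([1.45, 1.20])
print(round(z, 2), round(p, 2))  # 1.87 0.06
```

Under these assumptions the sketch gives the same two-tailed p of .06 as the text.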


DISCUSSION 


The results of the two experiments together
offer support for our hypothesis, namely, when
the communicator is very attractive, an openly
stated desire to influence the views of the audi-
ence will actually increase the effectiveness of the
communication. Although the predicted difference
between the attractive persuade and attractive
nonpersuade conditions did not reach significance
in either of the two experiments alone, the com-
bined p is just barely short of the conventional
.05 level. The great consistency of the data argues
strongly against the acceptance of the null hy-
pothesis. Contrary to inferences drawn from
previous results, these data indicate that a desire
to influence, if frankly stated, can be an asset
to persuasiveness rather than a detriment.


A potential alternative explanation for these
results might involve the possibility that the
confederate was seen as more attractive in the
attractive persuade condition than in the attrac-
tive nonpersuade condition. Perhaps when the
confederate volunteered she was perceived as
more exuberant, vivacious, and lively. This might
have made her more attractive and therefore
could have increased her influence. However, the
data do not support this interpretation. If any-
thing, the confederate was actually rated slightly
less attractive in the attractive persuade condition
than in the attractive nonpersuade condition in
both experiments.

We also suggested that, when the communi-
cator is not attractive, the greater the desire to
influence, the less will be the opinion change.
The data lend no support to this hypothesis. In
both experiments the unattractive persuade and
unattractive nonpersuade conditions were remark-
ably similar. Thus, in this condition, a desire to
influence does not appear to be an asset; neither
does it appear to be a liability.

It should be noted that the manipulation of
the attractiveness of the communicator was de-
signed so that it would not also affect the per-
ceived expertness of the communicator. If, for
example, the communication had dealt with
women’s cosmetics, then these two variables
might well have been confounded. The sub-
ject of communication, general education, was
chosen so that it would be irrelevant to physical
attractiveness, the basis for the attractiveness
manipulation.

Taken as a whole, our findings dictate caution
in interpreting the results from recent experi-
mentation which indicate that a desire to influ-
ence decreases the effectiveness of a communi-
cator. Our results suggest that this decrease in
effectiveness is limited to a covert rather than
an overt desire to influence. Our data indicate
that an overt, frankly stated desire to influence
does not influence the effectiveness of the com-
municator, unless he is attractive. If he is at-
tractive it increases his effectiveness; if he is
unattractive it has no apparent effect.


ATTITUDE CHANGE AS A FUNCTION OF AUTHORITARIANISM
AND PUNITIVENESS 1


O. J. HARVEY


University of Colorado


To study reactions to abuse in the form of a nondirectional order from an
authority figure to change previously stated attitudes, 55 undergraduate psy-
chology students were administered the California F Scale, the Rosenzweig
P-F Study, and 5 opinion scales regarding self, peers, instructors, people in
general, and certain professions. Authoritarianism related significantly to
changes in opinion as a result of treatment while punitiveness scores did not.
No evidence of displacement among more authoritarian Ss was found. On
the contrary, the results suggest that because of previous histories more
authoritarian Ss may expect (and accept as proper) abuse from authority
figures; consequently they did not feel frustrated but confirmed by derogation
from the instructor.


1 This study, completed in 1958 when both authors
were at Vanderbilt University, was partially financed


It is a common assumption among investi-
gators of attitude change that the recipient of
information which is incongruent with existing 
concepts reacts in ways that favor his more 
central concepts (Festinger, 1957; Harvey, 
Vanderbilt University. Reproduction of this study 


in whole or part is permitted for any purposes of 
the United States Government. 
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Kelley, & Shapiro, 1957; Osgood & Tannenbaum, 
1955). This reaction is reflected in various 
specific modes of psychological gerrymandering 
and neutralization of the incongruity, such as 
discrediting the source (Hovland, Harvey, & 
Sherif, 1957; Osgood & Tannenbaum, 1955), 
distortion of the input toward greater extremity 
(Hovland et al., 1957), boomerang and distor- 
tion of own concept to fit the input (Harvey & 
Caldwell, 1959), as well as others, 

Following from the assumption that situa- 
tional and dispositional factors operate inter- 
dependently to determine cognition and response 
at any given time, both stimulus and personality 
determinants of differential responses to such 
incongruities have been found. Among the rele- 
vant personality variables, authoritarianism has 
been found to be among the most consistent 
correlates of differential response to concept 
negation. This consistency occurs across target 
concepts of varying degrees of centrality from 
such a highly central referent as the self 
(Harvey, 1958) to such concepts of less in- 
volvement as heaviness of weights (Harvey, 
1959), or distance between points of light 
(Harvey & Caldwell, 1959). 

While argument continues over the actual 
nature of authoritarianism, one reliable charac- 
teristic of this syndrome appears to be a high 
weighting of authority-related cues as guidelines 
to behavior. Hence more than one study has 
found a significant positive relationship between 
authoritarianism and opinion change when the 
source of the incongruous input is of fairly high 
status, and when the involvement in the target 
concept is not too high (e.g., Crutchfield, 1955; 
Wagman, 1955), or even when involvement in 
the target concept is high but the authority of 
the source is also very high and unimpeachable 
(e.g., Harvey & Beverly, 1961). On the other
hand, authoritarianism has been found to relate 
negatively to concept change when the source 
of the incongruous input was a person of low 
status or a discrepant psychophysical anchor, 
or when the target concept was of very high 
involvement (Harvey, 1958, 1959; Wagman, 
1955). The authority orientation of the more 
authoritarian individuals is well illustrated in a 
study by Harvey and Beverly (1961). Among 
the experimental treatments, subjects who were 
opposed to alcohol and varied in authoritarian- 
ism were exposed to a mildly proalcohol com- 
munication which they interpreted as being 
endorsed by an unimpeachably high status source. 
Along with a retest of their opinions about alcohol 
following the communication, subjects were also
tested on their ability to recall the arguments
the speaker had made in favor of alcohol. Subjects
higher in authoritarianism not only changed
their opinions more in the proalcohol direction,
they also failed significantly more than those
low in authoritarianism to get the point of the
speaker.

Experiments on the effects of concept negation
typically have exposed the subject to a
directional communication, one which provides
information on the positional preference of the
source. In the present study departure was made
from this practice, with the subject being exposed
to a nondirectional communication from
a source of high status. Following au[...]
instructors, professions, and people in general,
the subjects were strongly reprimanded [...]
the instructor, were ordered by her to [...]
ratings. Responses to this seemingly [...]
and tyrannical treatment by the instructor [...]
variables, scores on the F Scale and the Rosenzweig
P-F Study.

Bases exist for predicting different effects of
authoritarianism:

1. The assumption of Adorno, Frenkel-Brunswik,
Levinson, and Sanford (1950) that
authoritarianism disposes toward displacement of
aggressive tendencies away from high and toward
low status figures would lead to the prediction
that persons of higher F scores would, following
reprimand by the instructor, become more negative
toward the low status figures while those
of lower F scores would not.
2. It might be expected, because of [...]
training histories, that persons higher in authoritarianism
would come to expect as proper and
fitting more extreme treatment from authority
figures. If so, instead of being frustrated by
the seemingly unfair treatment in this [...]
positive affect generated, and become more positive
in their reratings of authority (instructors)
and other figures.

Application of Rosenzweig’s assumptions would
lead to the expectancy that subjects higher in
extrapunitiveness would be more likely to become
more negative toward authority-related [...],
whereas those higher in intrapunitiveness would
become more unfavorable toward [...]
following reprimand by the instructor.


METHOD
Instruments 


Two personality scales were administered: The 
California F Scale (Form 30), as a measure of 
authoritarianism; the Rosenzweig Picture-Frustration 
Study (Revised Form for Adults), as a measure of 
punitiveness. Also administered were questionnaires 
designed to measure subjects’ attitudes before and 
after treatment toward each of five referents. These 
referents included: self, rated on 15 social characteristics,
such as friendliness, sociability, and trustworthiness,
found in an earlier study (Harvey et al.,
1957) to be relevant to college students; peer group,
rated on 15 multiple-choice items concerning the
social and intellectual capacities of school and classmates;
instructors (in general), rated on 22 multiple-choice
items reflecting subject identification with
them; professions, 10 of which were rated on a
single item of morality; and people in general, rated
on 21 items concerned with honesty, intelligence,
etc. The corresponding split-half reliabilities for each
instrument were: self = .54, peer group = .84, instructors
= .42, professions = .89, and people in general
= .82. The test-retest reliability was obtained for
only the self-rating scale (r = .68).


Procedure 


The experiment involved two sessions. In the 
first, all personality and rating schedules were 
administered following an explanation of the study 
as one concerned with the development of reliable 
opinion scales. This incorrect explanation was aimed
at providing a rationale for the later retest, as well
as minimizing threat on test performance. At the
end of the first session, the experimenter indicated
that he would leave the forms with the instructor
and would return in 10 days for further testing. It
was pointed out that, as subjects had omitted their
names from the forms, the anonymity of their
ratings would be assured. In ways known only to
the experimenter, however, the forms were marked 
in order that they could be matched by subject 
for the two sessions. 

At the appropriate time before the experimenter’s
return for the second administration, 10 days later,
the instructor derided the subjects for the ratings
they had made earlier of the 10 professions. She
stated that in glancing at these ratings she had
found them so childish, immature, and naive that
she was ashamed to have the experimenter see them.
Consequently, she had destroyed all the forms on
which the professions had been rated. The class was
ordered to rerate the professions, under the stern
admonition from the instructor that greater maturity
should be manifested, and greater effort
made toward more adequate performance in the
reratings. In effecting the derogation, the instructor
was careful not to suggest any preferred
direction of change, especially when such information
was sought by the subject.

By prearrangement, the experimenter arrived just
as the subjects had completed their reratings of
the professions. All the remaining attitude inventories
were then administered by the experimenter
for the second rating. As the terminal task, the
subjects completed a form which was aimed at
measuring their involvement in and reaction toward
the experiment. Questions on this form offered the
first clue in the experiment that the instructor had
not been serious in her derogation. Following its
completion the nature of the study was explained
and all questions of the subjects answered to their
satisfaction.


Subjects 


Fifty-five subjects completed the required forms 
in both sessions. All were students in the same 
introductory psychology course at Memphis State 
College. 


RESULTS AND DISCUSSION 


Authoritarianism, Punitiveness, and Change in 
Ratings 

FIG. 1. Changes in regression lines of correlations
between F Scale score and dependent variables as a
result of experimental treatment (N = 55).

With direction of change disregarded, owing to
the nondirectional nature of the pressure toward
change, scores on the F Scale correlated with
change in ratings of the five referents as follows:
self, .62; peer group, .40; instructors, .29;
people in general, .38; and professions (on which
the criticism was administered), .09. All of these
correlations except the one related to professions
were statistically significant (p < .05).
Authoritarianism also correlated significantly
(r = .58) with involvement in the experiment.
This led to correlating both F Scale and P-F
Study scores to change toward the five referents
with involvement partialed out. The correlations
between F scores and changes in ratings were
almost completely unaffected by controlling for
involvement with one exception: The correlation
between change in ratings of professions and F
was increased from .09 to .31 (p < .01).
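
The partialing step described above can be illustrated with the standard first-order partial correlation formula. The sketch below is only an illustration, not the author's computation: the function names are ours, and the correlation between change in professions ratings and involvement is not reported in the article, so the value used for it is a hypothetical one chosen because it happens to reproduce a partial of about .31.

```python
import math


def partial_corr(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def t_for_r(r: float, n: int) -> float:
    """t statistic (df = n - 2) for testing a zero-order correlation against zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r ** 2)


# Reported in the text: r(F, change in professions) = .09; r(F, involvement) = .58.
r_f_change = 0.09
r_f_involvement = 0.58
# NOT reported in the article: hypothetical value, for illustration only.
r_change_involvement = -0.265

print(round(partial_corr(r_f_change, r_f_involvement, r_change_involvement), 2))
# t for the smallest correlation reported as significant (.29, N = 55).
print(round(t_for_r(0.29, 55), 2))
```

Under these assumptions, a small zero-order correlation can rise once involvement is held constant only if involvement relates to the two variables in opposite directions; whether that was the case in the actual data cannot be determined from the article.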

Scores on the P-F Study did not correlate 
with any dependent variables, with involvement 
uncontrolled or partialed out, nor with F Scale 
scores. Plotting of the pre- and posttreatment 
regression lines of the dependent variables upon 
F Scale scores, however, reveals a consistent 
pattern: With the exception of the ratings of 
professions, upon which derogation occurred, 
persons high on authoritarianism were high on 
pretreatment dependent variable scores and 
became still higher after treatment (Figure 1). 
Persons low in authoritarianism remained low on 
the dependent variable or became still lower. 

The F scores on the ordinates of the graphs 
in Figure 1 do not represent actual scores of 
subjects, but are hypothetical; they were 
selected for convenience in presenting the figure. 
The only figural change resulting from their use 
is in the graph for self-concept. In this case, 
high authoritarians maintained their self-concept 
at about the same level while low authoritarians 
dropped in their self-concept to below the 
horizontal ordinate. 

It can be seen from the figure that higher 
authoritarianism is not reflected in greater nega- 
tive change toward nonauthority referents fol- 
lowing abuse by an authority, as a simple dis- 
placement hypothesis would predict. Instead, 
persons higher on this attribute were found to 
become more favorable toward these referents. 
Persons low in authoritarianism showed little 
change in ratings as a function of the treatment. 
Instead of a displacement theory, as suggested 
by Adorno et al. (1950), these results suggest 
that high authoritarianism disposes one toward 
expecting (and accepting as proper) extreme 
treatment from authority. Rather than frustra- 
tion and negative affect, it appears such treat- 
ment may result in the opposite effects for more 
authoritarian individuals. If one were inclined,
he might say these results suggest a reaction
formation.

This theoretical possibility would suggest that
persons high in authoritarianism would tolerate
more abuse from a high status than from a low
status source before hostility is generated. The
ideal test of this would be to actually vary the
status of frustrating sources. This was not done
in this study, but an attempt was made to utilize
the available data to test this possibility. Findings
such as those of Roberts and Jessor (1958)
and Thibaut and Riecken (1955) led us to
examine the relationship between authoritarianism
and response to the P-F Study cartoons
with status of the frustrating source in the
cartoons controlled. Three independent judges
selected the cartoons on which the status of
the source could be classified as clearly high
(N = 4) and as clearly low (N = 3). The distribution
of F scores was divided at the median
and differences among the obtained punitive
scores for the four conditions (high status-low F,
high status-high F, low status-low F, low status-high F)
were examined. Although not statistically
significant, the results of all four cells
were in line with expectancy. Persons lower in
authoritarianism showed no difference in reaction
to high and low status frustrators, while persons
higher in authoritarianism showed greater
hostility toward the low than high status source.
The failure of these differences to be significant
is perhaps attributable to the small number of
cartoons in which status was classifiable.
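
The median-split comparison described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's scoring procedure: the function name, the record layout, and the rule that scores equal to the median fall in the low-F group are all ours, and the four records at the end are invented purely to show the call.

```python
from statistics import mean, median


def median_split_cells(subjects):
    """Mean punitiveness score in each F-level by source-status cell.

    `subjects` is a list of dicts with keys 'f' (F Scale score),
    'high_status' and 'low_status' (punitiveness toward each kind of
    frustrating source). Scores equal to the median count as low F.
    """
    cut = median(s["f"] for s in subjects)
    cells = {
        ("low F", "high status"): [],
        ("low F", "low status"): [],
        ("high F", "high status"): [],
        ("high F", "low status"): [],
    }
    for s in subjects:
        level = "high F" if s["f"] > cut else "low F"
        cells[(level, "high status")].append(s["high_status"])
        cells[(level, "low status")].append(s["low_status"])
    return {cell: mean(scores) for cell, scores in cells.items()}


# Hypothetical records, invented for illustration; not data from the study.
example = [
    {"f": 10, "high_status": 2, "low_status": 2},
    {"f": 20, "high_status": 3, "low_status": 3},
    {"f": 30, "high_status": 1, "low_status": 4},
    {"f": 40, "high_status": 2, "low_status": 5},
]
for cell, value in median_split_cells(example).items():
    print(cell, value)
```

A median split discards information about how far each subject lies from the cut point, which may be one more reason, besides the small number of classifiable cartoons, that differences of this kind fail to reach significance.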

The fact that F scores failed to correlate with 
changes in ratings of professions, the target of 
abuse, until involvement was partialed out helps 
to clarify still further the conditions under 
which authoritarianism produces differential ef- 
fects. This result is in line with the finding of 
Harvey that on a highly relevant issue (the 
self), authoritarianism correlated negatively with 
concept change even though the source of the 
derogation was a friend. It would appear that 
the current knowledge on the effects of authori- 
tarianism could be summarized as follows. 
Authoritarianism is positively related to opinion 
change when the source of the incongruous input 
is of high status and the target concept is of 
low involvement. But it is negatively related 
to change when the source is of low status, and 
the target concept is of high involvement.


FACILITATING EFFECTS OF “EATING-WHILE-READING” ON
RESPONSIVENESS TO PERSUASIVE COMMUNICATIONS 1


IRVING L. JANIS, DONALD KAYE, AND PAUL KIRSCHNER


Yale University


This experiment was designed to test the hypothesis that food, as an extraneous
gratification accompanying exposure to a persuasive communication, will increase
acceptance, even though the donor of the food is not the source of the
communication and does not endorse it. 2 replicating experiments were carried
out with 216 male college students. In both experiments there were 3 groups
of Ss, assigned on a random basis to the following conditions, which involved
exposure to: (a) 4 persuasive communications while eating desirable food;
(b) the same 4 communications with no food present; (c) no relevant communications
(control condition). Both experiments provide confirmatory evidence,
indicating that more opinion change tends to be elicited under conditions
where the Ss are eating while reading the communications. The theoretical
implications are discussed with respect to psychological processes involved in
changing attitudes.

It is commonly assumed that people are more
likely to yield to persuasion at a time when they
are eating or drinking than at a time when they
are not engaged in any such gratifying activity.
Salesmen, business promoters, and lobbyists often
try to “soften up” their clients by inviting them
to talk things over at a restaurant or cafe. Representatives
of opposing economic or political
groups, when unable to settle their disputes
while seated formally around a conference table,
may find themselves much more amenable to
mutual influence, and hence more conciliatory,
while seated comfortably around a dinner table.

1 This experimental investigation was conducted
under the auspices of the Yale Studies in Attitude
and Communication, which is supported by a grant
from the Rockefeller Foundation.

Little systematic research has been done, as 
yet, to determine the conditions under which 
pleasant stimulation will augment the acceptance 
of persuasive communications. One might expect 
that when the communicator is the perceived 
source of the gratifying stimulation, a more 
favorable attitude toward him will ensue, which 
would tend to lower the recipient’s resistance 
to his persuasive efforts (see Hovland, Janis, & 
Kelley, 1953, pp. 19-55). But a more complicated 
situation often arises at educational symposia,
political conventions, cocktail parties, and informal
dinners where: (a) the donor (that is, the
person who is perceived as being responsible for
the gratification) is not the communicator and
(b) the donor does not endorse the persuasive 
communications that happen to be presented at 
the particular time when the recipients are being 
indulged. If a positive gain in effectiveness is 
found to occur under these conditions, where the 
gratifying activity is entirely extraneous to the 
content, source, or endorsement of the communi- 
cations, a number of important theoretical ques- 
tions will arise—questions concerning some of 
the basic processes of attitude change which will 
require systematic experimental analysis. For 
example, when eating has a facilitating effect on 
acceptance of persuasive messages, does it always 
depend entirely upon the heightened motivation 
of the recipients to conform with the donor’s 
wishes? If so, a positive outcome under non- 
endorsement conditions will be paradoxical unless 
it turns out that there is a general tendency for 
people to assume, consciously or unconsciously, 
that the donor would like them to be influenced 
by whatever communications are presented (even 
though he explicitly says that he does not endorse 
the point of view being expressed). Or does the 
extraneous gratification operate as a source of 
reinforcement independently of the recipient’s 
attitude toward the donor? If this is the case, 
we might be led to assume that the food cor- 
responds to an “unconditioned stimulus,” and its 
facilitating effects might be accounted for in 
terms of the laws of conditioning. 

The latter theoretical possibility is suggested 
by Razran’s (1940) brief research note, published 
25 years ago, in which he gave a summary 
statement of the following two experimental ob- 
servations: (a) an increase in ratings of “personal 
approval” occurred when a series of sociopolitical 
slogans were presented to experimental subjects 
while they were enjoying a free lunch and (b) a 
decrease in such ratings occurred when the 
slogans were presented while the subjects were 
being required to inhale a number of unpleasant, 
putrid odors. In his report, however, Razran 
does not mention certain important details, such 
as whether the experimenter was the donor of 
the free lunch and whether he said anything to 
the subjects about his personal attitude toward 
the slogans.

So far as the authors have been able to 
ascertain, no subsequent experiments have been 
published pertinent to checking Razran’s ob- 
servations. Nor has any published research been 
found bearing on the related questions of whether
or not (and under what limiting conditions)
extraneous pleasant or unpleasant stimulation will
affect the degree to which a recipient will accept
a series of persuasive arguments that attempt
to induce him to change a personal belief or
preference.

As a preliminary step toward reopening experimental
research on the above-mentioned
theoretical problems, the present study was
designed to investigate the alleged phenomenon
of enhanced communication effectiveness arising
from “eating-while-reading.” The research was
designed primarily to answer the following
question: If an experimenter gives the subjects
desirable food and drink but states explicitly
that the persuasive messages to be presented are
ones with which he does not necessarily agree,
will there be a significant increase in acceptance
from the gratifying activity of eating that
accompanies exposure to the communications?


METHOD AND PROCEDURE 
Experimental Design 


The basic design involved randomly assigning
subjects to two different experimental conditions. One
was a condition in which a substantial quantity of
food was offered to the subjects during the time
they were engaged in reading a series of persuasive
communications. Upon entering the experimental
room, the subjects found the experimenter
imbibing some refreshments (peanuts and Pepsi-Cola)
and they were offered the same refreshments
with the simple explanation that there [...]
on hand because “I brought some along for [...].”
The contrasting “no-food” condition was identical
in every respect except that no refreshments were
in the room at any time during the session.

The same measures of opinion change were used
in the two experimental groups and also in a
group of unexposed controls, who were included in
the study in order to obtain a base line for ascertaining
the effectiveness of each communication per
se. The subjects randomly assigned to the control
condition were given the same pre- and postcommunication
questionnaires, separated by the same
time interval as in the other two experimental conditions,
but without being exposed to any relevant
communications.


The Communications and the Opinion Measures


On the basis of extensive pretesting, we selected
four communications, each of which advocated an
unpopular point of view and had been found to
be capable of inducing a significant amount of
opinion change. These communications were attributed
to fictitious authors who were described as
journalists or news commentators. The main conclusions,
all of which involved quantitative estimations
or preferences about future events, were as
follows:

1. It will be more than 25 years before satisfactory
progress can be expected in the search
for a cure for cancer.

2. The United States Armed Forces do not need
additional men and can be reduced to less than
85% of their present strength.

3. A round-trip expedition to the moon will be
achieved within the next decade.2

4. Within the next 3 years, three-dimensional
films will replace two-dimensional films in practically
all movie theaters.


In order to assess opinion changes, four key
questions were included in both the pre- and the
postcommunication questionnaires, each of which
asked the subject to express his opinion in the
form of a quantitative estimate (for example, “How
many years do you think it will be before an
extremely effective cure is found for cancer so that
cancer will no longer be a major cause of death?
About ____ years.”)


Experiments I and II: Similarities and Differences


The same experimental design, described above,
was used in two separate experiments, during successive
semesters at the same college. In all essential
features the first (Experiment I) was identical with
the second (Experiment II) in that exactly the same
experimental variations were used along with the
same instructions, the same communications, and
the same pre- and postcommunication questionnaires.
But the two experiments differed in several minor
ways. The main difference was that in Experiment I
the time interval between the precommunication
questionnaire and exposure to the communications
was about 2 months; whereas in Experiment II
the precommunication questionnaire was given at the
beginning of the experimental session, immediately
preceding the communications.

In Experiment I, the initial questionnaire was
administered in regular undergraduate class sessions.
It was introduced as a “survey of student opinions”
and the key questions were embedded among numerous
other questions on a variety of other controversial
issues. After a period of 2 months, the
subjects were contacted by telephone and asked to
serve as volunteers for a study on reading preferences.
The vast majority volunteered and each subject
was seen in a private interview session, at the
beginning of which he was randomly assigned to
the “food with communication” condition or the
“no food with communication” condition or the
unexposed control condition. After answering the
final set of postcommunication questions, each subject
was briefly interviewed concerning his reactions
to the experimental situation.

2 This study was carried out before the major
developments in space flights had occurred, at a
time when few people were optimistic about the
rate of technical progress in this field. In response
to the moon-flight question on the initial questionnaire,
almost all the students gave estimates of 10
years or more before a successful round-trip flight
could be expected.

In Experiment II, the same essential procedures 
were used except for the fact that the precom- 
munication questionnaire was given at the beginning 
of the experimental session. Another minor differ- 
ence was that the unexposed controls were given 
some extracts from a popular magazine on irrelevant 
topics, which took approximately the same reading 
time as the four persuasive communications. More- 
over, unlike the unexposed controls in Experiment I, 
those in Experiment II were given the same food in 
the same way as in the main experimental condition, 
so that they too were eating while reading the 
(irrelevant) articles. 

In addition to the three conditions that were set 
up to replicate the essential features of Experiment I, 
a fourth experimental condition was introduced in 
Experiment II in order to investigate a subsidiary 
problem, namely, the effects of extraneous unpleasant 
stimuli. The fourth experimental group, while reading 
the four persuasive communications, was exposed to 
an unpleasant odor (produced by a hidden bottle 
of butyric acid), for which the experimenter dis- 
claimed any responsibility. 

In both experiments, the experimenter explained 
that the purpose was to assess the students’ reading 
preferences. He asserted that he did not endorse the 
communications and casually mentioned that he hap- 
pened to agree with certain of the ideas expressed 
and not with others (without specifying which). 
He asked the subjects to read the articles as though 
they were at home reading a popular magazine. In
line with the alleged purpose, the postcommunication 
questionnaire in both Experiments I and II included 
20 filler questions asking for interest ratings of the 
articles (for example, ratings of how much interest 
they would expect the average college student to 
have in each topic). 


Subjects 


A total of 216 Yale undergraduate students were 
used in the two experiments. In Experiment I, 35 
men were in the unexposed control group, 32 in the 
“no food with communication” condition, and 33 in 
the “food with communication” condition. In Experi- 
ment II, the corresponding numbers were 23, 31, 
and 31, respectively. There were also 31 subjects in 
the fourth experimental group exposed to the 
“unpleasant” condition. 


RESULTS 


In both experiments, observations of the sub- 
jects’ eating behavior in the “food” condition 
showed that every one of them ate at least one 
handful of peanuts and drank at least one-half 
glass of the soft drink. The main findings con- 
cerning the effects of eating desirable food on 
the acceptance of the four persuasive communi- 
cations are shown in Table 1. In general, the 
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TABLE 1 
BY EXPOSURE TO Four PERSUASIVE COMMUNICATIONS 
SE oer Tao Dore Conprtions: “Foon” versus “No Foop” } 
GIVEN BY THE EXPERIMENTERS p 
% opinion change f 
i f 
Communication Experiment I Experiment I Rone deta mon 
topic 
Food No food Food 
Fp food, ands) nes (= 31) (W=03) (N = 6h) | 
1. Cure for cancer 
Positive change 68.7 81.8 80.7 93.5 74.6 sra 
Negative change 21.8 12.1 3.2 0.0 12.7 oa 
No change 9.5 6.1 16.1 6.5 12.7 . o 
Total 100.0 100.0 100.0 100.0 100.0 100.0 
Net change 46.9 69.7 77.5 93.5 61.9 81.1 j 
ee ee Sa A EAE 
$ =.11 <.10 <.05 
2. Preferred size of 
ma States 
Armed Forces 
Positiye change 65.6 81.8 29.0 51.6 47.6 oe 
Negative change 9.4 0.0 0.0 0.0 4.8 ae 
No change 25.0 18.2 71.0 48.4 47.6 3 D 
Total 100.0 100.0 100.0 100.0 100.0 ee 
Net change 56.2 81.8 29.0 51.6 42.8 67. 
— a) eee ui, uo ~— 
bd <.05 <.05 <01 
3. Round trip to 
moon a 
Positive change ji 75.9 48.4 58.0 50.8 ei 
Negative change 21.9 12.1 19.4 12.9 20.6 03 ji 
No change 24.9 12.0 32.2 29.1 28.6 00 
Total 100.0 100.0 100.0 100.0 100.0 wy 
Net change 31.3 63.8 29.0 45,1 30.2 54, 
nee Se — ~— 
2 =.05 =.20 <.05 
4, Three dimen- 
sional movies 166 
Positive change 68.7 75.9 71.5 94 
Negative change 21.8 12.1 11.1 200m 
No change 9.5 12.0 17.4 1 a 
Total 100.0 100.0 100.0 
Net change 46.9 63.8 60.4 61. 
eee ee 
p To 


results indicate that “eating-while-reading” has 
a facilitating effect on the amount of opinion 
change. In Experiment I, the differences between 
the food and no-food conditions are consistently 
in the predicted direction for all four communi- 
cations, two of which are significant at the .05 
level. (All p values are one-tailed and were 
obtained on the basis of the formula for assess- 
ing the difference between two net percentage 
changes, given by Hovland, Lumsdaine, & Shef- 
field, 1949.) The results for Experiment II show 
differences in the same direction for three of the 
four communications, two of which are significant 
at the .10 level. There is a very small, non- 
significant difference in the reverse direction on 
the fourth communication. R 


The p values based on the combined data from
both experiments, shown in the last column of
Table 1, can be regarded as a satisfactory sum-
mary of the overall outcome inasmuch as: (a)
the numbers of cases in each experiment are
almost equal; and (b) the two experiments dif-
fered only in minor features that are irrelevant
to the main comparison under investigation. The
combined data show that all four communications
produced differences in the predicted direction
and for three of them the differences are large
enough to be statistically significant. Thus, the
results support the conclusion that, in general,
the extraneous gratification of eating while read-
ing a series of persuasive communications tends
to increase their effectiveness.

That each communication was effective in
inducing a significant degree of opinion change,
whether presented under food or no-food condi-
tions, is indicated by the comparative data from
the unexposed controls. In both experiments, the
control group showed very slight positive
changes, if any, on each of the four key questions
and the amount of change was always signifi-
cantly less than the corresponding net change
shown by the food and no-food experimental
groups. There were no consistent differences
between the control group in Experiment I and
the one in Experiment II, which indicates that
the different time intervals between the before
and after measures and the other minor pro-
cedural differences between the two experiments
had no direct effect on the opinion measures.

The condition of unpleasant stimulation intro-
duced into Experiment II had no observable ef-
fect on the amount of opinion change. The net
changes obtained from the group exposed to the
foul odor (N = 31) were as follows: cancer cure,
67.1%; size of armed forces, 25.8%; round trip
to moon, 38.7%; three-dimensional movies,
64.5%. These values differ only very slightly
from those obtained from the group exposed to
the no-food condition in Experiment II (see
Table 1); none of the differences are large
enough to approach statistical significance. As
expected, however, all the net changes for the
unpleasant odor condition are smaller than those
for the food condition and in two of the
four instances the differences are statistically
significant at beyond the .05 level.


In all but one instance, the net change shown
by the controls was not significantly dif-
ferent from zero. The one exception occurred in the
control group in Experiment I with respect to the
first issue (cancer cure), on which a significant net
change of —34% was found. This change, however,
was in the reverse direction from that advocated by
the communication (probably as a consequence of
publicity concerning new advances
that appeared in the newspapers
during the months between the before and after
measures). Thus, on this item, as well as on
the other three, the control group showed signifi-
cantly less change in the expected direction than
the experimental groups.
Analysis of responses to the precommunication
questionnaire from both experiments showed that
initially, on each of the four key opinion questions,
there were only very slight, nonsignificant differences
among the experimental and control groups. None of
the differences in Table 1 and none of the other observed
differences in amount of opinion change are at-
tributable to initial differences.

Discussion


Our finding that the extraneous gratifying 
activity of eating tended to increase the degree 
to which the accompanying persuasive messages 
were accepted may prove to have important 
implications for the psychology of attitude 
change, especially if subsequent research shows 
that the gains tend to be persistent, giving rise 
to sustained modifications of personal beliefs or 
preferences. Since the control group in Experi- 
ment II (which received food along with ir- 
relevant communications) showed net opinion 
changes that were practically zero and were 
significantly less than those shown by the main 
experimental group, the food alone appears to 
have had no direct effect on any of the opinion 
measures. Hence the observed outcome seems to 
implicate psychological processes involved in the 
acceptance of persuasive influences. 

Our results on the positive effects of food are 
similar to Razran’s (1940) findings on the in- 
crease in favorable ratings of sociopolitical 
slogans induced by a free lunch. Razran has 
indicated that he regards his observations as 
evidence of Pavlovian conditioning, resulting 
from the contiguity of the conditioned stimuli 
(the slogans) and the unconditioned pleasant 
stimuli (food). Before accepting any such inter- 
pretation, however, further investigations are 
needed to check systematically on the possibility 
that the change in acceptability is brought about 
by creating a more favorable attitude toward 
the donor. We attempted to minimize this pos- 
sibility in both Experiments I and II by having 
the experimenter give the subjects an intro- 
ductory explanation in which he clearly stated 
that he was not sponsoring the persuasive com- 
munications. Despite this attempt, however, the 
subjects may have ignored or forgotten his re- 
marks and assumed that he was sponsoring them. 
We have no evidence bearing directly on this 
matter, but we did note that in the informal 
interviews conducted at the end of each experi- 
mental session, many more favorable comments 
about the experimenter were made by the sub- 
jects who had been in the food condition than 
by those who had been in the no-food condition. 

Our failure to confirm Razran’s findings on 
the negative effects of unpleasant stimulation 
might be accounted for in terms of attitude 
toward the experimenter. In Razran’s experiment, 
the experimenter “required” the subjects to 
sniff the putrid odors, and hence he might have 
been directly blamed for the unpleasant stimula-
tion; whereas in our Experiment II, the un-
pleasant odor was presented as an accidental
occurrence for which the experimenter was not
responsible. Further experimental analysis is
obviously needed to determine if the effects of 
pleasant and unpleasant stimulation observed in 
our experiment are dependent upon whether or 
not the experimenter is perceived as the causal 
agent. 

The fact that the experimenter himself par- 
ticipated in eating the food might have influ- 
enced the subjects’ perceptions of the general 
atmosphere of the reading session and hence 
needs to be investigated as a possible variable, 
independently of the subjects’ food consumption. 
The limiting conditions for positive effects from 
“eating-while-reading” also require systematic in- 
vestigation, particularly in relation to unpleasant 
interpersonal stimuli, such as those provoking 
embarrassment, outbreaks of hostility, or other 
forms of emotional tension that could counter- 
act the positive atmosphere created by the 
availability of desirable food. 

It is also important to find out whether varia- 
tions in the experimenter’s endorsement of the 
communications play a crucial role in determining 
the facilitating effects of the proffered food. For
example, if subsequent research shows that the 
experimenter’s positive versus negative endorse- 
ments make a difference, then an explanation 
in terms of increased motivation to please the 
donor will be favored, rather than a simple 
conditioning mechanism, and a more complicated 
explanation will be required to account for the 
positive effects obtained under conditions where
the experimenter explicitly detaches himself from
sponsorship of the communications.* These impli-
cations are mentioned to illustrate the new lines
of research suggested by comparing the results
from the present experiment with those from
Razran’s earlier study.


* The potential importance of positive versus nega-
tive endorsement by the experimenter as an inter-
acting variable was suggested by some unexpected
results obtained in a pilot study by Dabbs and
Janis, which was carried out as a preliminary step
toward replicating the present experiment under
conditions where the experimenter indicates that he
personally disagrees with the persuasive communica-
tions. The pilot study results led us to carry out a
new experiment in which we compared the effects
of eating-while-reading under two different endorse-
ment conditions (the experimenter agreeing or dis-
agreeing with the communications). A report on
the effects of the interacting variables, as revealed
by the data from the Dabbs and Janis experiment,
is currently being prepared for publication.


REFERENCES


Hovland, C. I., Janis, I. L., & Kelley, H. H. Com-
munication and persuasion. New Haven: Yale
Univer. Press, 1953.

Hovland, C. I., Lumsdaine, A. A., & Sheffield,
F. D. Experiments on mass communication. Prince-
ton: Princeton Univer. Press, 1949.

Razran, G. H. S. Conditioned response changes
in rating and appraising sociopolitical slogans.
Psychological Bulletin, 1940, 37, 481.


(Early publication received July 21, 1964)


ERRATUM


In Table 2 (page 311) of the article “Cooperation and Competition
in Means-Interdependent Triads,” by Bertram H. Raven and H.
Todd Eachus (Journal of Abnormal and Social Psychology, 1963,
67, 307-316), the interaction effect was actually not significant. The
A X B row of the table should indicate that the mean sum of squares
is 14.40 and the F value is .40, with no indication of significance. This
change is in no way relevant to the substantive findings of the article,
since there had been no expectation of an interaction effect. However,
in the interests of accuracy, the last two sentences on page 310, which
refer to the significance of the interaction, should be omitted.


Journal of Personality and Social Psychology
1965, Vol. 1, No. 2, 187-188


The findings of the APA’s Project on Scientific Information Exchange in Psychology call attention
to the difficulties of the scientist in establishing and maintaining contact with current work relevant
to his interests. Accordingly, the Project is sponsoring a series of changes during the next 12-15
months which are designed to enhance scientific communication. These changes are planned as
limited trials. Moreover, they are wholly supported by the Project and hence do not cut into our
allotment.

The Editor has agreed to make possible a test of one of the changes; i.e., the titles and authors
of accepted papers will be listed following the last article in the journal. Such listing will make it
possible for our readers to become aware of research many months in advance of its journal
publication. At the end of the year’s trial the results of the trial will be evaluated and further
consideration will be given at that time as to whether the service is worth continuing.

This listing plus that published in the January 1965 issue are the entire backlog of manuscripts
accepted through March 1964 by this journal.

Ego Involvement and the Absolute Judgment of Attitude Statements: Charles D. Ward*: Department of Psy-

Achievement, Nurturance, and Competition in Male and Female Triads: Imat R. Amidjaja and W. Edgar

versity of California, Berkeley 4, California.

A an Baron*: Department of Psychology, Wayne State University, Detroit, Michigan 48202.

Gro man* and Lester Luborsky: 9 Ellis Drive, Syosset, New York.

Group Performance as a Function of Size, Structure, and Task Difficulty: J. O. Morrissette,* S. A. Switzer, and

sity of Kansas, Lawrence, Kansas 66045.

The Relationship between Manifest Anxiety and Two Indices of Autonomic Response to Stress: Edward S. Kat-
kin*: Department of Psychology, State University of New York, Buffalo, New York 14214.

The Discriminability of REM and NREM Reports: Lawrence J. Monroe,* Allan Rechtschaffen, David Foulkes,
and Judith Jensen: Department of Psychiatry, University of Chicago, 950 East 59th Street, Chicago, Illi-
nois 60637.

The Similarity, Compatibility, or Incompatibility of Needs in Interpersonal Selection: Joseph F. Rychlak*: De-
partment of Psychology, St. Louis University, 221 No. Grand Blvd., St. Louis 3, Missouri.

Tolerance for Unambiguous Feedback: Solomon E. Feldman* and Joy K. Rice: Department of Psychology,
University of Wisconsin, 600 N. Park, Madison 6, Wisconsin.

Effects of Success and Failure on Level of Aspiration as Related to Achievement Motives: Robert W. Moulton*:

Performance Expectancy as a Determinant of Actual Performance: Alan S. Waterman* and LeRoy H. Ford,
Jr.: Department of Psychology, State University of New York, Buffalo, New York 14214.

Others’ Self-Evaluations and Interaction Anticipation as Determinants of Self-Presentation: Kenneth J. Gergen*
and Barbara Wishnov: Department of Social Relations, Harvard University, Cambridge, Massachusetts

Effect of Social Isolation and Physical Confinement on Motor and Sexual Behavior of Guinea Pigs: Helene D.

Some Aspects of Observed Aggression: Leonard Berkowitz*: Department of Psychology, University of Wiscon-
sin, Madison 6, Wisconsin.

Effects of Apparent Pain in Others on Observer's Reaction Time: Vincent Di Lollo* and Seymour M. Berger:
Department of Psychology, Indiana University, Bloomington, Indiana 47405.

Authoritarianism, Displaced Aggression, and Social Status of the Target: Ralph Epstein*: Department of
Psychology, Wayne State University, Detroit, Michigan 48202.

Internal versus External Control of Reinforcement and Decision Time: Julian B. Rotter* and Ray C. Mulry


Rowland and Young found that hypnotized Ss were willing to carry out such
apparently antisocial actions as grasping a dangerous reptile, plunging their 
hand into concentrated acid, and throwing the acid at an assistant. However, 
only informal attempts were made to show that the Ss perceived the acts as 
dangerous, or to demonstrate that the requested behavior exceeded the broad 
limits of social control implicit in the context of a psychological experiment. 
The experiment conducted by Young was replicated exactly, and his results 
confirmed. However, nonhypnotizable Ss simulating hypnosis to a “blind” E, 
and even normal, waking control Ss, complied with the same requests. Ss in- 
variably reported they were convinced the activities were safe because they 
were participating in research conducted by competent, responsible scientists.
It is concluded that the tasks were perceived by Ss as being within the limits of 
legitimate requests made in an experimental context. The present study does 
not answer the question whether antisocial behavior can be elicited under 
hypnosis. However, it demonstrates the nature of control groups essential for 
a valid test of the antisocial hypothesis, and illustrates the broad range of 
behavior legitimized in an experimental context. 


SOCIAL CONTROL IN THE PSYCHOLOGICAL 
EXPERIMENT 


1 This study was conducted at the University of
Sydney, Australia, during a visit by the senior
author, June-August 1960. It was supported in part
by Research Grant AF-AFOSR-88-63 from the Air
Force Office of Scientific Research and by a grant
from the Human Ecology Fund.

We wish to thank A. G. Hammer, then acting
head of the Department of Psychology, for his
cooperation with the use of departmental facilities.
We wish to express our appreciation to the Depart-
ment of Chemistry and Biochemistry, University of
Sydney, and the Radiology Department of the
Royal Prince Alfred Hospital, Sydney, for donating
equipment. We are indebted to the Department of
[illegible], University of Sydney, for loaning the
[illegible]. We are particularly indebted to
Fred Ma[illegible]hews, herpetologist, for his willingness to
[illegible] and handle the venomous reptile through-
out the study. Without his generous help this
study would not have been completed.

We are appreciative of the valuable comments
made in the preparation of this report by Peter
Field, Lawrence A. Gustafson, Ulric Neisser,
Donald N. O'Connell, Emily C. Orne, and Ronald
E. Shor. We also wish to thank Eleanor DeRubeis
for valuable editorial assistance.

2 At the time the experiment was conducted the junior
author was supported by a University of Sydney
Research Studentship in the Department of Psy-
chology.


There has been increasing awareness by
behavioral scientists that the subject in an
experimental investigation is not a passive
entity. Experimental evidence (Orne, 1959; 
Orne & Scheibe, 1964) has demonstrated that 
the subject takes an active role in interpreting 
the nature of the investigation and makes 
implicit assumptions about the hypotheses 
being investigated which influence his per- 
formance in the experimental situation. Nor 
is an experimenter free from the influence of 
his own investment in the hypotheses he is 
investigating. In a series of studies Rosenthal 
has shown that experimenters who have dif- 
ferent hypotheses about the outcome of a 
particular experiment may obtain results 
which are congruent with their hypotheses 
(for example, Rosenthal, 1964; Rosenthal & 
Fode, 1963). Such studies imply that it is 
necessary to consider the particular nature 
of the special interpersonal interaction which 
exists between the subject and the experi- 
menter in psychological experiments. 

Orne (1960, 1962) has emphasized that 
the experimental context legitimizes a very 
broad range of behavioral requests. Subjects 
have implicit faith that experimenters are 
responsible people, that they will not be asked 
to carry out tasks which are devoid of mean-
ing, and that regardless of appearances they
will not be permitted to suffer any harm be-
cause of obvious social sanctions.

In a series of informal experiments in our 
laboratory, it has been impossible to devise a 
task which the subject perceives as completely 
“meaningless” within the context of an ex- 
periment. For example, subjects were con- 
fronted with a stack of paper, each page 
containing rows of random digits. The experi-
menter instructed subjects to continue adding
the rows of numbers successively, and after 
accurately completing each page, to tear it 
into a minimum of 32 pieces. Although sub- 
jects were given no reason to justify the task, 
they continued this apparently meaningless 
endeavor beyond the tolerance limits of the 
experimenters (Orne, 1962). 

Frank (1944) has reported some informal 
experiments in which subjects continued 
meaningless and impossible tasks, including 
trying to balance a marble on a small steel 
ball and transferring spilled mercury to a 
small bottle with a wooden paddle, even when 
an assistant tried to prevent them from trying 
to complete the tasks. No justification was 
given for performing the tasks other than 
that it was an experiment. 

In the same study, Frank also reported 
that subjects continued eating several un- 
savory, unsalted soda crackers for time 
periods in excess of what would seem reason- 
able for such an unpleasant task, and longer 
than subjects who were told they could stop 
eating them as soon as they wished. Shor 
(1962) has reported that subjects were willing 
to accept extremely high levels of electric 
shock when requested to select a level of 
intensity which was as high as they could 
tolerate for experimental purposes. Milgram 
(1963) has shown that subjects continue to 
administer what they believe are extremely 
high levels of electric shock, exceeding ap- 
parently dangerous levels, to another “sub- 
ject” in the context of a learning experiment. 

The limits of boredom, tolerance, pain, and 
fatigue which are accepted as reasonable re- 
quests within an experimental situation seem 
extremely broad. However, the actual range 
of social and behavioral control legitimized 
by the special contract implicit in the sub-
ject’s agreement to participate in a psy-
chological experiment has received few ex-
plicit tests.


ANTISOCIAL BEHAVIOR AND HYPNOSIS


It is generally considered that a hypnotized
subject relinquishes considerable social and
behavioral control to the hypnotist. A subject
frequently reports that he felt compelled to


by the hypnotist. This apparent increase in
the amount of social control relinquished by
the hypnotized subject to the hypnotist has
raised the unresolved question whether a


perform apparently antisocial acts or be-
havior which is perceived as injurious or
dangerous to self or others.
(1943), Rowland (1939), Weitzenhoffer
(1949), Wells (1941), Wolberg (1945), and
Young (1952) have stated that, provided
adequate techniques are used, hypnotized
subjects may be compelled to carry out ap-
parently antisocial actions, while Erickson
(1939), Meares (1960), and Schilder and
Kauders (1927) have disagreed with this
viewpoint. The extensive literature present-
ing the conflicting viewpoints has been re-
viewed elsewhere (Barber, 1961; Orne,
1960; Weitzenhoffer, 1953). The present
investigation is concerned with two studies
frequently cited as evidence that hypnosis
can be used to induce antisocial behavior.

Rowland (1939). Two deeply hypnotized
subjects were asked to reach through the
window of a box and pick up a large, active
diamondback rattlesnake. This request was
rationalized to them by suggesting that the
snake was a coil of rope. One hypnotized
subject immediately complied, but was pre-
vented from handling the snake by a pane of
invisible glass. The other subject came out of
the hypnotic state and refused to continue
with the experiment. Another two hypnotized
subjects attempted to grasp the snake, even
when no attempt was made to delude them
about what it was. Similarly, two hyp-
notized subjects were requested to throw con-
centrated sulphuric acid, which they had
been told was very dangerous, at the face of
an assistant. The two subjects complied with
this request.




By way of control, forty-two persons, of every
age and degree of sophistication, were asked to 
come to the laboratory and pick up the snake... . 
With one exception all the persons were not only 
badly frightened at the appearance of the snake, 
but would not come close to the box .. . [Row- 
land, 1939, p. 116]. 


Young (1952). In a replication of Row-
land’s study, Young asked eight deeply hyp-
notized subjects to carry out similar tasks. 
Young reported, 


The results show that seven of the eight subjects 
would enter into a situation which unhypnotized
observers shrank from, the subjects carrying out
suggestions to handle snakes and throw nitric acid 
under conditions from which they themselves re- 
coiled in the waking state [p. 405]. 


One major aim of the present study was 
to confirm these results by replicating in 
exact detail the procedures outlined by 
Young (1952). It is apparent from the ac- 
counts of both Rowland and Young that 
strong pressure was placed upon the hyp- 
notized subjects to comply with the requested 
antisocial acts. However, Rowland did not 
exert similar pressure to comply when testing 
the independent, informal waking control 
group, nor did Young confront his hyp-
notized subjects with the same type of pres-
sure in the subsequent waking condition.
Consequently, it is doubtful if the informal
controls used by these investigators provide
any answer to the fundamental question
being considered: does the degree of social
control which occurs under hypnosis actually
exceed the amount of social and behavioral 


control already existing in the experimental 
situation? 


SOCIAL CONTROL, ANTISOCIAL BEHAVIOR,
AND HYPNOSIS


The basic question may be formally stated:
Does the degree of social and behavioral con-
trol under hypnosis exceed that which is
legitimized by the special social and be-
havioral control implicit in the experimental
situation?

In order to determine that subjects have
been compelled to carry out any
actions which they would not have carried
out without the intervention of hypnosis it
must be shown that:

1. Subjects in hypnosis carry out actions 
which are not performed by nonhypnotized 
control subjects. 

2. Control subjects must be treated in an 
identical fashion to hypnotized subjects; 
both in regard to explicit instructions as well 
as implicit cues. 

If these actions are to be designated as 
antisocial or self-destructive it must be shown 
that they are perceived as such by the sub- 
jects, i.e., truly dangerous or harmful to 
themselves or others. The implicit cues are 
of crucial importance. A subject is aware of 
certain realities imposed by the experimental 
situation. It is as clear to a subject as it is to 
any scientist that no reputable investigator 
can risk injuring a subject during the course 
of an experiment. A subject knows that an 
experimenter will outline in advance any 
possible specific and deliberate danger which 
could be associated with his actual par- 
ticipation in a study. Consequently, any re- 
quested behavior which appears to a subject 
to be dangerous at face value may be re- 
interpreted in the context of a laboratory 
situation. In spite of the apparent objective 
danger of a task it may nonetheless be per- 
ceived to be harmless because the subject 
realizes that necessary precautions will be 
taken to avoid possible injury to him. If an 
apparently dangerous task is requested of a 
subject during an experiment the subject’s 
compliance, or refusal, may depend on 
whether he perceives that he is expected, or 
is not expected, to carry out the task. It is 
particularly vital in an experiment which 
depends on a contrived situation to determine 
what the subject, in different groups, per- 
ceives about the experimental situation and 
what is implicitly communicated to the sub- 
ject within different groups. 

Even though the experimenter is extremely 
careful to treat all groups alike, subtle and 
unintentional cues may be differentially com- 
municated by him to subjects in different 
experimental groups, particularly when the 
experimenter knows to which experimental 
group a specific subject belongs. These 
experimenter influences on results have been 
demonstrated in both animal (Rosenthal &
Lawson, 1963) and human research (Rosen-
thal & Fode, 1963).

It is essential, therefore, that the non- 
hypnotized control groups and the hypnotized 
subjects are treated in an identical manner. 
The real-simulating hypnosis model de- 
scribed by Orne (1959, 1960, 1962) pro- 
vides one method of making reasonably sure 
that experimental and control subjects are 
treated alike. With this procedure both 
hypnotized and control group subjects are 
run “blind” by an experimenter, because he 
is not informed, and cannot readily detect, 
which subjects are hypnotized, and which 
subjects are awake but simulating hypnosis. 
The procedure has similar aims to those of 
the “blind” designs used in the evaluation 
of new drugs in psychopharmacology. 
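The blinding logic of the real-simulator design can be illustrated with a brief sketch. Everything in it is an assumption made for illustration: the function name, the neutral subject codes, and the randomized run order are hypothetical and are not described in the article. It shows only the general idea that the experimenter who runs the session works from a sheet without condition labels, while the key linking codes to conditions is held by someone else until testing is complete.

```python
import random


def blind_run_sheet(real_subjects, simulating_subjects, seed=None):
    """Return (run_sheet, sealed_key).

    run_sheet  -- what the "blind" experimenter sees: neutral code and subject only.
    sealed_key -- code -> condition, held back until the session is over.
    """
    rng = random.Random(seed)
    labelled = [(name, "real") for name in real_subjects]
    labelled += [(name, "simulating") for name in simulating_subjects]
    rng.shuffle(labelled)  # order carries no information about condition

    run_sheet, sealed_key = [], {}
    for position, (name, condition) in enumerate(labelled, start=1):
        code = f"S{position:02d}"  # neutral code, identical format for both groups
        run_sheet.append((code, name))
        sealed_key[code] = condition
    return run_sheet, sealed_key


# Six real and six simulating subjects, as in the groups described in the study.
real = [f"hypnotizable-{i}" for i in range(1, 7)]
simulators = [f"nonhypnotizable-{i}" for i in range(1, 7)]
run_sheet, sealed_key = blind_run_sheet(real, simulators, seed=1)
```

In practice the names on the run sheet would also have to be uninformative; the descriptive labels above are used only so the example can be checked.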

The real-simulator design has been adopted 
as the main control procedure in the present 
study. If both real and simulating subjects 
carry out behavior which is apparently anti- 
social or self-destructive, it would seem in- 
appropriate to conclude that the hypnotized 
group was compelled to carry out these 
actions because of the use of hypnosis: 
rather, the behavior of both groups could be 
interpreted more parsimoniously as an ap- 
propriate response to the existing cues in the 
experimental situation. 

In addition to the use of the crucial simu- 
lating control group, the content of social and 
behavioral control existing within the experi- 
mental situation was explored by including 
a normal waking control group. This inde- 
pendent waking control group was treated, 
as far as possible, in the same way as the 
hypnotized and simulating groups. An at- 
tempt was also made to demonstrate that it 
is possible to communicate to a subject that 
he should fail to carry out the apparently 
dangerous acts. The procedure adopted was 
the same as our understanding of the informal 
controls used by Rowland (1939). In addi- 
tion, each hypnotized subject was retested 
in the waking condition, as his own control, 
similar to the control procedure of Young 
(1952). A parallel procedure using subjects 
with no previous experience with hypnosis 
was used as an additional waking control 


group.


PROCEDURE


Selection of Subjects * 


Hypnosis group. Several volunteer undergraduate
students were individually tested for susceptibility
to hypnosis. After at least two hour-long screening
sessions, six subjects were selected to participate
in the study as the real hypnosis group. These
subjects could readily achieve deep somnambulistic
hypnosis manifesting such typical phenomena as
catalepsy and rigidities, positive and negative hal-
lucinations, posthypnotic suggestions, and complete
posthypnotic amnesia.

Simulating hypnosis group. Another six subjects
who showed no hypnotic responses during at least
two extensive, individual hypnotic induction sessions
participated in the study as simulators. Each simu-
lating subject was told that he was participating
in research investigating the nature of hypnosis, but
he was not told that the study involved allegedly
antisocial behavior. He was told that his task was to
try to fool an experienced hypnotist, in an experi-
mental setting, by pretending to be deeply hypno-
tized. Even though he would probably not experi-
ence or produce any real hypnotic response, he was
to act as if things were happening just as the
hypnotist said they would. Although the hypnotist
was described as an authority on the subject, it
was stressed that it was possible, though very
difficult, for the subject to fool the hypnotist.

No special training or instruction about how to
simulate was given to the subjects in this group,
nor were they told what phenomena would be
involved in the experimental session. However, each
simulator had individually watched a good hypnotic


3 Several procedural details have been deposited
with the American Documentation Institute. The
material contains additional detail about selection of
subjects, choice of a venomous reptile, the effect
of the nitric acid on the coin and safety procedures
protecting the subject, description of the procedure
for substituting the colored solution for the acid,
and a scale diagram of the apparatus. Order Docu-
ment No. 8203 from ADI Auxiliary Publications
Project, Photoduplication Service, Library of Con-
gress, Washington, D. C. 20540. Remit in advance
$1.25 for microfilm or $1.25 for photocopies and
make checks payable to: Chief, Photoduplication
Service, Library of Congress.

4 One experimenter (FJE), who selected sub-
jects, trained simulators, was assistant and observer
during the study, subsequently confirmed that
hypnotized subjects were amnesic for the [illegible],
that simulators experienced no real effects of the
hypnotic induction and suggestions, and conducted
postexperimental inquiries. The second experimenter
(MTO) induced hypnosis for the main session, con-
ducted the five groups in the experimental procedure,
and at the completion of each subject’s experimental
participation, conducted a further independent in-
quiry after being informed which were the
simulating subjects.


subject being hypnotized, demonstrating motor and
challenge suggestions, simple positive hallucinations,
amnesia, and posthypnotic suggestion.

Additional control groups. Six subjects, who did
not meet the criteria for either good hypnotic sub-
jects or simulators, formed one of the additional
control groups. The remaining two control groups
contained subjects recruited from the student popu-
lation who did not know the experiment also
involved hypnosis.


Experimental Procedure 


The experiment follows the procedure described 
by Young (1952) as closely as possible.⁵

Hypnosis was induced in a room adjacent to the 
experimental room. After the induction of hypnosis 
the subject was led, with eyes closed, into the 
experimental room and seated in front of a large
wooden box which stood on a table. 

The box was divided into two compartments by 
a removable partition. The subject could easily 
reach into the front compartment through a window. 
A sheet of plate glass could be raised or lowered 
silently across the window, sliding along felted 
grooves. The compartment was illuminated by two 
concealed lights, so that no light was reflected
directly onto the surface of the glass. The two
lights were the only source of illumination in the
room. From the position of the subject it was not
possible to detect the presence of the glass.⁶


5 We are indebted to P. C. Young for his kindness
in providing an unpublished manuscript describing
apparatus and procedure in detail and for clarifying
some details in private communications (1959).

6 The invisibility of the glass was attested by the
following: (a) Inability of the experimenters to
detect if the glass was in position unless they had
prior knowledge of its position. (b) The witness
of bruised knuckles of several members of the
Department of Psychology whose curiosity got the
better of them. (c) The expressed concern of a
graduate student when watching a subject reach
for the snake. Although he knew about the glass
and although he had been watching closely,
he did not see the screen being lowered. (d) Only
one subject in a subsequent inquiry reported detecting
the glass. This happened because of the forgetfulness
of the experimenters who failed to clean
the window before this particular subject was tested.
(e) Subjects expressed spontaneous surprise if,
and when, they discovered the glass in their attempts
to grasp the snake, often making exploratory move-
ments across the surface of the window.

Erickson (1962) has argued that visual
acuity may increase under hypnosis; hence, hyp-
notized subjects would detect the presence of the
glass. This hypothesis would be relevant if clear-
cut differences were found between the hypnotized
group and other crucial groups in the willingness to
comply with requests to perform the antisocial
activities.
Behavior Requested of the Subject 


Inanimate objects. The hypnotist suggested that 
the subject would have an irresistible urge to do 
whatever the hypnotist told him. The subject was 
then asked to open his eyes, to reach into the 
compartment, to pick up and identify objects he 
saw there, and to place them in a bag held by the 
hypnotist. A two-shilling coin, then a piece of 
chalk, were placed in the compartment. All subjects 
identified and removed these objects. 

Harmless reptiles. The subject was then told that 
a harmless, live reptile would be placed into the 
compartment. A thick-tailed gecko lizard (Gymno- 
dactylus milii) was placed in the compartment. 
When the lizard had been removed, a thin, 14-inch 
long, harmless, green tree snake (Ahaetulla punc- 
tulatus) was lowered into the compartment. 

The remaining three activities described below 
objectively appeared to be dangerous or harmful, 
either to the subject himself, or to a third person 
present in the room. 

Venomous snake. An assistant wearing long, thick, 
X-ray gloves placed a 14-inch long venomous snake 
into the compartment. The reptile, popularly known
as the red-bellied black snake (Pseudechis porphy- 
riacus), is generally considered to be one of the 
two most venomous snakes found in Australia, 
capable of inflicting a mortal wound if the victim 
is not treated immediately upon being bitten. 

The hypnotist asked the subject to pick up the 
snake carefully about its middle, and place it in the 
bag. Unknown to the subject, the glass screen had 
been lowered silently across the window. If an 
attempt was made to grasp the reptile, the subject 
struck his hand firmly against the glass. 

Coin in acid. The screen, and the snake, were 
removed. The subject then watched the hypnotist 
pour fuming concentrated nitric acid (15 molar) 
in a small, shallow, Pyrex glass beaker. After watch- 
ing the acid actively dissolve a copper-alloy half- 
penny coin for approximately a minute, the subject 
was asked to remove the coin from the beaker as 
quickly as possible with his bare hand, and then to 
plunge both hand and coin into a nearby basin 
of soapy lukewarm water.

Throwing acid. The experimenter stood between 
the subject and the box as he carefully washed 
the subject’s hand. While the subject was distracted, 
the acid was replaced with an identical beaker 
containing a harmless, colored solution which was 
the same color as the acid. The subject was simi- 
larly distracted if he did not remove the coin, so 
that changing the solution was not observed. 

The central partition in the box was removed, 
exposing the assistant who had been responsible 
for recruiting and training the subject, sitting at 
the opposite end of the open-ended box. The 
hypnotist suggested assertively that the subject was 
very angry with the assistant for placing him into 
the unpleasant situation of the experiment. The 
subject was told he felt so angry that he would have
an irresistible urge to throw the acid firmly into
the face of the assistant, and was authoritatively told
to pick up the acid and throw it.


Postexperimental Interview 


The subject was taken back into the adjoining 
room, and hypnosis was terminated. Comprehensive 
interviews were conducted with each subject to 
allow him to express his subjective reactions to the 
experiment, and his thoughts and comments about 
the experiment. Appropriate questions were asked 
to establish that the subject had remained amnesic 
throughout the experimental session, or, in the case 
of simulators, to confirm that no trance effects 
occurred during the experiment. 


Treatment of Experimental Groups 


The procedure has been outlined as it was en- 
countered by the subjects in the hypnosis group. 
The hypnotized group of subjects was subsequently 
retested in the waking state, while still amnesic 
for the hypnosis session, and an additional four 
groups were tested. 

For clarity of exposition a detailed statement of 
the aims of testing each group will be presented 
below in the Results section. General modifications
in the experimental procedures for the respective 
waking control groups will be quite apparent. The
procedures outlined above were followed either ex- 
actly, or as approximately as the nature of the 
specific groups would allow. The major difference 
between groups was the manner and attitude of the 
experimenter, and the subtle means whereby expecta- 
tions of compliance or noncompliance with the 
apparently antisocial actions were conveyed. Dif- 
ferences in the approach of the experimenter, in 
terms of the aims and purpose of each group, will 
be discussed as the results for each group are being 
presented.

There was, of course, no difference in the treat- 
ment of the hypnotized subjects and simulating 
subjects because the hypnotist remained unaware 
of the subgroup membership of these subjects 
throughout the experiment. 
RESULTS 


The number of subjects in each group who
attempted to comply with the requested tasks
is summarized in Table 1. A detailed analy-
sis of the responses of each subject within
the various groups is presented in Table 2,
together with ratings of the degree of sub-
jective involvement in the tasks. The latter
ratings were made by the assistant who was
aware of the group membership of hypno-
tized and simulating subjects, and must be
interpreted accordingly.⁷


7 The data presented in Tables 1 and 2 are not
amenable to further statistical analysis. There is a
partial experimental dependence of the tasks be-
cause of the way in which they were, of necessity,
presented to the subject. Consequently, neither treat-
ing each antisocial action as a separate activity,
nor the alternative of tabulating the number of
acts actually attempted by each subject, is a valid
method of treating the data statistically. Even if
either alternative were possible, the analysis of
frequency data, preferably by exact probability
procedures, requires the determination of the ex-
pected frequencies of occurrence. Hypothetical ex-
pected frequencies of six subjects in any group
successfully attempting each act could be defended
as readily as hypothetical expected frequencies of
no subject in any particular group attempting each
act. Even the statistical determination of expected
frequencies from marginal entries has limited merit
in groups of this size, for this would impose a level
of precision on the data which is not warranted, as
well as limiting the possibility of finding statistically
significant differences. However, this does not re-
flect upon the size of the groups, for the experi-
mental rationale is such that the mere occurrence
of apparently antisocial behavior is significant if it
occurs in the crucial simulating control group. The
aim of the study is, in a real sense, to demonstrate
the truth of the null hypothesis: to demonstrate that
specially designed groups do not differ in
propensity to respond in a defined manner.
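
As an illustration of the footnote's remark about group size, the following Python sketch computes a two-sided exact (Fisher) probability for a single 2 x 2 comparison. It is not part of the original analysis, and the footnote itself argues that such tests are not valid for these data because the tasks are experimentally dependent. The function name `fisher_exact_two_sided` is a hypothetical helper, and the counts are used purely for illustration: the 6-versus-3 split is taken from the text (all six simulators versus three of six Group 2 subjects attempting to grasp the venomous snake), while the 6-versus-1 split is an assumed extreme case.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided exact probability for the 2 x 2 table [[a, b], [c, d]].

    Hypothetical helper for illustration; not part of the original study.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_observed = prob(a)
    smallest = max(0, col1 - row2)
    largest = min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    return sum(
        prob(x)
        for x in range(smallest, largest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


# Rows are groups; columns are (attempted, refused) for the venomous snake.
p_6_vs_3 = fisher_exact_two_sided(6, 0, 3, 3)  # simulators vs. Group 2 (from text)
p_6_vs_1 = fisher_exact_two_sided(6, 0, 1, 5)  # assumed extreme split
print(f"6 of 6 vs. 3 of 6: p = {p_6_vs_3:.3f}")
print(f"6 of 6 vs. 1 of 6: p = {p_6_vs_1:.3f}")
```

Under these assumptions, a split of six attempts against three in groups of six does not reach the conventional .05 level, which is in keeping with the footnote's point that groups of this size limit the possibility of finding statistically significant differences.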


TABLE 1
FREQUENCY OF ATTEMPTS TO CARRY OUT APPARENTLY ANTISOCIAL ACTS a

Group                                               Harmless     Venomous   Coin from   Throw acid
                                                    reptiles b   snake      acid        at assistant
1. Real hypnosis                                    5            5          5           5
1a. Retest real, as waking control (Young)          -            2          3           2
2. Waking control; press for failure (Rowland)      6            3          1           1
3. Simulating hypnosis                              6            6          6           6
4. Waking compliance control                        5            3          5           5
5. Waking control: eliminate “shaping” procedures   -            2          3           2

a Six subjects in each group; see text for description of groups.
b Includes small lizard and harmless snake. Two subjects refused both. Eliminated from procedure in waking retest of Group 1 and Group 5.
TABLE 2
ANALYSIS OF ATTEMPTS TO CARRY OUT APPARENTLY ANTISOCIAL ACTS FOR EACH SUBJECT


Subject | Sex | Harmless reptiles a | Venomous snake | Coin from acid | Throw acid at assistant
1. Real hypnosis
È, E a a 
es es es (A, T, 
F ee a ge Me Yes Yes (T) ) 
EB° F No: ER S ye 
5 SM F Yes Yes Yes (U) Yes 
1a. Retest real hypnosis subjects as waking control (Young control)
JB F — No No No 
YM F = No Yes (H) Yes (A, T 
E cP M = No Yes eae 
se f a Nes Œ) Yes (A) mee (A, T) 
SM F = Yes No No 
2. Waking control: press for failure (Rowland control)
F M Yes S Yes Yes (H, V) 
JE F Yes Yes (A) No No 
ý PE M Yes o No No 
Í; B M Yes Yes (H) No No 
TO F Yes No No No 
i F Yes No No No 
3. Simulating hypnosis
i aa $ oe gs H) Yes (A, U) ae K 
es es es 
a F Yes Yes Yes (H, U) Yes (A, T) 
U M Yes Yes Yes (V) 
z M Yes Yes (H) Yes (H,U) Yes (H) 
| M Yes Yes Yes (H, U) Yes (V) 
4. Waking compliance control
IM F Ye No Yes (H) Yes (H) 
es o es es 
a ‘| M Yes No Yes (A) Yes (H) 
wo M Yes Yes (H) Yes Yes 
$ M Yes Yes Yes Yes (V) 
J M Yes Yes Yes (A) Yes (V) 
5. Waking control: elimination of “shaping” procedures
F = Yes (H) Yes Yes (H) 
NG 
M — No Yes (H) No 
K M res Yes Yes (H) Yes 
A MA M — No No No 
MA F OR No No No 
4: F tet Yes (?)# No No 
Note.—Yes = attempt to carry out requested activity; No = refused to carry out requested activity.
a Gecko lizard and harmless snake. Procedures omitted for Groups 1a and 5 (see text).
b H = hesitant in attempting task; A = agitation and emotional involvement markedly apparent; U = unsuccessful in removing coin, but attempted to extent of placing fingers in acid; T = tentative throw, hardly splashing assistant; V = violent throw, for example, hurling solution and container.
c Subject almost hysterical at sight of lizard: remaining activities omitted. Counted failure (see Footnote 8).
d Hypnosis-group subjects retested.
e Subject reported detecting glass screen at end of experiment (see Footnote 6).


Replication of Results of Rowland (1939)
and of Young (1952)


Real Hypnosis: Group 1


Treatment. The hypnotist told the subject
he had an irresistible urge to carry out
easily and readily all suggestions given to
him. Each activity was preceded by a firm 
statement—“I want you to... .” The appar- 
ent dangers in the venomous reptile and acid 
situations were stressed. The experimenter 
conveyed by his verbal tone and nonverbal
manner and attitude that he expected compli-
ance. Further pressure and persuasion were
adopted if the subject showed hesitancy about
carrying out any of the activities.

After the experimental tasks were com- 
pleted, hypnosis was terminated. A careful 
evaluation of the validity of the suggested 
complete amnesia for the session was made, 
and depth of hypnosis was checked by testing 
a posthypnotic suggestion administered by 
the hypnotist at the completion of the 
experimental tasks. 

Results. Five of the six hypnotized subjects 
carried out all of the suggested activities.⁸
The major result of the earlier studies of 
Rowland (1939) and Young (1952) was thus 
replicated: hypnotized subjects will carry out 
apparently antisocial and injurious activities 
such as handling dangerous reptiles and 
concentrated acid. 


Hypnosis Subjects Retested as Own Control 
(Young, 1952): Group 1a


Treatment. The five subjects in the real 
hypnosis group who had carried out the 
actions returned the following day. Each sub- 
ject was asked, in the waking state, whether 
he would be willing to carry out the suggested 
activities involving the snake and acid. This 
was the control measure carried out by Young 
(1952). Although the verbal content of the 
request implied the possibility of compliance, 
it was conveyed by the experimenter’s general 
manner and nonverbal behavior that compli- 
ance with the request was not necessarily 
expected. 

Results. As found by Young (1952), and 
as expected in the present study, subjects 
generally refused to carry out the actions in 
the waking state, even though they had been 
quite willing to carry them out in the hyp- 
notic condition. However, two subjects could 


8 The remaining subject became emotionally dis- 
turbed at the sight of the harmless lizard and it 
was not possible to administer the remaining tasks. 
As the subject had passed the selection criteria, it 
was decided not to replace her. It is noted that she 
was one of the two hypnotic-group subjects whose 
previous experience with the experimenters was 
minimal, and it is possible that the interpersonal 
relationships existing were not as strongly developed 
as with other subjects.

be persuaded to attempt to pick up the
venomous snake and to throw the acid at the 
assistant. Another subject removed the coin 
from the acid. 

Amnesia for the previous day’s hypnotic 
experience was again confirmed, and was then 
removed. Subjects reported that under hyp- 
nosis they felt more passive, were not par- 
ticularly concerned with the consequences of 
their actions or what safeguards existed, and 
generally were less disturbed by the situation 
than they were in the waking state. The sub- 
jects who attempted any of the activities 
claimed they were much more hesitant in the 
waking state than they had been in the 
previous hypnotic state. These reported dif- 
ferences were consistent with the observer's 
ratings of such differences between the two 
performances, summarized in Table 2. 


Waking Control Used by Rowland (1939): 
Group 2 


Treatment. As far as could be ascertained, 
the six subjects in this group were treated 
similarly to the informal control group in 
Rowland’s (1939) study. 

The experimenter implicitly communicated 
the expectation that the subject would refuse 
to carry out the requested activities. The 
apparent danger was stressed at least as 
much as it was with subjects in other groups.
The request to carry out the various actions 
was phrased more in the form of a question 
of whether he would be willing to comply 
with such obviously dangerous actions. How- 
ever, persuasion and pressure were applied 
if a subject refused to carry out any activity, 
although not as insistently as for subjects 
in the hypnotized and simulating groups. 

Results. Three of the six subjects in this 
group attempted to pick up the snake, but 
only one of them attempted to take the coin 
from the acid, and throw the acid at the 
assistant, both after considerable hesitation.
It had been expected that subjects in this
group would refuse to carry out the activi-
ties.⁹ Some subjects could be persuaded to


9 It was frequently noted in the records made
by the assistant that the experimenter exerted more
pressure with subjects in the three control groups
in which noncompliance was predicted than his
understanding of the parallel procedures of Row-




carry out the tasks. It appears that either 
the experimenter could not refrain from being 
more insistent than Rowland, or the psycho- 
logical experiment is more institutionalized 
for University of Sydney students in 1960 
than it was for University of Tulsa students 
in 1939. 


Summary of Attempted Replication of 
Studies of Rowland (1939) and 
Young (1952) 


The results obtained from the three groups 
reported in the present study are in substan- 
tial agreement with the findings reported by 
Rowland and by Young. Their results, as far 
as comparisons are relevant, have been 
confirmed, and their procedure has been 
replicated successfully. 


Limits of Social and Behavioral Control in 
the Experimental Situation 


Rowland and Young concluded from their 
data that hypnosis was instrumental in pro- 
ducing antisocial and dangerous behavior 
which could not be elicited otherwise. How- 
ever, these results cannot answer the funda- 
mental question of whether the apparently 
antisocial behavior elicited under hypnosis 
exceeds the limits of behavior which may be 
legitimized by the experimental situation per 
se. To answer this question, it is necessary
to test whether the request to carry out the
apparently antisocial and self-injurious acts
exceeds the limits of social and behavioral
control implicitly existing in the experimental
situation.


Simulators: Group 3 


Treatment. Because the experimenter did 
not know whether he was testing a hypno- 
tized or a simulating subject at any time, 
both the hypnotized and simulating groups 
of subjects were treated alike, and were 
exposed to the same demand characteristics
concerning compliance with the experiment-
er’s requests. Consequently, compliance with
the requests of the experimenter by the simu-
lators would indicate that the requested be-

land and Young warranted. The excessive persuasion
was not intended: we do not know if this ac-
counted for the unpredicted successful attempts in
these groups.
havior is within the limits of what they per-
ceived as legitimate and reasonable within 
the experimental context. Such a result would 
demonstrate that the present tasks do not 
provide a test of whether hypnosis can be
used to compel the subject to carry out 
antisocial behavior. 

Results. All six subjects who were simu- 
lating hypnosis attempted to comply with the 
apparently antisocial and injurious activities 
requested of them. They objectively at- 
tempted to carry out the various tasks at 
least as readily as the subjects who were 
actually hypnotized. 

Qualitatively, it appeared that the simu- 
lators were more hesitant than the hypnotized 
subjects about grasping the snake, less suc- 
cessful in their attempts to remove the coin 
from the acid, but more violent and less hesi- 
tant about throwing the acid solution at the 
assistant¹⁰ (see Table 2).


Waking Compliance Control: Group 4 


Treatment. There are special motivational 
aspects involved when a subject simulates 
hypnosis which are not present when a sub- 
ject is actually hypnotized. A simulator is 
motivated by two special features: he must 
attempt to please one experimenter by 
striving to deceive a second experimenter. It
is possible that these special motivations may 
increase the range of social and behavioral 
control implicit within the unique type of 
experimental situation which exists for the 
simulating subject. 

In an attempt to evaluate this possibility, 
an additional set of six subjects served as 
the usual type of control group. They were 
told that they were a normal waking control 
group in a study employing hypnotized sub- 
jects. As far as possible, they were treated 
identically to the hypnosis and the simulating 
groups; the experimenter’s manner conveyed 
an expectation that subjects would comply 
with his requests. This group differs from the 

10 For example, some simulators (and some mem- 
bers of Group 4 below) hurled both solution and 
container at the assistant. In comparison the hyp- 
nosis group, though throwing the solution, some- 
times failed to splash the assistant with it. These 
subjective differences were observed by the as- 
sistant who was aware of the group membership 
of the subjects. 
informal controls employed by Rowland (see
Group 2 above) in that these subjects were
treated as formal experimental subjects and
compliance with the requests was taken for
granted, whereas such pressure was not ap-
plied in Group 2, described above, or by
Rowland.

Results. Five subjects attempted to carry 
out the two tasks involving the acid, but two 
of these subjects refused to handle the ven- 
omous snake. The performance of this group 
was only slightly inferior to that of the hyp- 
notized and simulating groups. It would ap- 
pear that consideration of the special motiva- 
tions which existed for subjects simulating 
hypnosis is not necessarily essential to 
account for the present results. 


Elimination of “Shaping” Procedures: 
Group 5 


Treatment. It is possible that the pre- 
liminary tasks, taking inanimate objects from 
the compartment and handling harmless rep- 
tiles, could operate as a type of “shaping” 
or “conditioning” procedure, gradually lead- 
ing the subject to tasks which in themselves 
look frightening, but in the context of the 
“shaping” procedure lose much of their 
fearsome potency. 

A separate waking control group was con- 
ducted without the preliminary harmless 
tasks. These subjects were asked to partici- 
pate in a psychological experiment, but they 
were not told that they were controls in a 
study involving hypnosis. Apart from the 
elimination of the innocuous tasks, the pro- 
cedure was essentially the same as for Group 
2. Unfortunately, the pressure exerted to gain
compliance was generally greater than it was
for Group 2.¹¹


11 There was some confusion between the two 
experimenters when subjects in this group were 
being tested. While the experimenter was testing 
some subjects in this group, the relevant question 
was seen as to whether expectation of failure could 
be as readily communicated with and without the 
preliminary “shaping” procedures, in which it was 
predicted the results would be similar to Group 2. 
For other subjects the question was implicitly seen 
as whether subjects would comply as readily with 
the tasks when the innocuous tasks were eliminated 
compared to the ease of gaining compliance when 
they were included. It was predicted from this view- 
point that results would be similar to Group 4, and 
Results. Two subjects attempted to handle
the snake and to throw the acid, and a third 
subject attempted to take the coin from the 
acid. The results are somewhat equivocal, but 
it appears that the “shaping” procedure em- 
ployed is not an essential determinant of 
subsequent behavior. 


Informal Controls 


A number of colleagues were informally 
shown the experimental tasks. These faculty 
members were treated in a fashion similar 
to the treatment of the control group used 
by Rowland (1939). 

They invariably refused to carry out even 
the least objectionable of these three tasks, 
the removal of the penny from the nitric acid. 
The fellow faculty members who were asked 
to perform these tasks could not be pressured 
or persuaded in any way to undertake them. 
It is significant that their behavior was 
identical to that of Rowland’s informal 
control group. 

Two aspects appear relevant in interpreting 
these informal observations: 

1. The faculty members were not in a
formal experimental situation. 

2. A different role relationship existed 
between faculty members and the experi- 
menters compared to that existing between 
subjects and experimenters. These differences 
emphasize the importance of the special con-
texts provided by the combined subject-
experimenter relationship and the nature of
the experimental situation in terms of the
willingness of the subjects to carry out the
experimental tasks.


Postexperimental Inquiry 


With few exceptions, subjects who at-
tempted to carry out the requested activities
reported in the postexperimental inquiry that
they felt quite safe in the experimental situa-


even the simulating subjects, Group 3. Clearly two
separate groups should have been tested. It is certain
that some subjects within this group were treated
quite differently from others. The confusion was
not recognized until after the completion of the
study, and it is not known to what extent the
equivocal results are concordant with the two
expectations that were undoubtedly induced. The
results for the group are reported only for the
sake of completeness.




tion. Though they reported feeling rather
uncertain about the tasks, and reported
strong emotional reactions to the repugnant
activities, the subjects reported that they
were quite convinced that they would not 
be harmed because the context was an experi- 
mental one, presumably being conducted by 
responsible experimenters. All subjects ap- 
peared to assume that some form of safety 
precautions had been taken during the experi-
ment. Although no subject reported suspect-
ing the presence of a glass screen, subjects
felt that the snake had either been milked 
of its venom or defanged. Some felt that the 
solution was not really acid, even when they 
plunged their hand into it; others suspected 
the assistant could duck in time to avoid 
the acid; or the glass screen would be in 
place again (having previously discovered its 
presence during the procedure with the 
snake); or perhaps the assistant would even
plunge himself into a nearby tub of water;
or in one case, that the assistant was not
actually there—it was an illusion produced 
by a complex arrangement of mirrors.


Discussion 


1. No conclusions can be drawn from the
present investigation about the potential use
of hypnosis to induce antisocial behavior.
However, the study clarifies conclusions
which may be drawn from two previous
studies. After replicating the studies by
Rowland (1939) and Young (1952) and con-
firming their results, it has been shown that
similar apparently antisocial behavior can
be elicited in control groups treated in an
identical fashion to a group of hypnotized
subjects. The apparently antisocial actions
were also carried out successfully by sub-
jects who were not hypnotized, indicating
that the tasks are within the broad range
of activities which are perceived as legiti-
mized by the nature of the situation: they
were requests made by experimenters, viewed
by subjects as responsible scientists, in the
context of a psychological experiment.

2. The present study is essentially method-
ological, demonstrating experimental condi-
tions which are necessary to investigate the
antisocial hypothesis. The burden of demon-
strating the production of antisocial behavior
by hypnotic techniques lies with the investi-
gator, who must demonstrate that the so-
called antisocial behavior does exceed that
which is legitimized by the experimental 
situation, and that the behavior is perceived
by the subject as truly dangerous or anti- 
social. It is our belief that it may not be 
possible to test the antisocial question in an 
experimental setting because of the prob- 
lems of finding tasks which are not seen as 
legitimized by the experimental context. 

3. It may seem surprising that the waking 
control group subjects who were pressed to 
comply with the apparently dangerous be- 
havior were quite willing to follow the 
experimenter’s commands. This confirms the 
anecdotal evidence reported above from our 
own laboratory: we still have not found an 
aspect of behavior which is sufficiently safe
to request of a subject, and which a subject 
will refuse to carry out if the expectation 
of compliance is communicated to him. In the 
present study the experimenter could virtu- 
ally predetermine the nature of the resulting 
behavior by deciding in advance whether he 
would consciously, but subtly, communicate 
to a subject an expectation either of failure 
or of compliance. Such a result is concordant 
with other findings (for example, Rosenthal, 
1964) showing the relative ease with which 
an experimenter may nonconsciously bias the 
results of a study by subtly communicating 
his own expectations and hypotheses. 

Failure to gain compliance with hypnotized 
subjects would indicate inadequate replica- 
tion of the exact details of Young’s investiga- 
tion. Because the experimenter is unaware of 
the group membership of real and simulating 
subjects, outcome-bias should not differen-
tially affect performance of subjects in these
two crucial groups. It is partly because of the 
problem of outcome-bias that simulating 
subjects are included as the crucial control 
group. 

4. The purpose of the simulating group 
was to examine whether the chosen behavior 
exceeded the limits of behavior legitimized 
by the special nature of the experimental 
context. The waking control subjects (Group 
4) also complied with the requested activities, 
and in this sense the simulating subjects were 
not essential to demonstrate that the alleged
antisocial actions were within the realm of
what subjects accept as reasonable requests 
in the experimental situation. However, the 
simulating group of subjects cannot be elimi- 
nated from future investigations of this type. 
It is not always possible to determine whether 
waking control subjects have been treated 
differently from the nonhypnotized subjects, 
even though an experimenter is not aware 
of such differential treatment. 

5. The present investigation demonstrates 
the misleading conclusions that may be drawn 
when casual, informal “control” groups, of 
the type employed by Rowland (1939) and 
Young (1952), form the basis for evalu- 
ating experimental performance. It cautions 
against making untested assumptions about 
the way in which subjects will behave in 
an experimental situation, or making as- 
sumptions about what aspects of behavior 
are within the repertoire of the subjects, 
however reasonable these assumptions may 
appear, when subjects are participating in
the social phenomenon known as “the 
psychological experiment.” 
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VISUAL BEHAVIOR IN A DYAD AS AFFECTED BY INTERVIEW 
Delaware 


Hypotheses concerning effects of content, concealment instructions, and sex
of partner upon willingness to engage in mutual glances were tested in a
2 × 2 × 2 × 2 factorial design. 40 male and 40 female students (Ss) were
interviewed by 1 graduate student of each sex (E). Interviewers gazed steadily at S
while asking very personal or innocuous questions. Mutual glances were
recorded and results showed: (a) Ss, when speaking, looked at E significantly
more during the innocuous interview; (b) female Ss looked significantly more
regardless of E’s sex; (c) only sex differences were found in a postexperimental
discussion; and (d) female Ss were more affectionate and inclusion oriented.
Results are discussed in terms of motives to conceal, cathect, and/or reduce
distractions. Mediating effects of personality variables are suggested.


This study reports results from an investi- 
gation of factors which affect visual inter- 
action, i.e., looks and glances, in groups and 
dyads. Cherry (1957) points out that there 
is no communication without a system of 
signs, and that a sign can be defined as any 
physical event used in communication, i.e., “a 
natural or conventional motion used instead 
of words to convey information [p. 7].” 
Since looking at or away from another is a 
natural motion and since people character- 
istically read meanings into such fixations 
or avoidances, we assume that visual inter- 
action is a form of nonverbal communication. 

There is evidence that visual interaction
will lend itself to a study of nonverbal com- 
munication. Gibson and Pick (1963) have 
shown that one knows with a high degree 
of certainty when another is looking him full 
in the face, and that considerable accuracy 
marks the reception of a glance delivered 
from an angle of 70 degrees. Thus we know 
that the signs can be perceived. Exline 
(1963) has reported differences among men 
and women in the exchange of mutual glances 
and has also found that affiliative need and 
competitive conditions interact to determine 
the incidence of such a glance. Thus we know 
that the emission of such signs (looks and 


This report, a revised and expanded version of
a paper read at the XIVth International Congress
of Applied Psychology, Copenhagen, 1961, is based
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glances) can be related to meaningful social- 
psychological variables. 

Precise meanings of the exchange or avoid- 
ance of mutual glances remain to be deter- 
mined. In general a continued exchange of 
glances would seem to signal a willingness 
or a desire to become involved with one 
another, or to maintain an ongoing inter- 
action. Avoidance, on the other hand, would 
seem to indicate a lack of interest in initiating 
a relationship, or in the case of an ongoing 
interaction, would indicate that one or both 
parties wished to break away. If, for example, 
one wished to avoid receiving or sending 
signs of affective arousal one might seek 
to accomplish these ends by avoiding eye 
contact with the other. 

The present study was designed to test 
the above assumptions with respect to with- 
holding signs of one’s own affective arousal. 
The affect selected was that of embarrassment 
since the embarrassed person usually is 
strongly motivated to conceal the fact of its 
arousal. The investigators decided that an 
interview wherein an interviewer would keep 
an interviewee under constant surveillance as 
he asked for very personal or for quite in- 
nocuous information would be suitable to 
create different degrees of embarrassment. 
Not only is the subject asked to talk about 
very self-revealing personal matters, but is 
asked to do it in a situation where it is ap- 
parent to him that another both hears and 
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Pen Recorder 
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Fig. 1. Schematic representation of arrangements
for the observation of visual attention of subject 
to interviewer. 


can see signs of consternation aroused by the 
question. 

Assuming, then, that the more personal the 
question the greater is the likelihood that the 
respondent will be embarrassed, we hypothe- 
size that mutual visual interaction is greater 
in those interviews which deal with relatively 
innocuous rather than with highly personal 
questions. 

It is possible that the concealment of any 
kind of information, whether affective or not, 
inhibits the mutual glance. Thus half of the 
subjects given each type of interview 
were instructed to try to conceal their true 
opinions and attitudes from the interviewer. 

A second hypothesis tested in this study 
concerns the generality of sex differences in 
tendencies to engage in mutual visual inter- 
action. Results from a previous study (Exline, 
1963) showed mutual glances to be more 
prevalent between women than between men. 
Since men and women interacted only with 
others of the same sex, the data provided 
only limited information about the generality 
of the sex difference. Assuming that the previ- 
ously discovered sex difference is not limited 
to like-sex dyads, it is hypothesized that 
women subjects paired with a male or a fe- 
male interviewer will look more at the inter- 
viewer than will male subjects paired with 
the identical interviewer. 
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METHOD


Subjects and Procedures 


Forty male and 40 female students randomly 
drawn from physical education classes required of 
all freshmen and sophomores at the University of 
Delaware, were matched as to sex and intimacy of 
interview protocols. One half of the subjects of each 
sex were interviewed by a male experimenter whereas 
the remaining subjects were interviewed by a female 
experimenter. Subjects in one half of each of the 
four groupings were instructed to conceal their 
feelings and opinions, while the other half were
not so instructed. 
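
The crossing of the four two-level factors described above can be sketched as follows. This is only an illustration: the level labels are hypothetical, and the even spread of subjects over cells is an assumption, although it agrees with the N = 40 per marginal mean and the 64 error degrees of freedom reported later in the paper.

```python
from itertools import product

# Four two-level factors named in the text; the level labels are illustrative.
FACTORS = {
    "subject_sex": ("male", "female"),
    "interviewer_sex": ("male", "female"),
    "interview_topic": ("personal", "recreational"),
    "instructions": ("conceal", "no_conceal"),
}

N_SUBJECTS = 80  # 40 male and 40 female students

# Every combination of factor levels is one cell of the design.
cells = list(product(*FACTORS.values()))

# Assumption: subjects are spread evenly over the cells.
per_cell = N_SUBJECTS // len(cells)

# Each level of any one factor covers half of the cells.
per_level = per_cell * len(cells) // 2

# Within-cell (error) degrees of freedom for a full factorial model.
error_df = N_SUBJECTS - len(cells)

print(len(cells), per_cell, per_level, error_df)
```

Under these assumptions the design has 16 cells of 5 subjects each, so every marginal mean rests on 40 subjects.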

Subjects were brought into the laboratory and 
seated at a hexagonal table facing a one-way 
vision mirror. An experimenter (the observer) told
the subjects that the study was designed to learn 
more about the value of face-to-face interviews as 
a method of obtaining information about others’ 
opinions. The experimenter then introduced the 
interviewer (the experimenter) and left the room, 
or if the design required, gave the concealment 
instructions, introduced the interviewer, and left the 
room. The observer then entered the observation 
booth and recorded the subject’s visual behaviors.

The interviewer (the experimenter) sat opposite 
the subject with his back to a one-way mirror (see 
Figure 1). The distance between the experimenter
and the subject was approximately 4 feet, and a 
partition, 12 inches high, prevented the subjects 
from reading the interview questions. 

Following the interview the subject rated the 
interview, described the interviewer, and filled out 
Schutz’ (1958) FIRO scales. The experimenter 
then notified the subject that the experiment was 
over, and, omitting any reference to visual inter- 
action, discussed the study with the subject for 
approximately the same period of time as the
interview proper. The experimenter gazed steadily
at the subject’s eyes as in the basic interview, and
the observer once again recorded the subject’s
visual behavior. The interview and discussion
periods were recorded on audio tape for later
synchronization of verbal and visual behavior.
The postinterview discussion was designed to
provide a record of the subject’s usual visual 
activity in a relatively less embarrassing situation. 


Induction of Embarrassment 


Different degrees of embarrassment were induced 
by varying the nature of the questions put to the 
subjects. Each interviewer questioned one half of
his assigned subjects about very personal matters,
for example, their fears, desires, needs, and
impulses; the quality of the personal interview is
suggested by the following question: “It seems that
no matter how careful a person is he sometimes
has embarrassing moments. What experience have
you ever had that was so embarrassing that you
felt like sinking through the floor?” 2 The remaining


2 Copies of both sets of interview questions and
tables of raw data have been deposited with
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subjects were asked about their recreational
interests, preferred movies, books, sports, and the
like. The private nature of the first (personal)
schedule was assumed to create a more embarrassing
situation for the subject than would the relatively
innocuous questions of the second (recreational)
schedule. Each interview consisted of 10 questions.


Concealment Induction 


All subjects given the concealment induction were
read the following instructions by the experimenter-observer
who introduced them to the experimental
situation:


As you know, we are trying to test how effectively
this method [the face-to-face interview] obtains 
true and valid information. When being inter- 
viewed, people often conceal their true feelings 
and opinions. We would like you to do the same 
today so that we can compare one type of 
interview with another. In other words, we 
would like you to do your best to conceal as 
much as possible from the interviewer. 


Interview and Interviewer Ratings 


Following the interview and prior to the dis- 
cussion, subjects marked a 7-point scale to in- 
dicate how much they liked the interview. Since 
Bates and Cloyd (1956) have pointed out that 
friendliness is a strong norm of behavior among
collegians, Osgood’s (Osgood, Suci, & Tannenbaum,
1957) semantic differential technique was used to
overcome the subjects’ anticipated resistance to the
direct expression of negative feelings about the
interviewer. Subjects’ perceptions of the experimenter
were measured by means of their responses
to the four adjective pairs which have been found
to contain the highest loadings for each of three
factors, namely, potency, evaluation, and activity.
These 12 pairs were arranged in random order for
presentation to the subjects.


Personality Data 


In order to explore the possibility that general
orientations toward interacting with others might be
associated with visual behavior, Schutz’ (1958)
measure of Fundamental Interpersonal Relations
Orientations (FIRO) was given to all subjects.
Each of the three FIRO orientations was measured
with respect to description of one’s own behaviors
toward others and with respect to the behaviors
one wants from others.


the American Documentation Institute. Order Document
No. [illegible] from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress,
Washington, D. C. 20540. Remit in advance $2.00
for microfilm or $3.75 for photocopies and make
checks payable to: Chief, Photoduplication Service,
Library of Congress.
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Mutual Visual Interaction 


The arrangements for the observation and record- 
ing of mutual visual interaction are schematically 
represented in Figure 1. Since the incidence of 
mutual glances depends upon the willingness of 
both parties to simultaneously look at one another, 
the interviewer was instructed to look the subject in 
the eyes throughout the interview, and the dis- 
cussion which followed it. Mutual visual interaction 
was thus dependent upon the subjects’ tendencies 
to avoid or to meet the gaze of the experimenter.
A chronological record of the frequency and dura- 
tion of each subject’s glances was recorded on an 
Esterline-Angus 20 pen operations recorder by the 
observer who, screened by a one-way vision mirror, 
sat facing the subject and slightly above and off to 
the left of the interviewer. Each time the subject 
looked at the experimenter, the observer depressed 
a switch which caused an inked pen to trace a 
new course until the switch was released. Pre- 
tests of the above procedure showed that the 
observer and the interviewer agreed from 88% to 
98% in their judgments of the frequency and 
duration of the subject’s visual fixations on the interviewer.
In view of the high degree of agreement
the experimenter was released from the distracting 
necessity of recording data. 

Precise coordination of audio and visual tapes 
permitted later synchronization of the speech and 
visual activity of both the subject and interviewer 
(see Figure 2). An index of mutuality was derived 
by expressing the subject’s looking time as a ratio of 
his total possible looking time during the period 
under consideration. The resulting percentage scores, 
prior to being subjected to a complex analysis of 
variance, were transformed by the arc-sine method, 
recommended by Edwards (1960). 
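
The index and its transformation can be sketched as below. This is a minimal illustration, not the authors' computation: the function names and the sample values are hypothetical, and the form arcsin(sqrt(p)) in radians is an assumption, since the text names only "the arc-sine method" and does not give the exact variant taken from Edwards (1960).

```python
import math


def mutuality_index(looking_seconds, possible_seconds):
    """Proportion of the available time the subject spent looking at the interviewer."""
    if possible_seconds <= 0:
        raise ValueError("possible_seconds must be positive")
    return looking_seconds / possible_seconds


def arcsine_transform(proportion):
    """Assumed variant of the arc-sine transformation: arcsin(sqrt(p)), in radians."""
    if not 0.0 <= proportion <= 1.0:
        raise ValueError("proportion must lie between 0 and 1")
    return math.asin(math.sqrt(proportion))


# Hypothetical subject: looked for 45 of a possible 90 seconds.
index = mutuality_index(45.0, 90.0)
transformed = arcsine_transform(index)
print(index, transformed)
```

The transformation is commonly applied to proportions before an analysis of variance because it makes their variance less dependent on the mean.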


RESULTS 
Interview Length 


Almost identical numbers of questions and
probes took almost exactly the same number
of seconds to ask in both types of interview.3
The personal interview, however, was both
longer and marked by proportionately longer
periods of silence than was the interview
concerning recreational interests (see Table
1). Table 1 also shows that the concealment
instructions resulted in shorter interviews.
The application of analysis of variance to
the above data showed difference in interview
length to be significant beyond the .01
level, as shown by F ratios of 18.42 and


3 Including probes, the experimenter took an average
of 79.9 seconds to ask 17.9 questions with the
personal schedule and took an average of 80.4
seconds to ask 17.3 questions with the recreational
schedule. The differences were not significant.
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Fig. 2. Two-minute samples of mutual glances between male subjects and female interviewer (the
experimenter) taken from two interviews.


10.04 for type of interview and instructions, 
respectively (df = 1/64). The difference be- 
tween the percentage of interview time spent 
in silence in personal and recreational in- 
terviews was significant beyond the .001 
level (F = 25.97, df = 1/64). This finding 
has implications for the index of mutuality 
appropriate to test Hypothesis 1. 


Tests of Hypotheses 


If the index of mutual glances is derived 
in terms of the total time taken to complete 
the interview, the fact that the subjects speak 
proportionally more in the personal compared 
to the recreational interview will system- 
atically bias the test of Hypothesis 1. In 
order to eliminate this bias, separate indexes 
were developed to represent the incidence of 
mutual glances during the experimenter’s 
speech (visual attention), during the sub- 
ject’s speech (visual address), and during 
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periods of silence. The index of mutuality 
for visual attention is thus the number of seconds
that the subject looked at the experimenter
while speaking, divided by the total number
of seconds that the subject spoke. Indexes for
visual address and the silent periods were
similarly derived. The above three indexes
also were used to test the hypothesis concerning
sex differences.

Mean mutual glance index scores in Table 2
and results of an analysis of variance in
Table 3 provide partial support for both
Hypotheses 1 and 2. Regardless of sex, subjects
given the personal interview looked at the
interviewer only 39.3% of the time during
their own speech, while those given the
recreational interview looked at the interviewer
49.8% of the time. The difference
was significant beyond the .05 level. Similarly,
those given the personal interview looked at
the experimenter only 27.1% during periods
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of silence while subjects who discussed recreation
topics looked at the experimenter
during 30.3% of the silent time. This latter 
difference while in the predicted direction 
did not reach significance at the .05 level. 
The results did not support the general 
hypothesis during the periods when the 
experimenter spoke, for the data showed that 
subjects gave much visual attention to the 
speaker, regardless of the content of the 
interview, the sex of both the subject and the 
experimenter, and the instructions to conceal. 
The incidence of mutual glances between 
interviewer and subjects was much greater 
when subjects listened than when they spoke. 
Table 2 indicates that this occurred in both 
personal and recreational interviews, and 
examination of each interview showed that 
in 89% of them, subjects looked relatively 
more when listening than when speaking. 
Visual attention to a speaker, regardless of 
topic, would seem to be a marked behavioral 
convention in the population used in this 
study. 

No interview effect was found when mutual 
glances were recorded during the postinter- 
view discussion. Once out of the interview 
situation those given the personal interview 
behaved similarly to those given the recrea- 
tional interview. Both groups of subjects 
increased their mutual glances with the inter-


TABLE 1


MEAN INTERVIEW LENGTH AND PERCENTAGE OF
SILENCE BY SUBJECTS CATEGORIZED BY SEX,
INTERVIEW CONTENT, AND CONCEALMENT
INSTRUCTIONS


                    Interview length     Percentage silence
Variable               M         σ           M        σ
Interview
  Personal          411.95a   141.69       16.36     7.73
  Recreational      299.12    102.15        9.16     5.20
Sex of subject
  Male              343.20    117.81       13.01     7.96
  Female            367.88    150.64       12.51     7.01
Instructions
  Conceal           314.00     96.41       13.64     7.60
  No conceal        397.08    155.34       11.88     7.31

Note. N = 40 per mean.

a Boldfaced means are those associated with
significant effects in the analysis of variance.
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TABLE 2 


MEAN PERCENTAGE OF TIME SPENT IN MUTUAL
GLANCES DURING INTERVIEWERS’ SPEECH, SUBJECTS’
SPEECH, AND SILENT PERIODS
BY SUBJECTS CATEGORIZED AS TO
SEX, INTERVIEW CONTENT, AND
CONCEALMENT INSTRUCTIONS


                              Measurement period
                      Subject   Subject   Both     Total
Variable a            listens   speaks    silent   interview
Interview topic
  Personal             69.4      39.3b     27.1     42.5
  Recreational         66.3      49.8      30.3     52.2
Sex of subject
  Male                 63.8      39.2      24.2     42.9
  Female               71.9      49.9      33.2     51.8
Concealment set
  Conceal              67.8      43.8      30.2     47.7
  No-conceal           67.9      45.3      27.2     47.0
Sex of interviewer
  Male                 63.7      45.1      27.1     41.4
  Female               72.0      44.0      30.3     47.3

Note. N = 40 per mean.

a For purposes of simplification, the only significant interaction
(Sex of Subject × Concealment) is listed in text.

b Boldfaced means are those associated with significant effects
in the analysis of variance.


viewer but the mean increase recorded for 
those given the personal interview was sig- 
nificantly greater (M personal interview in- 
crease = 9.8%, M recreational interview in- 
crease = 4.7%; F=5.70, p<.05, df= 
1/64). 

The data strongly supported the hypothesis 
that women engage in more mutual glances 
than do men. Women looked at the inter- 
viewer 49.9% and 33.2% of the time during 
the subject’s speech and the silent periods, 
respectively. Men looked at the experimenter 
only 39.2% and 24.2% during the same two 
periods. In each case, sex differences were 
significant beyond the .05 level. Similarly, 
women tended to look more at the experi- 
menter during the experimenter’s speech 
(visual attention) than did men (71.9% to 
63.8%). Although the latter difference did 
not reach significance at the .05 level the 
effect was significant (see Table 3) for those 
given the concealment set. Women instructed 
to conceal, looked 74.8% of the time the 
experimenter spoke compared to 69.0% 
recorded for the uninstructed women. The 
reverse was true for men who looked 60.8% 
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TABLE 3 


ANALYSIS OF VARIANCE OF MEAN PERCENTAGE OF TIME SPENT IN MUTUAL GLANCES DURING
INTERVIEWERS’ SPEECH, SUBJECTS’ SPEECH, AND SILENT PERIODS


R. EXLINE, D. GRAY, AND D. SCHUETTE


                                        Subject listens   Subject speaks     Both silent     Total interview
Variables a                        df      MS      F         MS      F         MS      F        MS      F
Interview topic                     1     13.03    .11     840.00   6.80*     62.22    .39    666.61
Sex of subject                      1    396.01   3.35     855.75   6.93*    771.34   4.88*   583.36
Sex of interviewer                  1    436.22   3.69       8.94    .07      90.16    .57     26
Concealment set                     1      9.20    .08      21.73    .18     113.26    .12      5.26    .06
Sex of subject × Concealment set    1    534.42   4.51*    190.93   1.55      19, 01          174.97   2.10
Residual (error)                   64    118.37            123.50            158.19            83.24
Total                              79

a For brevity and clarity of presentation no nonsignificant interaction effects are reported.
* p < .05.
** p < .01.
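
The legible entries of Table 3 can be checked against one another, since each F ratio is the effect mean square divided by the residual mean square for the same measurement period. The sketch below is only a consistency check on printed values; the dictionary keys are illustrative names, not terms from the paper.

```python
# Residual (error) mean squares from Table 3, one per measurement period.
MS_ERROR = {
    "subject_listens": 118.37,
    "subject_speaks": 123.50,
    "both_silent": 158.19,
}

# (effect, period) -> (effect mean square, F ratio as printed in Table 3)
PRINTED = {
    ("interview_topic", "subject_speaks"): (840.00, 6.80),
    ("sex_of_subject", "subject_speaks"): (855.75, 6.93),
    ("sex_of_subject", "both_silent"): (771.34, 4.88),
    ("sex_x_concealment", "subject_listens"): (534.42, 4.51),
}


def f_ratio(ms_effect, ms_error):
    """F ratio for a one-degree-of-freedom effect: effect MS over error MS."""
    return ms_effect / ms_error


recomputed = {
    key: round(f_ratio(ms_effect, MS_ERROR[key[1]]), 2)
    for key, (ms_effect, _) in PRINTED.items()
}

for key, (_, printed_f) in PRINTED.items():
    print(key, recomputed[key], printed_f)
```

Each recomputed ratio agrees with the printed F to two decimal places, which supports the reading of these cells.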


when instructed compared to 66.8% when 
uninstructed. The investigators have no ready 
explanation as to why, when instructed to 
conceal, women should pay more and men 
less visual attention to a speaker than when 
not so instructed. Perhaps the concealment 
strategies of men and women differ with 
respect to attentiveness. 

In the informal discussion which followed 
the interview women also participated in 
significantly more mutual glances (60.3%) 
than did men (49.0%) (MS = 879.82 and 
81.55, F= 10.65, p<.01, df= 1/64). 
Women, it is clear, were more willing to
tolerate a mutual glance than were men;
nor did the sex of their partner seem to
affect the relationship.


TABLE 4


INCIDENCE OF MUTUAL GLANCES BY SEX OF SUBJECTS IN FIRST AND SECOND HALVES OF LONG
AND SHORT PERSONAL INTERVIEWS


                                      Interview length
                                     Long           Short       Subject speaks
Sex and period                     Ma     Mb      Ma     Mb           M
Male
  First half                      37.2   71.5    34.8   69.3        36.0c
  Second half                     28.7   65.9    33.9   66.1        31.3c
  Means: Length × Sex             32.9   68.7    34.4   67.7        33.7d
Female
  First half                      41.2   75.2    54.5   73.4        47.9c
  Second half                     28.0   59.1    54.7   79.7        41.4c
  Means: Length × Sex             34.6   67.2    54.6   76.6        44.6d
Means for Period × Interview
Length
  First half                      39.2   73.4    44.7   71.4        42.0e
  Second half                     28.4   62.5    44.3   72.9        36.4e
Means for Interview Length        33.8   68.0    44.5   72.2        39.2

a Reported in terms of percentage of subjects’ speaking time.
b Reported in terms of percentage of experimenters’ speaking time.
c Means for Sex × Period.
d Means for Sex.
e Means for Interview Period.


It is possible that the obtained differences
can be accounted for by fatigue or satiation
effects associated with the greater length of
the personal interview. Data reported in
Table 4 show that the length of the personal
interview did indeed affect the incidence
of mutual glances between the experimenter
and the subject. Personal interviews were first
divided into two groups according to the
median length, then each interview was split
into two equal time units. Means in Table 4
clearly show that whether visual interaction
was measured when the experimenter spoke, 
or when the subject spoke, subjects looked 
less at the experimenter in the second half of
the longer interview. Mean differences of
10.8% (visual address) and 10.9% (visual
attention) compared with negligible first-
versus second-half differences in the shorter
interviews resulted in Period × Length interaction
effects which were significant at the
.01 level for the subject’s speech and the .05
level for the experimenter’s speech (Fs of
13.89 and 5.37, df = 1/36, were obtained for
the subject and the experimenter’s speech,
respectively). Since the same questions were
covered in the first half of both long and
short interviews, we conclude that the results
were due to interview length rather than
to the nature of the questions. 
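
The first-half versus second-half differences quoted above follow directly from the Period × Interview Length means in Table 4. The sketch below is a simple arithmetic check; the variable and key names are illustrative.

```python
# Period x Interview Length means from Table 4 (percentage of time in mutual glances).
# Keys: (interview length, measure) -> (first-half mean, second-half mean)
HALF_MEANS = {
    ("long", "visual_address"): (39.2, 28.4),     # during the subject's speech
    ("long", "visual_attention"): (73.4, 62.5),   # during the experimenter's speech
    ("short", "visual_address"): (44.7, 44.3),
    ("short", "visual_attention"): (71.4, 72.9),
}

# Drop in mutual glances from the first to the second half of the interview.
decline = {
    key: round(first - second, 1)
    for key, (first, second) in HALF_MEANS.items()
}

for key, value in decline.items():
    print(key, value)
```

The long interviews show declines of 10.8 and 10.9 percentage points, while the short interviews change by 0.4 and -1.5, which is the pattern behind the reported Period × Length interaction.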

In view of the results reported above we
equalized the length of the personal and
recreational interviews, recomputed mutual
glance indexes and reanalyzed the data. Adjusting
the length of the interviews had
practically no effect upon the differences
previously reported in Tables 2 and 3. In
all cases mean differences were of similar
magnitude, in the same direction, and, with
one exception, at the same level of significance.

We conclude that the differences between
the incidence of mutual glances in the personal
and recreational interviews cannot be
attributed to the greater length of the personal
interview. We suggest that the relatively
greater embarrassment resulting from
the more personal questions attenuated
mutual glances during that interview.
Supporting the above interpretation is the
finding that during the postexperimental interview
the mutual glances of subjects given
the personal interview did not differ significantly
from those of subjects given the
recreational interview.

The single change concerned the subject’s
visual attention to the interviewer’s speech.
In the original analysis a tendency for all
subjects to look more at the female than the
male interviewers when listening to them did
not reach significance. In the reanalysis a
similar difference (73% mutual glances with
female versus 64% with male interviewer)
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reached significance at the .05 level (F =
4.05, p < .05, df = 1/64).4


Personality Differences and Visual Interaction 


Data from Schutz’ measure of FIRO sug- 
gest that women’s greater visual activity is 
related to sex differences in interpersonal 
orientation. Women state that they both give 
to and want from others more inclusion and 
affection behavior than do men. Mean in- 
clusion and inclusion wanted scores for 
women were 6.63 and 5.63, respectively, 
compared to 5.10 and 3.58 for men. Similarly, 
women recorded mean affection and affection 
wanted scores of 4.20 and 5.38 while men 
rated themselves at 2.58 and 3.90. Sex dif- 
ferences as shown by the F test, reached sig- 
nificance beyond the .01 level for all four of 
the above scales. The Fs were 13.17, 12.75,
10.02, and 8.97 (df= 1/64) for inclusion, 
inclusion wanted, affection, and affection 
wanted, respectively. Mean differences at- 
tributable to interview topic, sex of inter- 
viewer, concealment set, and their various 
interactions were consistently low and not 
significant. The results suggest that willing- 
ness to engage in mutual visual interaction 
is more characteristic of those who are 
oriented towards inclusive and affectionate 
interpersonal relations, a view supported by 
after-the-fact analysis of mutual glances. 
Men and women who made above median 
scores on both inclusion and affection dimen- 
sions were compared to those scoring below 
the median on both dimensions. High scoring 
subjects mutually used the line of regard to 
a significantly greater degree than did their 
less inclusion- and affection-oriented counter- 
parts.5 The differences are striking but it
would be unwise to accept the data as
definitive. The number of cases is small (four
women and three men in the highs, and only
five men and four women in the lows) and
subjects were not perfectly counterbalanced 
across interview conditions. The results support
the value of further research along these
lines.


4 This finding is probably due to a stabilization of
within-group variance brought about by the correction
for length. Thus a difference which approached
significance in the original analysis (Table
3) reached significance in the reanalysis.

5 Mean mutual glance time, highs = 62.1%; mean,
lows = 29.7%; t = 5.53, p < .01.


Ratings of the Interviewer 


Subjects given the personal interview 
rated the interviewer as more potent on 
Osgood’s (Osgood et al., 1957) semantic 
differential than did those subjects given the 
recreational interview. A mean potency score 
of 14.50 (X =the sum of the ratings of the 
four potency loaded opposed adjective pairs) 
recorded for the personal interview was sig- 
nificantly higher (F = 6.89, p< .05, df= 
1/64) than was the mean of 12.49 recorded 
for the recreational interview. In addition 
men rated the interviewer as more potent 
than did women, and those men and women 
instructed to conceal their opinions “evalu- 
ated” the interviewer less highly than did the 
uninstructed subjects. (These differences are 
both significant beyond the .05 level—F = 
6.22 and 4.62, respectively, df = 1/64). 

To perceive another as potent is to per- 
ceive him as having authority, control, and 
dominion. The investigators suggest that the 
fact that subjects given the personal inter- 
view perceived the interviewer to be more 
potent is consistent with the expectation that 
the personal interview is more embarrassing 
than the recreational interview. 

The finding that men rated the interviewer 
as more potent than did women was un- 
expected, and its meaning speculative. On the 
other hand, the lower evaluation of the 
interviewer by subjects instructed to conceal 
information from him is consistent with the 
finding that subjects rated the concealment 
interview as less pleasant (M pleasantness 
conceal = 3.88, nonconceal = 4.50; F = 4.52, 
p < .05, df = 1/64). The finding that concealment
instructions did not affect subjects’
visual behavior with the interviewer suggests
that the instructions affected subjects’ 
orientation to the total situation rather than 
to the interviewer. 


Discussion 


The data clearly show that subjects looked 
more at the interviewer when listening than 
when speaking to him or her. Female sub- 
jects also showed greater overall eye contact 
with the interviewer, and subjects of both 
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sexes looked less if given the interview which 
dealt with potentially embarrassing topics.

There is evidence that the greater incidence
of eye contact when listening is not specific
to the conditions of this study, for Nielsen
(1962) reported similar findings with a group
of Harvard students. Though not measuring
mutual glances per se, Nielsen noted that
during a heated discussion, ego looked
relatively more at alter during the period of
alter’s speech. While such differences in behavior
may not be general across class or
cultural boundaries, the similarity of our
results to Nielsen’s suggests that a norm of
visual attentiveness to another’s speech may
well exist within the population represented
by American college students.

Our data not only replicate an earlier finding
that pairs of women engage in more mutual
glances than do pairs of men (Exline, 1963)
but also suggest that within the limits imposed
by the situations we have studied
(namely, discussions and interviews between
persons of the same age in situations in
which sexual interest in the other is not a
salient factor), women may be more willing
to maintain eye contact regardless of the
sex of the other person. Our data also suggest
that sex differences in eye contact are a
result of women’s greater orientation toward
affectionate and inclusive relationships with
others, an interpretation which is consistent
with Parsons’ (1955) theory concerning sex
role differentiations learned in the family.
A final point concerns the proposition
that avoiding the glance of the other is
motivated by a desire to conceal one’s affect
from him. Our results showed that subjects
did indeed look less at the interviewer when
answering questions about embarrassing personal
topics. While such data provide some
support for the operational hypothesis designed
to test the proposition, there are
grounds to doubt that the data confirm the
more general proposition itself. The failure of
the concealment induction to intensify the
effect in the personal interview argues against
the proposition. So do the findings that
interview topic did not affect visual avoidance
when subjects listened to the experimenter or
exchanged glances with him in silence.
We cannot say with confidence that the visual
avoidance we observed was due primarily to
a desire to conceal information from the
interviewer. If such a motive existed it may well
have been supplemented by a desire to reduce
distractions while recalling forgotten events,
by resentment felt toward one who asked
personal questions, or by a wish to avoid
seeing another’s reactions to one’s personal
revelations. Until these and additional possibilities
are systematically investigated, our
data concerning visual reactions to embarrassment
would seem to be most reasonably
interpreted as suggesting that individuals
whose composure is threatened by the nature
of their interaction with another, may, perhaps
unconsciously, signal a desire to maintain
psychological distance from the other by
avoiding eye contact with him.


A COMPARISON OF INDIVIDUAL AND GROUP PERFORMANCE
FOR VARYING COMBINATIONS OF INITIAL ABILITY 


MORTON GOLDMAN 


University of Missouri, Kansas City 


This study compares the performance of individuals with that of 2-member 
groups, the Ss in the groups having varying combinations of ability. The 
results showed that: (a) Ss improve significantly when working in pair-groups 
over working as individuals; (b) the group effect is greatest for low-level Ss; 
(c) Ss working with partners at their same level do not improve over working 
with partners below their level; (d) Ss working with partners above their 
level do improve significantly over working with partners at or below their 
level, but, within the limits considered, the extent that the partner is above 
does not matter. In comparing the performance of an individual with that 
of a group, the level of ability of the Ss involved should be described. 


An area of research which has had a long 
history is the comparison of individual per- 
formance with group performance. Studies in 
this area have been reviewed by Kelley and 
Thibaut (1954), Lorge, Fox, Davitz, and 
Brenner (1958), and most recently by Hare 
(1962). In general, most studies find that the 
group performance is superior to that of 
individuals. These studies have examined the 
relationship between group and individual 
performance where the task varies along sev- 
eral dimensions, for example, a comparison 
between tasks which allow and do not allow 
for a division of labor, a comparison between 
manual and intellectual tasks, and a compari- 
son between creative and noncreative tasks. 

Even though it would appear that the 
characteristics or abilities of the members of 
the group would affect the performance of 
that group, few studies have focused on this 
problem. The research of Timmons (1942),
for example, was primarily concerned with an
evaluation of the effect that group discussion 
has on the ranking of five possible solutions 
concerned with the adoption of a parole 
system. Comparisons were made where sub- 
jects worked as individuals or in one of six 
different kinds of groups which were formed 
on the basis of the ability shown on the per- 
formance of the first day. Timmons’ study 
was quite complex and the variables respon- 
sible for his results are difficult to determine. 
For example, after the subjects in his study 
initially ranked five solutions to the parole 
system, they were asked to read a pamphlet 
containing authoritative information about
this subject. The control group, which worked
as individuals, was asked to reread and study
this pamphlet. The other groups, instead of
rereading the pamphlet, were asked to discuss
the material. In addition, competition was
introduced for the subjects working in the
control group.

The current study concentrates primarily
on the comparison of the performance of
groups composed of two members with that
of individuals working alone,1 where the
groups are composed of subjects having vary-
ing combinations of ability. This study will
yield information on the following types of
questions: Does group performance in general
improve more than individual performance?
Do individuals functioning in groups improve
their performance differentially according to
their initial ability? Do high-scoring subjects
perform better as individuals, in groups, or
only in groups of other high-scoring subjects?
Do middle-scoring subjects improve their per-
formance as much when paired with low-
scoring subjects as when paired with middle-
scoring subjects? Disregarding improvement
of performance and considering only absolute
level, does a group composed of two middle-
scoring subjects perform as well as a high-
scoring subject working alone?


1 As will become apparent when the conditions of
the experiment are more fully explained below, the
individual-working-alone treatment designates indi-
viduals working independently on problems in a room
where other individuals are also independently work-
ing on similar problems. Thus the individual working
alone implies working independently without a
partner, but does not imply working in isolation.


METHOD 


The subjects for this experiment were 239 under-
graduate students at the University of Missouri at
Kansas City. Using prepared test instructions, the
subjects in their regular instructional classrooms
were administered Form F of the Wonderlic Intel-
ligence Test. The test was administered without a
time limit being imposed. However, practically all
of the students completed the form in 25 minutes.
On the basis of the initial scores on the test,
subjects were arbitrarily divided into three categories.
The subjects who scored 29 and below were desig-
nated as low (L). The subjects who scored between
30 and 36 were designated as middle (M). The
subjects who scored 37 and above were designated
as high (H). The above divisions were made so
as to have an approximately equal number of
subjects in each of the three categories.

Five weeks after the first administration, 20 L-
scoring subjects, 33 M-scoring subjects, and 46
H-scoring subjects were administered an equivalent
second form of the Wonderlic, Form D. These sub-
jects took this test again on an individual basis.
They worked as individuals in their regular class-
rooms in groups numbering 20-30 in size. This
form was also administered as an untimed
test.
The remaining subjects, with the same interval
of 5 weeks intervening, were paired in the following
combinations: I, two subjects with H scores on
the initial form of the test; II, two subjects with
M scores on the initial test; III, two subjects with
L scores; IV, one subject with an H score and one
subject with an M score; V, one subject with an H
score and one subject with an L score; and VI, one
subject with an M score and one subject with
an L score. In Combination I there were 15 pairs
of subjects; Combination II, 12 pairs; Combination
III, 11 pairs; Combination IV, 11 pairs; Combina-
tion V, 12 pairs; and Combination VI, 9 pairs.
The pair-groups were told to take the test together,
discuss each item, and reach a mutual solution as
to the correct answer. Thus each pair worked to-
gether as a team discussing each question in turn
on the second form of the test. The subjects in the
pair then recorded the mutually agreed-upon
answer. The pairs of subjects were permitted to
work without any time limit imposed. No record
was kept of the time required to complete the test.
The experimental setup used allows each subject
to act as his own control. With an initial score
and a final score for each subject, the improvement
score can be computed by subtracting the initial
score from the final score. In addition, the scores
of each of the six combinations of paired subjects
can be compared with the scores of similar
combinations of subjects working as individuals.
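
The classification and scoring rules described above can be sketched in a few lines of Python. This is only an illustration: the cutoffs (29 and below, 30 to 36, 37 and above) and the definition of the improvement score come from the Method section, while the function names and the sample scores are hypothetical and are not data from the study.

```python
def ability_level(initial_score: int) -> str:
    """Classify an initial test score as L, M, or H using the Method cutoffs."""
    if initial_score <= 29:
        return "L"
    if initial_score <= 36:
        return "M"
    return "H"


def improvement(initial_score: int, final_score: int) -> int:
    """Improvement score: final score minus initial score."""
    return final_score - initial_score


# Hypothetical (initial, final) score pairs, invented for illustration only.
sample_subjects = [(27, 31), (33, 34), (40, 39)]

for initial, final in sample_subjects:
    print(ability_level(initial), improvement(initial, final))
```

A subject whose second score is lower than the first receives a negative improvement score, which is how a negative cell mean can arise.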




RESULTS 
First Analysis 


The first analysis will organize the data to 
answer the following questions: Do subjects 
working together in groups of two improve 
their performance scores over subjects who 
work as individuals? Is this improvement a 
function of the initial scoring level of the 
subjects? 

To obtain information dealing with the 
above questions, a 2 × 3 complex analysis of
variance test was carried out. The six means 
compared appear in Table 1. Comparing the 
three levels of performance (H, M, and L) an 
F = 8.235 was obtained, which, with df 
= 2/169, is significant beyond the .001 level. 
Comparing the improvement of individuals 
with the improvement of subjects working 
together in pair-groups, an F = 35.223 was 
obtained, which, with df = 1/169, is signifi- 
cant far beyond the .001 level. The F for 
row-column interaction was less than 1. 
Thus the data support the expectation that 
subjects improve significantly when working 
together in groups of pairs more than when 
working as individuals. Also the extent of the 
improvement is inversely related to the initial 
performance level of the subjects. The latter 
result appears to be more readily explained by 
the effect of regression and the fact that the 
H scores have less room to move upward than 
the L scores. 
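
As a rough arithmetic check on the first analysis, the sketch below uses the cell means and cell sizes reported in Table 1 to compute the difference between pair-group and individual mean improvement at each level, and the error degrees of freedom of the 2 × 3 design. The variable names are illustrative, and the differences are simple subtractions of reported means, not significance tests.

```python
# Mean improvement scores and cell sizes as reported in Table 1: (mean, n).
individual = {"H": (-0.630, 46), "M": (1.727, 33), "L": (2.000, 20)}
pair_group = {"H": (3.133, 30), "M": (4.458, 24), "L": (5.954, 22)}

# Pair-group mean improvement minus individual mean improvement, by level.
group_effect = {
    level: round(pair_group[level][0] - individual[level][0], 3)
    for level in ("H", "M", "L")
}

# Error degrees of freedom for a 2 x 3 design: total N minus the six cells.
total_n = sum(n for _, n in individual.values()) + sum(n for _, n in pair_group.values())
error_df = total_n - 6

print(group_effect)
print(total_n, error_df)
```

The error degrees of freedom obtained this way agree with the 169 reported in the text.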


Second Analysis 


Would better performance be obtained by
having H-scoring subjects work individu-
ally rather than have them work with "poor"
companions? Will pairing H subjects with
either H, M, or L companions differentially
affect the H subject's performance?


TABLE 1


MEAN IMPROVEMENT SCORES FOR SUBJECTS
CATEGORIZED INTO THREE LEVELS
WHO WORK AS INDIVIDUALS
AND IN PAIR-GROUPS


                             H           M           L
Working as individuals     -.630       1.727       2.000
                          (n = 46)    (n = 33)    (n = 20)
Working in pair-groups     3.133       4.458       5.954
                          (n = 30)    (n = 24)    (n = 22)


TABLE 2


MEAN IMPROVEMENT SCORES FOR SUBJECTS
CATEGORIZED INTO THREE LEVELS WHEN
WORKING AS INDIVIDUALS, AND WITH
L, M, AND H PARTNERS


            Working as     Working with   Working with   Working with
Subjects    individuals    L partner      M partner      H partner
            (1)            (2)            (3)            (4)

1. H        -.630          1.417          1.636          3.133
            (n = 46)       (n = 12)       (n = 11)       (n = 30)

2. M        1.727          2.444          4.458          8.727
            (n = 33)       (n = 9)        (n = 24)       (n = 11)

3. L        2.000          5.954          11.444         13.750
            (n = 20)       (n = 22)       (n = 9)        (n = 12)

Row 1 of Table 2 gives the mean improve- 
ment score of the initially H-scoring subjects 
when they retake the equivalent second test 
individually, working together with initially L 
subjects, working with M subjects, and work- 
ing with H subjects. Using an analysis of 
variance test to examine differences among 
these four means, an F = 8.177 was obtained, 
which, with df = 3/95, is significant beyond 
the .001 level. The mean improvement of the 
H subjects working individually was signifi- 
cantly lower than the mean improvement if 
H subjects were paired with other subjects 
whether the partner was an H, M, or L sub-
ject. No significant difference in improvement
was found for the H subjects when they
worked in pair-groups with H, M, or L
subjects.

Thus, it was found that initially H-scoring
subjects significantly improve their score when 
they work in pair-groups over working indi- 
vidually, regardless of the level of the subjects 
with whom they are paired; and that H sub- 
jects do not significantly improve their score 
when they work with other H subjects over 
the scores they obtained when working to- 
gether in pairs with either M or L subjects. 

This type of analysis answers similar ques- 
tions for the M-scoring subjects. Row 2 of 
Table 2 gives the mean improvement score of 
the initially M-scoring subjects when they 
retake the equivalent second test individually, 
working together with initially L subjects, 
working together with M subjects, and work-
ing with H subjects. Using an analysis of
variance test to examine differences among
these four means, an F = 10.884 was ob-
tained, which is significant beyond the .001
level. The mean improvement of the M sub-
jects working individually was significantly
lower than the mean improvement if the M
subjects were paired with M or H subjects,
but not if they were paired with L sub-
jects. M subjects when working with H sub-
jects significantly improved over working with
either M or L subjects, but M subjects did
not significantly improve when working with
M subjects over working with L subjects.
Thus, for the M-scoring subjects, the re-
sults show that M-scoring subjects improve
their score when working in pair-groups over
working as individuals when they are paired
with M or H subjects, but not when paired
with L subjects; M-scoring subjects when
paired with H-scoring subjects significantly
improve their scores over being paired with
either L- or M-scoring subjects, but M-
scoring subjects do not improve their scores
significantly when paired with M subjects
rather than L subjects.
Finally, examining the L-scoring subjects,
Row 3 of Table 2 gives the mean improve-
ment score of the initially L-scoring subjects
when they retake the equivalent second test
individually, working together with initially
L-scoring subjects, working together with M
subjects, and working with H subjects. An
analysis of variance test examining differences
among these four means yields an F = 19.618,
which, with df = 3/59, was significant far
beyond the .001 level. The mean improve-
ment of the L subjects working individually
was significantly lower than the mean im-
provement if the L subjects were paired with
either L, M, or H subjects. L subjects when
paired with other L subjects were significantly
lower in improvement than if paired with
either M or H subjects, but L subjects did
not significantly improve when working with
H subjects over working with M subjects.
Thus, for the L subjects, the results show
that L subjects significantly improve their
scores over working as individuals when
paired with either L, M, or H subjects, and
pairing L subjects with either M or H sub-
jects gives significant improvement over pair-
ing them with other L subjects; however,
significant improvement differences are not
obtained by pairing L subjects with H over
pairing L subjects with M subjects. The im-
portant factor seems to be that of pairing L
subjects with subjects who initially score
better than the L subjects; the degree to
which the other subjects score higher than
the L subjects does not seem to be essential.
Summarizing the second analysis, the re- 
sults show that: 


1. Both initially L- and H-scoring subjects, 
when paired with either L, M, or H subjects, 
significantly improve their scores over work- 
ing as individuals. The initially M-scoring 
subjects, when paired with M and H subjects, 
significantly improve their scores over working 
as individuals but did not improve their score 
when paired with L subjects. 

2. H subjects, when they work with other 
H subjects, do not improve their scores sig- 
nificantly over the scores they obtain when 
working with either M or L subjects. M- 
scoring subjects, when they work in pairs 
with other M subjects do not improve their 
scores significantly over the scores they ob- 
tain when working with L subjects, but they 
do improve their scores when working with 
H subjects over the scores they obtain when 
working with other M or L subjects. L sub- 
jects, when they work in pairs with either 
M or H subjects, improve their scores sig- 
nificantly over working with other L subjects; 
they do not improve their scores significantly 
when working with H subjects over working 
with M subjects. In other words, subjects 
working with other subjects at their same 
level do not improve their scores significantly 
over working with subjects below their level,
but subjects working with other subjects
above their level do improve significantly
over working with subjects at or below their
level. Subjects working with other subjects
above their level do not improve significantly
more when working with subjects who are further
above their level.


Third Analysis 


The previous analysis examined the effects
on the performance of subjects of a given
level of ability when working with partners
of different levels. The focus was on subjects
of a constant level where the partner with
whom the subjects of a similar level worked
was allowed to vary over L, M, or H. The
third analysis reverses this procedure, focusing
on subjects who vary in level from L, M, and
H, while the partner remains on the same
level.
Column 2 of Table 2 gives the mean score
of the improvement of H subjects when
working with L partners over initially work-
ing alone, the improvement of M subjects
when working with L partners over initially
working alone, and the improvement of L
subjects when working with L partners over
initially working alone. Comparing differences
among these three means with an analysis of
variance test, an F = 4.825 was obtained,
which, with df = 2/40, is significant beyond
the .05 level.

Column 3 of Table 2 gives the mean score 
of the improvement of H subjects when 
working with M partners over initially work- 
ing alone, the improvement of M subjects 
when working with M partners over initially 
working alone, and the improvement of L sub- 
jects when working with M partners over 
initially working alone. Comparing differences 
among these three means with an analysis 
of variance test, an F = 17.721 was obtained, 
which, with df = 2/41, is significant beyond 
the .001 level. 

Column 4 of Table 2 gives the mean im- 
provement of H subjects when working with 
an H partner, the mean improvement of M 
subjects when working with an H partner, 
and the mean improvement of L subjects 
when working with an H partner. Comparing 
differences among these three means with an 
analysis of variance test, an F = 36.866 was
obtained, which, with df = 2/50, is significant
beyond the .001 level.
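
The degrees of freedom reported for the three column comparisons can be checked against the cell sizes in Table 2. The sketch below assumes an ordinary one-way analysis of variance, for which the between-groups degrees of freedom are the number of groups minus one and the within-groups degrees of freedom are the total number of subjects minus the number of groups. The function name `one_way_df` is illustrative.

```python
def one_way_df(cell_sizes):
    """Return (between, within) degrees of freedom for a one-way analysis of variance."""
    k = len(cell_sizes)
    return k - 1, sum(cell_sizes) - k


# Cell sizes read from Columns 2-4 of Table 2 (H, M, and L subjects, in that order).
columns = {
    "L partner": [12, 9, 22],
    "M partner": [11, 24, 9],
    "H partner": [30, 11, 12],
}

for partner, sizes in columns.items():
    print(partner, one_way_df(sizes))
```

Under that assumption the computed values agree with the reported df of 2/40, 2/41, and 2/50.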

The results of the third analysis show, as
might be expected, that the performance of
subjects who initially score low is more read-
ily improved than that of subjects who initially
score high, and that H subjects tend to help their
partner more than L subjects.


Fourth Analysis 


The fourth analysis compares the absolute
level of scores received by the subjects on
the second administration of the intelligence
test working in the various combinations as
individuals and as pair-groups. In this analy-
sis, the interest, rather than being in the
improvement of the second test over the first,
is in the level of performance attained by
the various combinations of subjects used in
this experiment. Altogether, there were nine
combinations: H, M, and L subjects working
individually; H-H, M-M, and L-L subjects
working in pair-groups; and H-M, H-L, and
M-L subjects working in mixed pair-
groups. The nine means for these combina-
tions of subjects are presented in Table 3.
The H-H pair-group received the highest
mean score, followed by H-M, H-L, H indi-
vidual, M-M, M individual, M-L, L-L, and L
individual, in that order. This order is main-
tained in the table going from left to right.


TABLE 3

MEAN SCORES OBTAINED ON A SECOND ADMINISTRATION OF THE WONDERLIC INTELLIGENCE TEST
FOR NINE COMBINATIONS OF SUBJECTS WORKING AS INDIVIDUALS AND IN PAIR-GROUPS

H-H pairs     H-M pairs     H-L pairs     H individual  M-M pairs     M individual  M-L pairs     L-L pairs     L individual
43.000        42.182        41.333        40.109        37.125        35.606        34.389        33.136        27.400
(n = 30)      (n = 22)      (n = 24)      (n = 46)      (n = 24)      (n = 33)      (n = 18)      (n = 22)      (n = 20)

Using an analysis of variance test to ex- 
amine the differences among the means, an 
F = 31.342 was obtained, which, with df 
= 8/230, is significant far beyond the .001 
level. Table 4 shows, using a t test, which
means significantly differed from each other.
In the comparisons involving the H sub-
jects, the results show that: the means of the
three paired groups, H-H, H-M, and H-L,
do not significantly differ from one another;
comparing the paired groups with the H indi-
viduals working alone, the H-M and the H-L
pair-group means were not significantly
higher than the individual H mean, but the
H-H pair-group mean was; any combination
of a pair-group which included an H partner
(H-H, H-M, or H-L) as well as the H work-
ing alone scored significantly higher than the
next best combination (M-M), where an H
subject was excluded.


TABLE 4


COMPARISON OF THE MEANS OBTAINED FOR THE NINE COMBINATIONS
FOR SIGNIFICANT DIFFERENCES USING A t TEST

For the comparisons involving the M sub-
jects, the results show that: the M-M
pairs do not score significantly higher than
the M working as individuals, but the M-M
pairs do score significantly higher than the
M-L pairs; the M-L pairs are not sig-
nificantly higher than the L-L pairs, but
both the M-M pairs and M working as indi-
viduals do score higher than the L-L pairs.

In the comparisons involving the L sub-
jects, the results show that the mean for the
L-L pairs was significantly greater than the
mean for the L working as individuals.

Summarizing the results of this final analy- 
sis, it was found that: 


1. Pair-groups composed of subjects at the
same level do significantly better than the
subjects at that level working individually.
This was found for the H and L subjects,
but not for the M subjects.

2. The three combinations of pair-groups
involving H subjects (H-H, H-M, and H-L) 
do not significantly differ from each other, 
but the M-M pairs scored significantly higher 
than the M-L pairs. 

3. The H-L pair-groups did not score higher 
than the H individual working alone, and the 
M-L pair-groups did not score higher than 
the M individual working alone. 

4. The H individual working alone scored
higher than the M-M pair-groups, and the 
M individual working alone scored higher 
than the L-L pair-groups. Thus the results 
indicate that in certain instances pair-groups 
do better than individuals working alone, 
but that this statement should be qualified 
according to the ability of the subjects when
this comparison is made.


DISCUSSION


The results of this study show that it is
necessary to consider the ability of the indi-
viduals involved when making comparisons
between the performance of individuals and
groups. It was found that pair-groups of each
level of ability (L, M, and H) improve their
performance more than individuals of the
same level working independently. The added
improvement when working as paired groups
over working as individuals differs according
to the level of the subjects. This study finds
that the group effect is highest for subjects
who initially fall in the L category. This find-
ing is contrary to that of Comrey and Staats
(1955), who report that the group effect was
greater for subjects who made high individual
scores.
The purpose or goal to be achieved must
be considered in deciding how to place two
individuals in a pair-group, the individuals
having the same or different abilities. In an
educational or school setting, grouping by
ability is occasionally done. Therefore, if the
improvement of performance of an L subject
is desired, pairing an L subject with either
an H or M subject would be indicated by
this study. Pairing an L subject with an H
partner does not improve the L subject's score
over pairing him with an M subject. If the
improvement of performance of an M subject
is desired, then he should be paired with an
H partner. Pairing an M subject with an M 
partner does not improve his score over 
pairing him with an L partner. The H sub- 
ject does not improve his score differently 
when paired with an L, M, or H partner. 
Either of these pairings, however, will enable 
him to improve over what he could do by 
working alone. As stated above in the sum- 
mary of the second analysis, a subject work- 
ing with a partner at his same level does not 
improve significantly over working with a 
partner below his level, but a subject working 
with a partner above his level does improve 
significantly over working with a partner at 
or below his level. A subject working with a 
partner one step above his level does not 
further improve when the partner is even an 
additional step above the subject’s level. 

Also, the results show that the L subject 
when working with an L, M, or H partner 
improves more than would M or H subjects 
when working with partners on these three 
levels. An M subject does not improve more 
than an H subject when working with an L 
partner, but the M subject improves more 
than does the H subject when working with 
M partners and the M subject improves more 
than the H subject when both are paired 
with an H partner. 

In many situations the interest is not, 
however, in optimizing or improving the 
score of an individual, but in optimizing 
group output. If it is desired to maximize 
the economy of talent in placing individuals 
in groups for highest group performance, then 
this study shows that: it would suffice to 
have a paired group consisting of an H and 
an L subject rather than have the group 
consist of two H subjects or an H and an M 
subject; an H individual working by himself 
does better than a paired group consisting of 
two M subjects and an M individual working 
by himself does better than a paired group 
consisting of two L subjects; and a pair- 
group consisting of an H and an L subject 
does not perform better than an H individual 
working alone and a pair-group consisting of 
an M and an L subject does not perform 
better than an M individual working alone. 

Observing the order of the means in Table 
3 for the various combinations of paired 
groups and individuals, there appears to be a
definite pattern with one misplacement: the
M individual should follow rather than pre-
cede the M-L pair. This pattern would be
expected if it could be assumed that a fixed
probability existed that an individual with a
given initial ability would be able to solve a
certain item. Then in the case of two H sub-
jects working together the probability would
be greater that the combined right answers of
both subjects would yield a higher score than
the combined right answers of an H and an
M subject. In a similar fashion, there would
be a small number of additional right answers
that an L subject would have correct that
an H subject would miss, and thus the proba-
bility would be high that an H subject and
an L subject together would have more right
answers than an H subject working alone.
This same analysis could be applied to the 
scores made for all the various combinations 
of paired subjects considered in this study. 
This interpretation, however, makes the as- 
sumption that if two subjects are working 
together where one subject has the right 
answer and the other has the wrong answer, 
that the right answer will be the one selected. 
The possibility also exists that the incorrect 
answer would be chosen. 
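
The fixed-probability interpretation can be made concrete with a small sketch. It rests on assumptions that go beyond the text: the two members are taken to solve an item independently of one another, the pair is assumed always to adopt the right answer whenever either member has it (the very assumption questioned above), and the per-item probabilities are hypothetical values chosen for illustration, not estimates from this study.

```python
def p_pair_correct(p_a: float, p_b: float) -> float:
    """Probability that at least one of two independent members solves an item."""
    return 1.0 - (1.0 - p_a) * (1.0 - p_b)


# Hypothetical per-item success probabilities for each ability level.
p = {"H": 0.80, "M": 0.70, "L": 0.55}

hh = p_pair_correct(p["H"], p["H"])
hm = p_pair_correct(p["H"], p["M"])
hl = p_pair_correct(p["H"], p["L"])

# Two H subjects together are expected to do better than an H and an M subject,
# and an H paired with an L is expected to do better than an H working alone.
print(hh > hm)
print(hl > p["H"])
```

Under these assumptions any partner with a nonzero chance of solving an item raises the pair's expected score above that of the better member alone, so the model by itself cannot account for pairings that showed no significant gain.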

The above interpretation, in addition, does 
not take into account the possibility that two 
individuals working alone could both get an 
item incorrect, but when interacting with one 
another would answer the same item correctly. 
Unfortunately the data as it was collected in 
this study did not permit the examination of 
this issue. An additional study could well be 
done to resolve this point. 

When one considers the size of the gaps 
between the various means, as presented in 
Table 4, and examines the gaps which are sta- 
tistically significant, the more complete inter- 
pretation given in the summary of the fourth 
analysis in the Results section appears to 
follow. One notes also the more comprehensive 
interpretation that was given in the summary 
of the second analysis in the Results section, 
where again significant differences between 
the various combinations of paired groups 
of subjects were taken into account. 
The final analysis in the Results section,
being concerned with the absolute level
of performance rather than the improvement
of an initial level of work, yields data which
oppose a generalization stated by Lorge et
al. (1958). They felt that the group does not
perform better than the best individual in
the group and further that the high perform-
ance of a group resulted from the presence
of high performing individuals in the group.
The data obtained from the study reported
here show that, while an H subject working
alone performs better than two M subjects
working together and an M subject
working alone performs better than two L
subjects working together, nevertheless, two
H subjects working together do perform bet-
ter than an individual H subject working
alone. Further, while two M subjects do not
perform better than an M subject working
alone, two L subjects working together do
perform better than an L subject working
alone. Thus, in making comparisons of per-
formance between the group and an indi-
vidual, the current study finds that the initial
ability of the subjects who are involved in the
comparisons must be described. Statements
which may be correct when comparing groups
with individuals for subjects at one particular
level may not necessarily be correct when
involving subjects who are at a different level.
THE RELATION BETWEEN INTENTIONAL AND INCIDENTAL
CONCEPT LEARNING AS A FUNCTION OF TYPE OF
MULTIPLE STIMULATION AND COGNITIVE STYLE*


HARRIETT AMSTER 
University of California, Berkeley 


Intentional and incidental learning were studied (a) under conditions in
which only 1 set of stimuli was present and (b) under multiple stimulation 
in which there were 2 sets of stimuli. In the latter case positively and nega- 
tively arousing multiple stimuli were compared. The difference between in- 
cidental and intentional learning was greatest when either type of multiple 
stimulation was present. Incidental learning was significantly reduced by the 
multiple stimulation but intentional learning was relatively unaffected. Other 
significant differences between intentional and incidental learning were ob- 
served. In addition, interference proneness and other cognitive styles, i.e., 
verbalness, rejectiveness, perceptual efficiency, and neuroticism were studied in 
an exploratory manner in relation to the effects of the experimental treat-
ments, and found to have statistically reliable effects. For example, under
intentional set, low-interference-prone Ss showed reliably better recall than the
high-interference-prone Ss, but they were inferior under
incidental set.


The present study considers the relative
amounts of intentional and incidental learn-
ing as affected by whether or not multiple
stimulation is entailed and the nature of its
emotional quality. In addition, interference
proneness and other cognitive styles which
were hypothesized to relate to the experi-
mental variables were studied.

When, as in many studies of incidental
learning, two sets of stimuli are presented to
the subject, the learning relative to one set
of stimuli may be affected by the presence
and nature of the other set. In this study
learning in the presence and absence of a
second set of stimuli is compared. In the case
where two sets of stimuli are present, effects
of positively and negatively arousing stimuli
are compared. The experimental design of
this study is more complex than that of
other studies on the same problem because it
attempts to combine in one experiment aims
for which two experiments would ordinarily
be required: it attempts to study incidental
learning which is concomitant with inten-
tional learning, and with the aid of inde-
pendent groups, to study intentional and
incidental learning of the same task. This
design permits the study of intentional and
incidental learning separately and in relation
to each other.


1
study a principal investigator. The data of this

Two general hypotheses concerning the 
contrasting nature of intentional and in- 
cidental learning were (a) that the effect of 
an intentional set on certain aspects of con- 
cept formation, i.e., concept learning and 
memory for examples of the concept, might 
be somewhat different from the effects of an 
incidental set (Podell, 1958); and (b) that 
intentional learning involves verbal reason- 
ing to a greater extent than incidental learn- 
ing. Accordingly, verbal and nonverbal meas- 
ures of two aspects of concept formation, 
concept learning and recall, were employed. 

A cognitive style which was hypothesized 
to affect the relation between sets was inter- 
ference proneness which is said to reflect an 
individual’s ability “to maintain a set in the 
face of strong and persistent, but irrelevant 
stimulation” or “resist . . . interfering re- 
sponse tendencies [Lazarus, Baker, Brover- 
man, & Mayer, 1957].” Consequently, an 
interference-prone individual would be expected
to accomplish less under an intentional
set in the presence of multiple stimulation
than one not as prone to interference.
Other individual differences which may be 
relevant to the adoption of a particular 
instructional set or determine reactions to 
several types of emotionally arousing stimuli 
were considered in an exploratory manner. 


EXPERIMENTAL PROCEDURES 
Subjects 


The subjects were 57 undergraduate fraternity 
members at the Worcester Polytechnic Institute, 
Worcester, Massachusetts, ranging in age from 18 
to 25 years. They belonged to eight fraternities, and 
were tested in groups of 4 or 5 from the same 
fraternity. They had never served as subjects in 
a psychological experiment nor taken a course in 
psychology, and although they volunteered, their 
fraternities were paid for their participation. 


Experimental Task 


One set of stimuli consisted of two slightly differ- 
ent designs called “Zaregs” (Podell, 1958). They 
had in common a free-form and a triangle inside, 
and a triangle and two appendages outside. (For a 
more detailed description with photographs, see 
Podell, 1958.) The other set of stimuli which were 
alternated with Zaregs was designed to induce 
one of three types of arousal. The positive 
stimuli were pictures of children with pleasant ex- 
pressions; negative stimuli were pictures of phys- 
ically abnormal or diseased children; and the con- 
trol stimulus was a blank screen. 

The stimuli were presented so that an arousal 
stimulus (happy child, abnormal child, or blank 
screen) was always alternated with a Zareg; each 
different instance of Zareg appeared four times; 
each arousal stimulus, twice. Sixteen slides, in- 
cluding blanks in the control condition, were 
presented to each subject. Each stimulus was ex- 
posed for 2.5 seconds by means of an automatic 
slide projector used with a photographic timer. One 
random sequence of the stimulus materials was 
used throughout. 

All subjects were given learning instructions, to learn common features,
with respect to one set of stimuli, but the instructional set is herein
called intentional when it was intentional with respect to Zaregs. For
incidental set subjects were requested to notice the features in each
photo and those common to all the photos; for intentional set they were
requested to notice the features in each Zareg and those common to all
Zaregs. Under the control conditions the incidental learners were to
notice their thoughts while the screen was blank and to observe the
similarities in their thoughts during these periods. Subjects were not
only instructed to notice the features that characterized all photos
(Zaregs) but also to look at, but not to pay special attention to, the
spacers which appear between examples of the class of photos (Zaregs).
They were also told that the first slide would be a line drawing spacer,
example of Zareg, and that this would be followed immediately by a photo
(spacer). In the control condition incidental learners were instructed
that the first slide would be a spacer which is a line drawing and that
it would be followed by the blank screen to which they should attend.


Measures of Learning and Recall 


For the verbal test of concept learning, subjects 
were asked to write a definition of the line draw- 
ings, called Zaregs or spacers, and were told to 
mention all of their common characteristics. For
the test of recall they were instructed: 


Please draw a picture of a Zareg [spacer]. It
need not be exactly like any one that you saw, but
it must be a Zareg [spacer].


The instructions were paraphrased if necessary. 
After 2 minutes, the subject was requested to draw 
another picture of a Zareg (spacer). After the
same instructions, they were directed to: 


Try to make it as different as possible from the
one that you already drew, though still be a
Zareg [spacer]. It is not necessary that it be a
Zareg [spacer] which you have seen.


After 2 minutes, the subjects were asked to draw a
third picture with the instruction to “make it as
different as possible from the two you already
drew. . . .”

The definition and the three drawings were separately and independently
scored by means of the same check list which listed the common (correct)
features. Each pair of judges agreed on over 90% of the possible check
points. Agreement was reached by discussion and majority decision in the
case of disagreement.

The measure of verbalized concept learning was the total number of common
features mentioned in the verbal definition. Partial credit was allowed
if the common features were described correctly but not presented as
essential to the definition. For example, the description might be
prefaced by “Some of them had a . . .” instead of “All of them . . .”

Two measures of representational recall were used. The measure of
immediate recall consisted of the number of common features in the first
drawing. This is the only drawing which reflects memory alone since the
instructions for the other drawings suggested that subjects could
construct new examples of Zareg which should differ from their original
drawing. Consistent recall was the number of common features that
appeared consistently in the three drawings. Thus, for a particular
feature, for example, triangle, to receive credit on immediate recall, it
need appear only on the first drawing, but for credit on consistent
recall, it must have appeared on all three drawings.
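
The two recall measures can be illustrated with a minimal sketch. It assumes each drawing has already been scored into a set of feature labels; the feature names and the four-item check list below are hypothetical simplifications, since the actual check list is not reproduced in the article.

```python
# Hypothetical check list of common (correct) Zareg features.
# The article's real check list is not reproduced, so these labels are assumptions.
COMMON_FEATURES = {
    "free_form_inside",
    "triangle_inside",
    "triangle_outside",
    "two_appendages",
}


def immediate_recall(drawings):
    """Count common features present in the first drawing only."""
    return len(COMMON_FEATURES & drawings[0])


def consistent_recall(drawings):
    """Count common features present in every one of the drawings."""
    shared = set.intersection(*drawings)
    return len(COMMON_FEATURES & shared)


# Three scored drawings from one hypothetical subject.
drawings = [
    {"free_form_inside", "triangle_inside", "two_appendages"},
    {"triangle_inside", "two_appendages", "circle_outside"},
    {"triangle_inside", "triangle_outside"},
]

print(immediate_recall(drawings))   # 3
print(consistent_recall(drawings))  # 1
```

In this example only the inside triangle appears on all three drawings, so it alone earns credit for consistent recall, while three features earn credit for immediate recall.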


Test of Cognitive Style 


Tests of personality and style, including the test of interference
proneness, the color-word test (Jaensch, 1929; Stroop, 1935), were
administered following the experiment. This test consisted of three cards
approximately 8 × 8 inches presented in the following order: A, which
requires reading of the words red, blue, and green; B, which requires
naming the colors of red, blue, and green rectangles; and C, which
requires naming the incongruous ink color in which the words red, blue,
and green are printed. The test was administered individually, and the
time was recorded for each card. Interference proneness was measured by
speed of naming colors on C.
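
As a rough sketch of how the card times might be turned into scores, the function below returns the time on Card C as the interference measure, together with the B/A ratio that a footnote to the Results defines as Verbalness. The function name, the use of seconds, and the reading that a slower Card C time means greater interference proneness are assumptions, not details given in the article.

```python
def color_word_scores(time_a, time_b, time_c):
    """Derive two color-word measures from the three card times (seconds).

    time_a: time to read the words on Card A
    time_b: time to name the colors on Card B
    time_c: time to name the incongruous ink colors on Card C
    """
    if min(time_a, time_b, time_c) <= 0:
        raise ValueError("card times must be positive")
    return {
        # Assumption: a longer time on Card C indicates more interference proneness.
        "interference_time": time_c,
        # Ratio of color-naming time (B) to word-reading time (A).
        "verbalness": time_b / time_a,
    }


# Hypothetical times for one subject.
scores = color_word_scores(time_a=40.0, time_b=60.0, time_c=110.0)
print(scores["verbalness"])  # 1.5
```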


RESULTS 
Concept Learning and Recall 


Table 1 presents the correlations between the three measures of concept
learning and recall for the two instructional sets. Though none of the
correlations was reliably different for the two sets, the correlations
tend to be more substantial under incidental set than under intentional
set. The recall measures had the lowest correlation for intentional
learners and highest for incidental learners for whom the correlation was
reliable (p < .05). The verbal definition measure correlated
significantly (p < .05) with immediate recall for both instructional sets
and for the two sets combined.


TABLE 1

CORRELATIONS BETWEEN VERBALIZED CONCEPT LEARNING AND TWO MEASURES OF
REPRESENTATIONAL RECALL UNDER INTENTIONAL AND INCIDENTAL SET

               Immediate recall   Consistent recall   Immediate recall
Set            and consistent     and concept         and concept
               recall             learning            learning

Intentional         .16                .19                 .46*
Incidental          .57*               .26                 .46*
Both                .42                .33                 .65*

* p < .05.


An analysis of variance, mixed design (Edwards, 1951), of the two
representational recall measures and an analysis of variance of the
concept learning measure, employing independent groups, were performed.
The former analysis, summarized in Table 2, shows that the main effect of
Task was significant, p < .001. The means presented in Table 3 indicate
that almost twice as many common features were immediately recalled as
were consistently recalled and that intentional learning was
significantly better than incidental learning for all three measures, as
shown in Tables 2 and 4. The Task × Set interaction shown in Table 2 was
significant, p < .001. The superiority of intentional over incidental
learners was greater for immediate recall than for consistent recall, and
the superiority of intentional learning to incidental learning was
reliably greater under both conditions of multiple stimulation than under
the control condition. (See the interaction of Arousal × Set in Tables 2
and 4, and the means in Table 5.)


TABLE 2

ANALYSIS OF VARIANCE OF IMMEDIATE AND CONSISTENT REPRESENTATIONAL RECALL
AS A FUNCTION OF INTERFERENCE PRONENESS

Source                          df        MS          F

Between subjects                56      119.18
  Set (S)                        1     2305.63      37.80**
  Arousal (A)                    2      102.23       1.68
  Interference proneness (P)     1        4.80       0.08
  S × P                          1      307.20       5.04*
  S × A                          2      297.04       4.87*
  A × P                          2      145.60       2.39
  S × A × P                      2      110.80       1.82
  Error (b)                     45       61.00
Within subjects                 57      132.82
  Task (T)                       1     4788.03     114.33***
  A × T                          2       37.63       0.90
  S × T                          1      572.03      13.66***
  P × T                          1         .13       0.00
  A × S × T                      2       47.43       1.13
  A × P × T                      2       21.43       0.51
  S × P × T                      1        4.80       0.11
  A × S × P × T                  2       54.30       1.30
  Error (w)                     45       41.88

* p < .05.
** p < .01.
*** p < .001.
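
The F ratios in Table 2 can be checked by dividing each mean square by its error term. The sketch below assumes the usual mixed-design convention: between-subjects effects are tested against Error (b) and within-subjects effects against Error (w). The mean squares are transcribed from Table 2; the variable and function names are illustrative only.

```python
# Error mean squares transcribed from Table 2.
ERROR_BETWEEN = 61.00  # Error (b), df = 45
ERROR_WITHIN = 41.88   # Error (w), df = 45

# A few effects from Table 2, grouped by the error term assumed to apply.
between_ms = {"Set": 2305.63, "Set x Proneness": 307.20, "Set x Arousal": 297.04}
within_ms = {"Task": 4788.03, "Set x Task": 572.03}


def f_ratio(ms_effect, ms_error):
    """F is the effect mean square divided by the error mean square."""
    return round(ms_effect / ms_error, 2)


for name, ms in between_ms.items():
    print(name, f_ratio(ms, ERROR_BETWEEN))
for name, ms in within_ms.items():
    print(name, f_ratio(ms, ERROR_WITHIN))
```

Under that assumption the recomputed values agree with the F column of Table 2 to two decimal places (37.80, 5.04, 4.87, 114.33, and 13.66).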


TABLE 3

MEANS OF VERBALIZED CONCEPT LEARNING AND REPRESENTATIONAL RECALL
UNDER INTENTIONAL AND INCIDENTAL SET

                 Representational recall       Verbalized
Set              Consistent     Immediate      concept learning

Intentional         15.3          32.3             17.8
Incidental          10.9          19.2              9.2
Both                13.1          25.8             13.5

TABLE 4

ANALYSIS OF VARIANCE OF VERBALIZED CONCEPT LEARNING AS A FUNCTION OF
INTERFERENCE PRONENESS

Source                          df        MS          F

Between conditions              11
  Set (S)                        1     1109.40      28.66**
  Arousal (A)                    2        3.07        .08
  Interference proneness (P)     1         .60        .02
  S × P                          1        1.07        .03
  S × A                          2      162.05       4.19*
  A × P                          2       66.36       1.71
  S × A × P                      2      154.52       3.99
  Error                         45       38.71

* p < .05.
** p < .01.


Interference Proneness 


Compared with high-interference-prone sub- 
jects, low-interference-prone subjects’ repre- 
sentational recall was better under intentional 
set, yet inferior under incidental set. The 
amount by which both measures of recall 
were affected by the two sets was reliably 
greater, p < .05, for the less interference-
prone subjects (Tables 2 and 6). However,
the interaction between set and interference 
proneness was not reliable for verbalized
concept learning (Tables 4 and 6). 


Other Cognitive Styles 


The results in this section concern the additional tests of cognitive
style which were analyzed in all by nine analyses of variance, mixed
design, in which cognitive style varied from one analysis to another.² A
separate median for each cognitive style was determined for each
treatment condition, and individuals were thus classified as above or
below the median. As in one of the analyses already reported, the two
dependent variables in every analysis were immediate and consistent
recall, and the independent variables consisted of three kinds of
arousal, two conditions of set, and one individual difference variable.
The nine stylistic variables were relatively independent of one another.
A description of tests, measures, correlations between the measures of
style, and results in tabular form are available.³ Only statistically
reliable results are abstracted here.


TABLE 5

EFFECTS OF AROUSAL ON THE MEANS FOR VERBALIZED CONCEPT LEARNING AND
REPRESENTATIONAL RECALL UNDER INTENTIONAL AND INCIDENTAL SET

                                 Arousal
                      Negative   Positive   Control

Immediate recall
  Intentional set       33.0       35.0       28.9
  Incidental set        16.9       16.8       23.9
Consistent recall
  Intentional set       15.4       14.6       15.9
  Incidental set         6.7       10.0       16.0
Concept learning
  Intentional set       18.9       20.4       14.2
  Incidental set         8.4        7.3       12.0

Note. Interaction: Arousal × Set, p < .05 in both analyses of variance.
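
The classification step can be sketched as a median split computed separately within each treatment condition. The record layout, the condition labels, and the rule that a score equal to the median counts as "low" are assumptions made for illustration; the article does not say how ties were handled.

```python
from collections import defaultdict
from statistics import median


def median_split(records):
    """Classify subjects as high or low relative to their own condition's median.

    records: list of (subject_id, condition, style_score) tuples.
    """
    by_condition = defaultdict(list)
    for _, condition, score in records:
        by_condition[condition].append(score)
    cutoffs = {cond: median(scores) for cond, scores in by_condition.items()}
    # Assumption: scores exactly at the median are classified as "low".
    return {
        sid: ("high" if score > cutoffs[cond] else "low")
        for sid, cond, score in records
    }


# Hypothetical scores in two of the six Set x Arousal conditions.
records = [
    ("s1", "intentional/positive", 10), ("s2", "intentional/positive", 20),
    ("s3", "intentional/positive", 30), ("s4", "intentional/positive", 40),
    ("s5", "incidental/control", 1), ("s6", "incidental/control", 2),
    ("s7", "incidental/control", 3), ("s8", "incidental/control", 4),
]
groups = median_split(records)
print(groups["s2"], groups["s8"])  # low high
```

Because the cutoff is set within each condition, subject s8 is classified as high even though his score is lower than every score in the other condition.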

A second measure derived from the color-word test, Verbalness,⁴ was found
to be independent of the first, interference proneness (r = .06). The
interaction between Set, Task, and Verbalness was significant, p < .01
(F = 6.72, df = 1/45). The effect of set on consistent recall was greater
for the high-verbal subjects than for the low-verbal: high-verbal
subjects were facilitated by the intentional set and retarded by the
incidental set, the respective means being 18.3 and 9.1; low-verbal
subjects were not affected by the set, the means being 12.3 and 12.1. For
immediate recall the differences between sets

2 Interpret results with caution since the analyses involve repeated
tests on the same data. Results are described as reliable if p < .01. The
distribution of true probabilities nominally significant at a particular
level of confidence ($\alpha$) is given by $(\alpha + (1 - \alpha))^n$,
in which $\alpha$ represents the nominal significance level being used,
and $n$ refers to the number of significance tests employed. If eight
comparisons are made, the probability that two will be significant at the
.01 level is < .01.

3 This material has been deposited with the American Documentation
Institute. Order Document No. 8204 from ADI Auxiliary Publications
Project, Photoduplication Service, Library of Congress, Washington, D. C.
20540. Remit in advance $1.75 for microfilm or $2.50 for photocopies and
make checks payable to: Chief, Photoduplication Service, Library of
Congress.

4 Verbalness is the ratio of the time to name the colors on the color
card (B) to the time to read the words on the word card (A). Lazarus,
Baker, Broverman, and Mayer (1957), and Broverman and Lazarus (1958)
suggest that the ratio of reading and color naming (B-A/A, or B/A)
reflects conceptual ability of a verbal sort, beyond the motor skills in
reading.




TABLE 6

MEAN CONCEPT LEARNING AND REPRESENTATIONAL RECALL UNDER INTENTIONAL AND
INCIDENTAL SET AS A FUNCTION OF INTERFERENCE PRONENESS

                                          Set
                                 Intentional   Incidental

Immediate recall
  High interference proneness       30.3          20.8
  Low interference proneness        34.3          17.5
Consistent recall
  High interference proneness       13.7          12.1
  Low interference proneness        16.9           9.7
Concept learning
  High interference proneness       17.8           9.5
  Low interference proneness        17.9           9.0

Note. Concept learning: Set × Personality, ns; immediate and consistent
recall: Set × Personality, p < .05.


were practically the same for the two groups. 
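
The caution in Footnote 2 about repeated tests can be made concrete with a short calculation of the binomial terms it refers to. The sketch assumes the tests are independent, which is only an approximation for repeated analyses of the same recall data, and the function name is illustrative.

```python
from math import comb


def prob_at_least(k, n, alpha):
    """Probability that at least k of n independent tests reach level alpha by chance."""
    return sum(
        comb(n, j) * alpha**j * (1 - alpha) ** (n - j)
        for j in range(k, n + 1)
    )


# Eight comparisons at the .01 level, as in Footnote 2.
print(round(prob_at_least(1, 8, 0.01), 4))  # 0.0773
print(round(prob_at_least(2, 8, 0.01), 4))  # 0.0027
```

With eight tests at the .01 level, one chance result is not rare (about .08), whereas two or more chance results would occur with probability below .01.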

There were also significant effects of Perceptual Efficiency, which was
the threshold for tachistoscopically presented neutral words;
Rejectiveness, which was the number of disliked pictures (Gardner,
Holzman, Klein, Linton, & Spence, 1959; Schlesinger, 1954); and
Neuroticism as measured by the Winne scale (Winne, 1951). The correlation
between Neuroticism and Perceptual Efficiency was .03, between
Neuroticism and Rejectiveness .04, and between Perceptual Efficiency and
Rejectiveness .03.

Perceptually inefficient subjects were found to recall more common
features than the perceptually efficient (F = 6.32, df = 1/45, p < .01).
Their mean recall score was 21.22 compared with the mean of 17.65 in the
perceptually efficient group. Rejective subjects performed better than
nonrejective subjects on both measures of recall (F = 6.13, df = 1/45,
p < .01). The means of the two groups were 21.6 and 17.8, respectively.

The interaction of Task, Arousal, and Neuroticism was statistically
significant (F = . . . , df = 2/45, p < .01). The differential effects of
arousal were only reliable for the consistent recall measure and these
results are reported below. The neurotic subjects’ recall of the abstract
stimuli was greatly inferior to the nonneurotics’ under positive arousal.
The mean number of features consistently recalled by the neurotics was
7.4 while the nonneurotics recalled 16.7 features. Under negative arousal
the neurotics recalled 10.7 features and the nonneurotics 11.4 features.
By contrast, under the control condition the neurotics recalled 18.8
features while the nonneurotics recalled 13.5 features. Also, the
neurotics’ recall was better under the control condition (mean recall,
18.8) than under positive arousal (mean recall, 7.4). However, the
reverse tended to be true for the nonneurotics.


Discussion 


Relation between Intentional and Incidental 
Learning 


In an analysis of the process of incidental 
learning (Podell, 1958), it was hypothesized 
that incidental concept formation is strongly 
dependent on memory in that features of an 
example of a class which tended to be remem- 
bered clearly would tend to be regarded as 
common to members of the class. Accord- 
ingly, all features which are strongly remem- 
bered should tend to occur in all measures 
of concept formation. Accordingly, the cor- 
relation between immediate and consistent 
recall was found to be greater for the inci- 
dental set than for intentional set. For the 
intentional set, on the other hand, concept 
formation could be relatively independent of 
memory because subjects would recall many 
features which they knew were not common 
to the whole set of instances. As expected, 
the difference between the two measures of 
recall was greater for intentional set than for 
incidental set. 

The results on the effect of multiple stimu- 
lation were relevant to two aspects of the 
relation between intentional and incidental 
learning. According to a response-competition 
view, suggested by Maltzman (1955), and 
applied to incidental learning by Mayzner 
and Tresselt (1956), the presence of multiple 
stimuli should reduce conceptual thinking 
under both sets. This would have entailed 
a main effect of multiple stimulation which 
was not obtained. According to a comple- 
mentary view suggested by Postman and 
Bruner (1949), among others, and supported 
by empirical evidence (Bahrick, 1957), there 
should be a complementary relationship be- 
tween intentional and incidental learning 
which would have entailed an increase in
incidental learning whenever a decrease in 
intentional learning occurs. The results sup- 
port neither of these views since only inci- 
dental learning tended to be affected by the 
presence of multiple stimulation. 

The fact that multiple stimulation did not 
reduce intentional learning might be expected 
on the basis of a well-substantiated finding 
(for example, Ross, Rupel, & Grant, 1952)
that there is a kind of insulation of inten- 
tional cognitive behavior against stress and 
distraction. The present study further sug- 
gests not only that insulation of intentional 
learning occurs, but that it happens at the 
expense of incidental learning. Consequently, 
intentional learning may remain relatively 
constant in contrast to incidental learning 
which is more vulnerable to affect and the 
presence of multiple stimulation. 


Interference Proneness 


The type of intentional thinking which 
characterizes the non-interference-prone indi- 
vidual probably involves strong, well-learned 
habits which interfere with incidental re- 
sponses. This suggests that his incidental 
responses are not necessarily weak but only 
appear so because his intentional responses 
are strong. However, this assumption does 
not account for the fact that the effect of 
interference proneness was not confined to 
conditions of multiple stimulation. For inci- 
dental learning under the control condition 
in which no multiple stimulation occurred, 
the intentional responses could not have been 
greatly interfering, and in this case where 
there would be minimal compensation from 
strong intentional responses, recall by the low- 
interference-prone subjects was also inferior 
to recall by high-interference-prone subjects. 
Consequently, the facts are best described by 
assuming that low-interference-prone indi- 
viduals are characterized both by strong in- 
tentional processes and ineffectual incidental 
processes. 


Other Cognitive Styles 


The low-verbal subjects were unaffected by the two sets, while for
high-verbal subjects superiority of intentional concept formation over
incidental concept formation was considerable. One possibility is that
the low-verbal subjects approached the task in the same perceptual way
regardless of set, thereby learning the same amount under the two sets.
Moreover, verbalness has been related to the tendency of individuals to
function conceptually rather than perceptually (Broverman & Lazarus,
1958), and this would seem to involve emphasis on reasoning.
Consequently, the low-verbal individual may, on the one hand, be
inefficient in intentional thinking which involves verbalization and, on
the other hand, be efficient in the sort of incidental thinking which
involves imagery and other nonverbal responses.

Perceptually efficient subjects showed poorer recall than the
perceptually inefficient. Possibly, regardless of instructions, subjects
high in perceptual efficiency habitually pay more attention to meaningful
stimuli (photographs) than to abstract stimuli (line drawings), or
perhaps they do not employ verbal mediation as readily as others. The
superior performance of rejective subjects may, to speculate further,
reflect an underlying critical and possibly analytical attitude.

Considering that the presence of two sets of stimuli might evoke
competing responses and considering neuroticism as a measure of drive, it
might be expected that for neurotics arousal conditions would increase
drive to an exceedingly high level, resulting in a decrease in both
intentional and incidental performance. Under control conditions,
however, the initially high level of drive would remain optimal because
the situation would not entail response competition or arousal.
Similarly, the arousal conditions should increase the drive of the
nonneurotics from an ineffective to a facilitative level. The results are
clearly in accord with this analysis.


REFERENCES 


Bahrick, H. P. Incidental learning at five stages of intentional
learning. Journal of Experimental Psychology, 1957, 54, 259-261.

Broverman, D. M., & Lazarus, R. S. Individual differences in task
performance under conditions of cognitive interference. Journal of
Personality, 1958, 26, 94-105.

Edwards, A. L. Experimental design in psychological research. New York:
Rinehart, 1951.

Gardner, R., Holzman, P. S., Klein, G. S., Linton, Harriet B., & Spence,
D. P. Cognitive control: A study of individual consistencies in
cognitive behavior. Psychological Issues, 1959, 1(4), 1-186.

Jaensch, E. Grundformen menschlichen Seins. Berlin: Otto Elsner, 1929.

Lazarus, R. S., Baker, R. W., Broverman, D. M., & Mayer, J. Personality
and psychological stress. Journal of Personality, 1957, 25, 559-577.

Maltzman, I. Thinking: From a behavioristic point of view. Psychological
Review, 1955, 62, 275-286.

Mayzner, M. S., & Tresselt, M. E. The effect of the competition and
generalization of sets with respect to manifest anxiety. Journal of
General Psychology, 1956, 55, 241-247.

Podell, Harriett A. Two processes of concept formation. Psychological
Monographs, 1958, 72(15, Whole No. 468).

Postman, L., & Bruner, J. S. Multiplicity of set as a determinant of
perceptual behavior. Journal of Experimental Psychology, 1949, 39,
369-377.

Ross, B. M., Rupel, J. W., & Grant, D. A. Effects of personal,
impersonal, and physical stress upon cognitive behavior in a card
sorting problem. Journal of Abnormal and Social Psychology, 1952, 47,
546-551.

Schlesinger, H. J. Cognitive attitudes in relation to susceptibility to
interference. Journal of Personality, 1954, 22, 354-374.

Stroop, J. R. Studies of interference in serial verbal reaction. Journal
of Experimental Psychology, 1935, 18, 643-661.

Winne, J. F. A scale of neuroticism: An adaptation of the Minnesota
Multiphasic Personality Inventory. Journal of Clinical Psychology,
1951, 7, 117-122.

(Received July 30, 1963) 


Journal of Personality and Social Psychology 
1965, Vol. 1, No. 3, 224-228


ASPIRATIONS FOR THE GROUP CHOSEN BY CENTRAL
AND PERIPHERAL MEMBERS ¹


A member of a group whose position is more central, because his acts are a
prerequisite for locomotion by fellow members, is conceived as having a
stronger desire for his group to achieve success. This greater motivation should
have consequences for the aspiration he proposes for his group. In a laboratory
experiment, central members, more than peripheral ones, had greater
involvement in the group’s task, perceived themselves as more responsible for
the outcome of the group, and selected aspirations for the group indicative of
a stronger desire for that group to perform successfully.


In many groups some member is respon- 
sible for facilitating movement toward com- 
pletion of the group’s task; unless he does his 
part, the group can make little progress. A 
person occupying such a central (C) position, 
we assume, is likely to have a stronger desire 
for achievement of success by his group than 
one who is in a peripheral (P) position be- 
cause C perceives that the success or failure 
of his group is largely due to his own effort. 
It has been observed, in support of this 
assumption, that members exert greater effort 
in behalf of their group the larger their share 
of the group’s task (Shaw, 1960) and the 
more important their function for the group 
(Pepitone, 1952). 

Desire for group achievement (dg Ach) is 
a motive to strive for the success of a group 
in which one is a member. It is aroused by 
membership in a working group and is 
stronger the greater one’s responsibility for 
the performance of the group or the greater 
one’s strength of membership in the group. 
Even though it is a socially oriented motive 
and not an enduring disposition of the per- 
sonality, dg Ach is conceived to be an ana- 
logue of the individual motive for personal 
achievement (n Ach). Atkinson (1957) has 
described how n Ach affects an individual’s 
level of aspiration. We anticipate by analogy 


1 The research reported herein was performed pur- 
suant to a contract with the United States Office 
of Education, Department of Health, Education, and 
Welfare. 

We are grateful for the assistance of David 
Dustin during the administration of the experiment. 

2 Now at the Case Institute of Technology. 


that dg Ach will determine a member's level 
of aspiration for his group. 

Assume that a group is constrained to attempt any one of a set of tasks,
each of which has a given degree of objective difficulty, as in a typical
level of aspiration situation. In the approach suggested by Atkinson, a
person with a stronger personal motive to achieve prefers a personal task
of intermediate difficulty rather than a very easy or very difficult one.
C, more than P, it follows, will prefer his group to attempt a task with
an intermediate level of difficulty rather than a very easy or very
difficult one. This is the primary hypothesis in the present study.
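
The reasoning behind the primary hypothesis can be sketched numerically. Atkinson's (1957) model is commonly summarized as tendency = motive × probability of success × incentive, with incentive taken as 1 minus the probability of success; that formula is not spelled out in this article and is supplied here as background. The 14 probabilities are hypothetical stand-ins for the percentages on the design cards, and the motive to avoid failure is left out of this simplified version.

```python
def approach_tendency(motive, p_success):
    """Atkinson-style tendency: motive x probability x incentive (1 - probability)."""
    return motive * p_success * (1.0 - p_success)


# Hypothetical success probabilities for a 14-step difficulty scale
# (easiest first). Only the 98% endpoint comes from the article.
LEVELS = [0.98, 0.90, 0.83, 0.75, 0.68, 0.60, 0.52,
          0.45, 0.37, 0.30, 0.22, 0.15, 0.07, 0.02]


def preferred_level(motive):
    """Index of the level with the highest tendency for a given motive strength."""
    return max(range(len(LEVELS)),
               key=lambda i: approach_tendency(motive, LEVELS[i]))


def intermediate_advantage(motive):
    """How much the best level exceeds the easiest level in tendency."""
    best = approach_tendency(motive, LEVELS[preferred_level(motive)])
    return best - approach_tendency(motive, LEVELS[0])


print(preferred_level(1.0))  # 6
print(intermediate_advantage(2.0) > intermediate_advantage(1.0))  # True
```

In this simplified form the most attractive level is always the one nearest a .50 chance of success, and a stronger motive widens its advantage over the extremes. That widening is the sense in which C, with the stronger desire for group achievement, is expected to favor intermediate tasks more than P.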

Because C has stronger dg Ach than does P, we expect that he is more
likely to view the group’s level of aspiration (the level of task agreed
upon by the members) as a standard of performance the group ought to
achieve. Thus, the standard will be used more by C than by P as a
criterion for evaluating the quality of the group’s performance: a
successful group will be rated more favorably by C than by P and an
unsuccessful group more unfavorably. The performance of the group is more
likely to determine C’s evaluation of his personal performance than P’s
self-evaluation.

The central and peripheral positions are parts of the group’s task
structure. One may also conceive of degrees of centrality in a group’s
decision structure. In a centralized decision a particular member makes
the decision for the group and in a decentralized decision each member
has an equal share in determining the choice of the group. We expect that
the differential effects of positions C and P, already noted, will be
greater in a group where C makes a centralized decision on the group’s
level of aspiration than in a group where C and P participate in a
decentralized group decision.

METHOD


Subjects

Subjects were 180 eleventh- and twelfth-grade boys in a middle-class
suburban community. A room in the school was used for the laboratory.

Three subjects, assigned by a school official, appeared at each
experimental session. The experimenter introduced himself as a
representative of the . . .ity Testing Service (a fictitious
organization) and stated that he was testing all junior and senior boys
for Organized Work Aptitude, ability to work in a team relationship. The
subjects were seated at a single table but concealed from one another by
wooden partitions.

They were told that each was to construct on the table before him, within
a limited time, an exact duplicate of a design made of dominos, that they
would each work on the same design, that they would work in accord with
particular rules for the . . .

There were 14 large poster cards, each displaying a geometric design made
of dominos. The designs increased in complexity. The dominos in the
designs were placed end to end, or end to side, each domino being
numbered from 1 through the number of pieces in the design; the simplest
design contained 5 dominos and the most complex, 18. On each card there
was also a large number with a percentage sign. The subjects were told
that these numbers referred to the proportion of teams like their own who
had been able to complete that design within the allotted time. These
percentages, intended to give the subjects an indication of the
probability of being able to complete each level of difficulty, were high
for the easiest design (98%) and low for the most difficult design
(. . .%), with fairly equal gradations for each design between these two
extremes. The series of designs thus constituted a 14-point scale of
difficulty from very easy to very difficult.

Each subject was furnished a bucket with a supply of dominos sufficient
to complete the design on which he was working, plus an equal number of
. . . intended to increase the difficulty of the task. He was instructed
to withdraw dominos from the bucket five at a time, as rapidly as he
could. Before any domino could be placed in the design, he was told, it
was necessary for him to have the preceding one in place. Thus, he could
not begin his design until a domino with one dot was found (Number 1) and
put in place, after which he sought a domino with two dots (Number 2) and
put it in its place, and so on.


Task Centrality 


Within each group one member was to be a central 
(C) person and the other two peripheral persons 
(Pı and Ps). The assignments to positions were 
determined by having the subjects draw lots before 
they were seated at the table. Degree of centrality 
was created by a restriction on the way the work 
was to be done. They were informed that the 
persons whose lots had been Pı and Ps could not 
place a domino in their design until C had done so. 
While working on the task, Person C was free to 
move as rapidly as he could in completing his 
design (each piece in its proper order) with the 
requirement that he call aloud his progress through 
the various steps in his design. Persons P1 and P2 
were also free to draw dominos from their buckets 
as fast as they could, but they could not place 
them in their design (each piece in its proper order) 
until C had provided his enabling act. The actions 
of C, then, were more central in the task structure 
than those of P1 and P2 even though all three 
members performed exactly the same physical 
movements. 
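
The ordering restriction that defines task centrality can be stated as a small rule. The sketch below is illustrative only: the function name `can_place` and the dictionary `placed` are hypothetical, and it assumes that C's "enabling act" for a given piece is C having already placed the piece with the same number, which the text implies but does not spell out.

```python
def can_place(position, piece, placed):
    """Return True if `position` may put domino number `piece` in place.

    `placed` maps each position ("C", "P1", "P2") to the number of dominos
    that member has already put into his design.
    """
    # Every member works in numerical order: piece k follows piece k - 1.
    if piece != placed[position] + 1:
        return False
    # C is not restricted by anyone else.
    if position == "C":
        return True
    # Assumption: a peripheral member may place piece k only after C
    # has placed (and called aloud) his own piece k.
    return placed["C"] >= piece


placed = {"C": 0, "P1": 0, "P2": 0}
print(can_place("P1", 1, placed))  # P1 must still wait for C
placed["C"] = 1
print(can_place("P1", 1, placed))  # C has placed piece 1, so P1 may follow
```

Under this rule all three members make the same physical movements, but only C's progress gates the progress of the others.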


Decision Centrality 


Members were informed that the design on which 
the group would work (the group's level of aspira- 
tion) was to be determined within the team. In 
order to invoke a centralized decision, half of the 
groups were told that the decision was to be made 
by C. He was to make this decision alone without 
any advice from his teammates. All subjects first 
completed private ballots in which they stated the 
design they preferred for the group (member's 
aspiration for group); these were given to the 
experimenter and were not exchanged among sub- 
jects. C’s suggestion was announced as the group’s 
aspiration. In order to create a decentralized decision, 
subjects were told that each would privately vote 
on the design they wished the group to select and 
that majority rule would decide. Members’ votes were 
tabulated by the experimenter and the results were 
announced, anonymously. The announced decision 
was a fabricated one to match a design level chosen 
in one of the centralized decision groups. This 
fabrication was necessary in order that teams in 
the centralized and decentralized decision conditions 
have comparable group aspirations, that is, that 
they work on designs with comparable levels of 
difficulty. 


Success and Failure 


Subjects were informed that their effort on a 
given design would be rated either a success or a 
failure; a success if they finished within the time 
limit, otherwise a failure. The experimenter carried 
TABLE 1
FELT MOTIVATION AND EVALUATION OF EFFORT BY PARTICIPANTS

                                          Central  Peripheral            Group     Group
                                                                        succeeds   fails
Item                                         M         M         F         M         M        F
1. How large a part you had (a)            5.42      4.38      6.50**    5.23      4.21     6.37
2. How much mates depended on you (b)      6.30      4.90      8.75**    5.49      5.24     ns
3. Personal responsibility for group (c)   5.32      3.93      9.26**    4.58      4.21     2.45*
4. How hard people try (d)                 6.73      5.08      2.24*     5.37      5.30     ns
5. How hard mates tried (e)                6.43      5.82      6.77**    6.29      5.76     5.88
6. How let down if group fails (f)         5.10      4.54      2.43*     5.06      4.40     2.86*
7. How well team performed (g)             4.27      3.92      3.88**    5.96      2.11    42.66**
8. How well you performed (h)              4.32      4.98      4.83**    5.63      3.78    15.41

a Query: How large a part do you feel you played in your team's performance? (none at all-very large)
b Query: How much did you feel that others depended on you to help your team succeed? (not at all-very much)
c Query: How responsible do you think you personally were for your team's success or failure? (not at all responsible-all my ...)
d Query: How hard do you think people would usually try on a test like this? (not at all-would try very hard)
e Query: How hard do you think the other members of your team tried? (not at all-tried very hard)
f Query: How let down do you think people usually are when their team fails on this test? (not at all-very much)
g Query: How well do you think your team performed on this test? (very poor-excellent)
h Query: All in all, how well do you think you performed your part on this test? (very poor-excellent)
* p < .025.
** p < .001.
*** p < .0001.


a stopwatch and clicked it shortly before C had 
finished his design (thus, P1 and P2 had of course 
not finished their designs) in order to create a per- 
ception of failure, and clicked it shortly after all 
three members had completed their designs in order 
to create a perception of success. After appropriate 
inspection of his watch, he announced to the group 
whether it had succeeded or failed on the task. 
Half of the groups were given a success and half 
were given a failure on all trials within each of the 
treatments already described. 


Testing Procedure 


In order to ensure that they understood the 
instructions, the groups were first given a practice 
trial on a design smaller than any in the test series. 
All teams were told that they had been successful 
in their practice trial. They were then informed 
that the rules required that they begin work on 
Design Number 7 (the midpoint in the scale of 
difficulty). Half of the teams succeeded and half 
failed on this design. Thereupon, they chose (either 
by a centralized or a decentralized decision) the 
design for the next trial and were tested on that 
design; this procedure was repeated twice more, 
except that they were not actually tested on the 
fourth trial. 

After selection of the design for a fourth trial, 
subjects were given a questionnaire. It was pre- 
sented to them as an interruption in the test and 
as a means whereby the University Testing Service 
might learn to do a better job when testing high- 
school students. Queries on this instrument were 
in fact to help us validate the experimental condi- 
tions and to assess the effects of these conditions 
on motivation and self-evaluation. The queries are 
shown in Table 1. All were Likert-type, 7-point rating scales.

Following the questionnaire the test was described as completed and the subjects were told about the purpose of the study, any deceptions, and the reasons for these. Their questions were answered and appropriate assurances were given them about the nature and meaning of their behavior on the test. They were asked to maintain secrecy about the experiment until a stated deadline had passed.


RESULTS 
Stated Motivation 


One team member (C) was placed in a position with greater influence upon the output of the group than the other members (Ps). Were the participants aware of this relative effect? Queries on this issue were contained in the postexperimental questionnaire. The responses shown in Rows 1-4 of Table 1 reveal that Cs compared to Ps perceived they had the larger part in the group task, that other members were more dependent on Cs, and that Cs were more responsible for the quality of the group's product. It is noteworthy, moreover, that there was a significant interaction for the last one of these three queries (Row 3) such that Cs perceived they were more responsible for the performance of the group than did Ps when the group had failed but not when it had succeeded (F = 14.72, p < .001).3

3 All values of F in this report have ajail 
The relevant means here are: C-success, 4.91; P-success, 4.39; C-fail, 5.70; P-fail, 3.45. 


Not only did the Cs view themselves as more responsible for the performance of their group, they also perceived other members as more ready to exert effort in behalf of the group, as can be seen in Rows 4-6 of Table 1.
The type of decision method, centralized 
versus decentralized, had no differential 
effect upon these results. 
Mimea of Aspiration Choice 
It was predicted that C more than P will prefer his group to attempt tasks with intermediate levels of difficulty. This prediction does not unequivocally suggest that the mean aspiration proposed by Cs will differ from the mean aspiration proposed by Ps, and there was no difference in those means.
The hypothesis does suggest that Cs and Ps will differ in the extent to which their aspirations for the group differ, one aspiration from another, during a series of trials. Recall that the members of each team, Cs and Ps alike, received identical information about their team's performance (success or failure) on the group's task. Presumably this task had the same degree of objective difficulty for both Cs and Ps. Following a success or a failure by the group, the level of difficulty perceived to have a .5 probability of successful attainment is usually shifted to some degree, upward following a success and downward following a failure (Zander & Medow, 1963).4 The direction of this shift in perceived probability is presumed to be the same for both the Cs and the Ps. When selecting a new task for the group, then, the hypothesis asserts that Cs will propose one that is close to the level currently perceived as having a .5 probability of success, and hence Cs' choices will be closer together than Ps'. In sum, Cs will choose more similar ... for the group than will Ps.

4 In the present study the direction of shifts in subjects' aspirations for the group adhered strongly to the "succeed, raise; fail, lower" rule, χ² = 176.09, p < .0001. Cs and Ps were not significantly different in the direction of these shifts.

One way to test this prediction is to compare the variances of the aspirations for the group privately proposed by Cs and Ps after the groups had worked on exactly the same task, for example, the design for Level Number 7, the beginning task for every group. 
Here the variance should be smaller among 
the group aspiration levels chosen by Cs than 
among the group aspiration levels chosen by 
Ps. Another way of testing the prediction is 
to compare the variances in the average 
amount that members shifted their aspira- 
tions for the group from trial to trial regard- 
less of the tasks the group attempted. Here 
the amount of shift in aspirations for the 
group should be more similar among the Cs 
than among the Ps. 
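
The two comparisons just described can be sketched in a few lines. This is a minimal illustration and not the authors' computation: the function names and the aspiration levels are hypothetical, and the report does not say which test statistic was applied to the variances, so only the variance ratio itself is computed here.

```python
from statistics import variance


def variance_ratio(levels_p, levels_c):
    """Ratio of sample variances, Ps over Cs.

    A ratio above 1 means the Cs' proposed levels are closer together.
    """
    return variance(levels_p) / variance(levels_c)


def mean_abs_shift(choices):
    """Average absolute change in one member's choices from trial to trial."""
    shifts = [abs(later - earlier) for earlier, later in zip(choices, choices[1:])]
    return sum(shifts) / len(shifts)


# Hypothetical design numbers (1-14) proposed after the same task.
# These values are invented for illustration; they are not data from the study.
c_levels = [7, 8, 8, 7, 9, 8]
p_levels = [5, 10, 8, 12, 6, 9]
print(variance_ratio(p_levels, c_levels))

# Second comparison: compute each member's average shift across trials,
# then compare the variance of those averages between Cs and Ps.
print(mean_abs_shift([7, 8, 10]))
```

The same `variance_ratio` function serves both comparisons: it is applied to the proposed levels in the first case and to the members' average shifts in the second.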

The results reveal that the variance of the 
aspiration levels chosen by Cs was smaller 
than the variance of the levels chosen by Ps 
following the first trial (p < .025), within 
the centralized decision groups. The variance 
in the amount that members shifted their 
aspirations for the group across the trials was 
smaller among Cs than among Ps, regardless 
of whether the group had succeeded or failed, 
again within centralized decision groups, but 
not within decentralized ones (p< .01). 
These results support the hypothesis that 
Cs more than Ps prefer aspirations for their 
group in the intermediate range of difficulty, 
closer to the .5 level of perceived probability. 
But the hypothesis is supported only when 
Cs’ effect upon the group’s outcome is greater, 
that is, in the centralized decision method. 


Evaluation of Performance 


It was predicted that, because of his 
greater involvement in the achievement of 
the group’s goal, C would rate the team’s 
performance more favorably when it had suc- 
ceeded and unfavorably when it had failed, 
than would P. This prediction was not sup- 
ported. Instead, C rated the performance of 
the team more favorably (than did P) 
regardless of whether the team had succeeded 
or failed (see Row 7 in Table 1). 

Cs were significantly less favorable about 
their personal performance than were Ps, as 
can be seen in Row 8 of Table 1. There was, 
moreover, a noteworthy interaction between 
C-P and success-failure, such that Cs de- 
valued their personal performance more than 
did Ps after a group failure, but Cs and Ps 
did not significantly differ after a group 
success (F = 19.50, p < .01). Examination of the means in each treatment of the experimental design (because of a significant second-order interaction among centrality, group success-failure, and decision type, F = 6.29, p < .025) reveals that Ps rated 
their personal performance just as high when 
the group had failed as they did when the 
group had succeeded, in the decentralized 
condition; but rated themselves low after 
a group failure and high after a group success 
in the centralized condition. Cs, in contrast, 
judged their own performance unfavorably 
if the group had failed and judged it favor- 
ably if the group had succeeded regardless 
of the method for deciding the group’s level 
of aspiration.5 

Failure by the group, in sum, had a 
stronger impact upon C’s evaluation of his 
own performance than upon P's evaluation of 
his own performance, and Ps more often avoided 
this impact when they had a part in choosing 
the group’s aspiration. 


Discussion 


In setting an aspiration for his group, a 
member is anticipating the future behavior 
of others (plus himself). The determinants 
of the member’s aspiration appear to be very 
similar to the determinants of an individual’s 
aspiration, as noted in earlier investigations 
(Zander & Medow, 1963; Zander, Medow, 
& Efron, in press). The aspect of the 
present study worthy of further scrutiny is 
that the member’s motive to have his group 
do well (dg Ach) appeared to operate, under 
the conditions provided, as an analogue of a 


5 The means relevant to the last two interactions follow. For Cs: centralized-success, 5.60; decentralized-success, 6.00; centralized-fail, 2.86; decentralized-fail, 2.78. For Ps: centralized-success, 5.80; decentralized-success, 5.60; centralized-fail, 3.83; decentralized-fail, 4.70.
personal need for achievement. A socially generated and socially concerned desire functioned in a fashion similar to a so-called 
personality characteristic. 

The relatively weaker involvement of Ps 
in the success of the group apparently made 
it possible for them to perceive the group's 
performance, particularly when that performance was poor, as an unreliable indicator 
of their personal ability. Cs, in contrast, 
evaluated their own personal performance 
poorly when the group had failed and per- 
ceived themselves as more responsible for 
the performance of the group when it had 
failed. Cs, moreover, rated the group's 
performance better than did Ps. 

The separate decision types, centralized 
and decentralized, had few differential effects. 
Results reported, and other findings too 
tangential to be included here, suggest that 
greater involvement was generated for both 
Cs and Ps in the centralized decision situa- 
tion than in the decentralized one. Loyalty 
to C’s decision appeared to be more involving 
for Ps than having a say oneself in selecting 
the task of the group. 
This study investigated the effects of anxiety on verbal conditioning. Male psychiatric patients were reinforced with "Good" for beginning sentences with I or we. Scores on the Taylor Manifest Anxiety scale defined patients' anxiety level. Awareness of correct and correlated response-reinforcement contingencies was evaluated by means of a detailed postconditioning interview. As expected, performance gains on the conditioning task were determined primarily by aware Ss, whose increments in performance were specific to the pronoun for which they were aware of a reinforcement contingency. Unaware Ss did not differ from a randomly reinforced control group. Contrary to expectation, there was no relationship between anxiety and awareness. Aware high-anxiety patients gave significantly fewer I-we sentences over trials than aware low-anxiety patients.


Taffel (1955) has reported that psychiatric patients with high anxiety (HA), as defined by scores on the Taylor (1953) Manifest Anxiety (MA) scale, showed performance gains on a verbal-conditioning task, but that low-anxiety (LA) patients did not. The conditioning task employed by Taffel required subjects to construct sentences beginning with one of six personal pronouns, and each sentence which began with either I or WE was reinforced by the experimenter's saying "Good." Since his findings were generally consistent with the results of a number of studies utilizing simple learning tasks in which the performance of HA subjects surpassed that of subjects with LA (for example, Spence & Farber, 1953; Spence & Taylor, 1951; Taylor, 1951), Taffel concluded that "degree of responsiveness" was positively related to level of anxiety. Taffel also made the observation, widely accepted in clinical practice, that anxious patients were more amenable to psychotherapy than nonanxious patients and suggested that his findings indicated that this might be due to the greater readiness of anxious patients to talk about themselves, i.e., to verbalize material having self-reference as reflected in sentences beginning with first-person pronouns.

1 The authors wish to acknowledge their gratitude to ..., Clinical Psychologist of the Durham, North Carolina, Veterans Administration Hospital, for ... in making hospital facilities and patients available to us. This research was supported by Grant MH 7446 to the first author from the National Institute of Mental Health, United States Public Health Service.

Buss and Gerjuoy (1958) and Sarason 
(1958) also investigated the effects of anx- 
iety on verbal conditioning for psychiatric 
patients, and both used variants of Taffel’s 
sentence-construction task. While Sarason’s 
results were generally consistent with those 
obtained by Taffel, the results of Buss and 
Gerjuoy were strikingly different. In the 
latter study it was found that LA patients 
showed performance gains as a function of 
reinforcement, and that the performance 
curve for HA patients remained relatively 
flat. Buss and Gerjuoy, in comparing their 
findings with Taffel’s, offered little explana- 
tion for the contradictory results of the two 
studies other than that these may have re- 
sulted from systematic population differences 
or random fluctuations in sampling. How- 
ever, the findings in a number of recent 
verbal-conditioning studies (for example, 
DeNike & Spielberger, 1963; Dulany, 1961, 
1962; Levin, 1961; Spielberger, 1962) sug- 
gest that the effects of anxiety on condition- 
ing in the Taffel, Sarason, and Buss-Gerjuoy 
experiments may have resulted from the per- 
formance of subjects whose awareness of the 
contingency of reinforcement went undetected 
because of the insensitivity of methods em- 
ployed in these studies for evaluating 
awareness. 
Taffel defined awareness in terms of subjects' responses to a brief, four-question interview conducted after the completion of the 
conditioning task. He reported that none of 
his 90 subjects could verbalize the purpose 
of the study nor the relationship between 
their responses and the experimenter’s be- 
havior. Working with the identical verbal- 
conditioning task and a similar population 
of male psychiatric inpatients, Levin (1961) 
found that only 3 of 60 subjects verbalized 
a correct response-reinforcement contingency 
to Taffel’s four-question interview, but that 
16 additional subjects verbalized a correct 
contingency in response to an extended inter- 
view. Furthermore, the conditioning effects 
noted in Levin’s study were determined 
primarily by the performance of these aware 
subjects. Moreover, subjects reinforced for 
constructing I-WE sentences who verbalized a 
response-reinforcement contingency for only 
one of these pronouns showed performance 
gains specific to the pronoun for which they 
were aware of the contingency of reinforce- 
ment. 

With respect to awareness, Sarason in- 
dicates that a brief interview was conducted 
to determine the subjects’ ability to verbalize 
the response-reinforcement contingency. Ac- 
cording to Sarason, only 7 subjects were even 
aware of the reinforcement “mm-hm” which 
was uttered by the experimenter when- 
ever the subject used a verb of the class 
verbal activity (for example, “talked,” 
“whispered”) and “None of the Ss as judged 
by the interviews was aware of the principle 
on the basis of which E emitted ‘mm-hm’ 
[Sarason, 1958, p. 341].” That only 7 of 60 
subjects noticed the “mm-hm” while 48 
of the 60 subjects in Levin’s experiment were 
aware of the “Good” could reflect the fact 
that “mm-hm” was less salient as a re- 
inforcing stimulus than “Good.” No informa- 
tion was provided by Sarason on whether 
subjects who were aware of the reinforcement 
had hypotheses involving response-reinforce- 
ment contingencies which were correlated 
with the experimenter’s principle of reinforce- 
ment. Tatz (1960) and Dulany (1961) have 
demonstrated that subjects with correlated 
hypotheses showed significant performance 
gains on verbal conditioning tasks, whereas 
unaware subjects did not. 

Additional evidence consistent with the 
hypothesis that Taffel’s and Sarason’s find- 
ings were influenced by the awareness of 
their subjects is provided in two studies by 
Eriksen and his colleagues. Eriksen, Kuethe, 
and Sullivan (1958) found that college stu- 
dents with high scores on the MMPI Psy- 
chasthenia (Pt) scale were partially aware in 
a pseudotachistoscopic verbal conditioning 
task significantly more often than were sub- 
jects who scored low on this scale, and 
Eriksen and Davids (1955) reported a cor- 
relation of .92 between the MA scale and 
the Pt scale, suggesting that the two scales 
measure essentially the same qualities. To- 
gether these findings suggest the possibility 
that there were more aware or partially aware 
subjects in the HA groups in the Taffel and 
Sarason studies than in their LA groups, and 
that the performance gains obtained for HA 
subjects in these studies were determined by 
the performance of undetected aware or 
partially aware subjects. 

Buss and Gerjuoy (1958) did not provide 
any information on the awareness of the sub- 
jects in their anxiety groups. However, in 
another article based on the same data, Buss, 
Gerjuoy, and Zusman (1958) indicated that 
the experimenter asked each patient if he 
knew what the experiment was about, and if 
he knew of any connection between what he 
said and what the experimenter did. In response to these questions, 16 of 52 patients were able to state a correct response-reinforcement contingency even though they were not questioned until after an extinction period in which they constructed 80 sentences without reinforcement. That a substantial proportion of Buss and Gerjuoy's subjects became aware of the response-reinforcement contingency is not too surprising when it is noted that some of their subjects were reinforced with chips exchangeable for cigarettes and candy. Moreover, according to Buss et al., the cigarettes-candy group received instructions at the beginning of the conditioning task which were more likely to produce awareness than the instructions given subjects in their study who were reinforced with "Good." And contrary to the statement of Buss and Gerjuoy (1958) that "differences between the Cigarettes-Candy group and the Good group were small and random [p. ...9]," data presented by Buss et al. (1958, p. 143, Table 4) clearly indicated that this was not the case. Thus, in the Buss-Gerjuoy experiment it would appear that the effects of anxiety on verbal conditioning were ...ly confounded with those of awareness, conditions of reinforcement, and instructions. 
The goals of the present study were to ... the relationship between anxiety and verbal conditioning when subjects' awareness of response-reinforcement contingencies was evaluated in a detailed postconditioning interview. On the basis of the previous findings, it was expected ... correct or correlated contingencies, and that only aware subjects would show verbal conditioning effects. Given the conflicting ... methodological deficiencies associated with these studies with respect to the evaluation of awareness, there was little basis for predicting the effects of anxiety on the verbal-conditioning performance of aware subjects.


METHOD 


The subjects were 48 male neurotic and psychotic "open"-ward patients from the Durham, North Carolina, Veterans Administration Hospital. The patients were told that the experiment was a part of the hospital's routine testing program and were assigned appointment times for the "test." All of the patients approached in this manner agreed to participate. The patients ranged in age from ... to 56, with a median age of 36. Only 6 patients had been in the hospital for more than 2 months at the time of the experiment, and none had been hospitalized for as long as 9 months; the median length of hospitalization was 12 days. Scores on the Taylor MA scale were obtained for each subject from either an MMPI which was routinely administered to patients referred for clinical psychological ... items) containing the MA scale. The latter was given to those subjects who had not been referred for clinical evaluation. All of the subjects took the MA scale within 18 days of their participation in the experiment, and all but 4 took it within 5 days of the time they participated. The median interval between taking the MA scale and participating in the conditioning task was 3 days. 
The subjects were assigned to experimental and 
control groups on a random basis in a manner such 
that 2 of every 3 subjects were in the experimental 
group. It was necessary to eliminate 4 subjects 
whose lack of reading skills interfered with their 
performance on the conditioning task. Three of 
these subjects had been assigned to the control group 
and the other to the experimental group. One ad- 
ditional subject in the experimental group was 
subsequently eliminated because he began all of his 
sentences with first-person pronouns during the 
operant period, which precluded the possibility that 
his performance could be positively influenced by 
reinforcement. Of the remaining 43 subjects, 31 
were in the experimental group and 12 were in the 
control group. There were no apparent differences 
between the experimental and control groups in 
age, length of hospitalization, or MA scale scores. 
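
A 2-to-1 random assignment of this kind can be sketched as follows. The blocking scheme is an assumption made for illustration: the report says only that assignment was random "in a manner such that 2 of every 3 subjects were in the experimental group." The function name `assign_groups` and the seed are hypothetical.

```python
import random


def assign_groups(n_subjects, seed=None):
    """Assign subjects in blocks of three: two experimental, one control."""
    rng = random.Random(seed)
    assignments = []
    while len(assignments) < n_subjects:
        # Within each block the order of the three slots is shuffled.
        block = ["experimental", "experimental", "control"]
        rng.shuffle(block)
        assignments.extend(block)
    return assignments[:n_subjects]


groups = assign_groups(48, seed=1)
print(groups.count("experimental"), groups.count("control"))
```

Strict blocking of 48 subjects yields 32 experimental and 16 control assignments. The group sizes reported after exclusions (31 and 12, following the loss of 2 experimental and 3 control subjects) imply an initial split of 33 and 15, so the actual procedure was evidently not strictly blocked in this way.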


Procedure 


The conditioning procedures were carried out in 
a small room in which there were a table, two 
chairs, and a tape recorder. Each subject was tested 
individually on a sentence-construction task which 
was essentially the same as that used by Taffel. 
The stimulus materials consisted of 100 3 X 5 inch, 
white index cards. There was a different past-tense 
verb in the center of each card. The six personal 
pronouns I, WE, YOU, HE, SHE, and THEY were 
typed in a different sequence on each card below 
the verb. The stimulus cards were the same as 
those previously employed by Levin (1961) and 
Spielberger, Levin, and Shepard (1962). 

The experimenter who carried out the condition- 
ing procedures introduced the experimental task by 
stating, “This is a part of a routine testing program 
that some patients in all Veterans Administration 
Hospitals are taking.” The subject was then seated 
at the opposite side of the table facing the experi- 
menter, and the recorder was turned on. A brief 
conversational interview, similar to that described by 
Taffel, served to establish rapport. In this inter- 
view, the subject was asked his age, occupation, 
highest school grade completed, and when he was 
admitted to the hospital. The following instructions, 
taken verbatim from Taffel (1955), were then given 
to the subject: 


When I turn these cards over [E pointed to 
stimulus cards . . .] you will see a word in the 
center of each card. I want you to make up a 
sentence using this word. Below the word in the 
center, you will see a group of other words. Take 
any one of these and use it to start your sentence. 
[Pause] Now, it doesn’t matter whether the 
sentence you make up is long or short, or even 
if it is complicated or simple. It is important that 
you answer with the first sentence that enters 
your mind. It isn’t easy to do this but you will 
find that if you try to answer as quickly as pos- 
sible, you are more likely to give the first thing 
that comes into your mind. Any questions? 
[Pause] Let’s begin [p. 497]. 
All questions were answered by repeating appropriate parts of the instructions. The deck of 
stimulus cards was then presented to the subject 
who was instructed to expose the cards one at a 
time and to turn each card face down after he had 
used it to make a sentence. Thus, each subject proceeded at his own pace, and the experimenter recorded the initial pronoun in each sentence con- 
structed by the subject. For the first 20 cards 
(trials), the procedure was the same for all subjects. 
No reinforcement was given on these trials, which 
served to establish the subject’s operant rate for 
I-WE sentences. On Trials 21-100, subjects in the 
experimental group were reinforced with the word 
“Good” stated by the experimenter in a flat, un- 
emotional tone after each sentence which began 
with I or WE. After the operant period, subjects in 
the control group were reinforced with “Good” on 
a predetermined random basis for 40% of their 
sentences. Providing random reinforcement for the 
control group in the present study, rather than 
giving them no reinforcement as was Taffel’s pro- 
cedure, served to make the experimental and con- 
trol groups more comparable with respect to the 
general encouragement provided the subjects by the 
experimenter’s saying “Good.” 
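
The reinforcement rules for the two groups can be summarized in a short sketch. The names below are hypothetical, and one detail is an assumption: the control group's "predetermined random basis for 40% of their sentences" is taken here to mean 32 of the 80 trials following the operant period, drawn in advance.

```python
import random

REINFORCED_PRONOUNS = {"I", "WE"}
OPERANT_TRIALS = 20   # Trials 1-20: no reinforcement for any subject
TOTAL_TRIALS = 100


def control_schedule(seed=None):
    """Draw in advance the trials on which a control subject hears "Good".

    Assumption: 40% of the 80 post-operant trials, i.e. 32 trials.
    """
    rng = random.Random(seed)
    post_operant = range(OPERANT_TRIALS + 1, TOTAL_TRIALS + 1)
    return set(rng.sample(post_operant, k=32))


def says_good(group, trial, pronoun, schedule=frozenset()):
    """Return True if the experimenter says "Good" after this sentence."""
    if trial <= OPERANT_TRIALS:
        return False
    if group == "experimental":
        # Systematic reinforcement: every sentence beginning with I or WE.
        return pronoun in REINFORCED_PRONOUNS
    # Control group: reinforcement is unrelated to the pronoun chosen.
    return trial in schedule


schedule = control_schedule(seed=0)
print(says_good("experimental", 21, "WE"))
print(sum(says_good("control", t, "HE", schedule) for t in range(1, 101)))
```

Because the control schedule is fixed before the session, the amount of encouragement is comparable across groups while remaining independent of what the control subject says.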

Upon completing the conditioning task, the sub- 
ject was directed to an adjoining room in which 
he was interviewed regarding his awareness of the 
response-reinforcement contingency by a second 
experimenter who had no knowledge of the sub- 
ject’s performance on the conditioning task. The 
first four questions of the interview corresponded 
with those employed by Taffel. The remaining 
questions, which are reported in Spielberger (1962, 
p. 86), became more focused and specific as the 
interview progressed. All of the subjects’ verbaliza- 
tions relevant to the reinforcing stimulus, and to the 
use of particular pronouns in constructing sentences, 
were recorded. Except for the more detailed post- 
conditioning interview and the random reinforcement 
of the control group, the experimental procedures in 
the present study differed from Taffel’s procedures 
in only relatively minor respects.2 


2The present study differed from Taffel’s in the 
following respects: (a) There were 80 reinforced 
conditioning trials as compared with the 60 rein- 
forced trials in Taffel’s study. (b) The experimenter 
who conducted the postconditioning interview had 
no knowledge of the subjects’ performance on the 
conditioning task, whereas in Taffel’s study the 
same experimenter carried out both the conditioning 
procedures and the awareness interview. (c) The 
MA scale scores for some of the subjects were ob- 
tained from an MMPI given as part of the hospital’s 
testing program, which made it possible for the 
experimental procedures to be carried out with these 
subjects in a single experimental session. However, 
for a majority of the subjects, the MA scale was 
administered during one session and the conditioning 
task was given during a second session as was the 
case in Taffel’s study. (d) In the present study the 
subject turned the cards himself and proceeded at 
RESULTS 


The relationship between anxiety and 
verbal conditioning for subjects in the experi- 
mental group of the present study was first 
examined without taking awareness into ac- 
count. In this analysis, Taffel’s cutoff scores 
were used to identify high-, medium-, and 
low-anxiety groups. The cutoff scores for 
these three anxiety groups were: low anxiety 
(LA), 0-24; medium anxiety (MA), 25-34; 
and high anxiety (HA), 35-50. The number 
of systematically reinforced subjects in the 
LA, MA, and HA groups was 13, 11, and 7, 
respectively. 
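
The cutoff scores above amount to a simple classification rule, sketched here for concreteness. The function name `anxiety_group` is hypothetical; the score ranges are those given in the text. Note that "MA" is used in the report both for the medium-anxiety group and for the Manifest Anxiety scale itself.

```python
def anxiety_group(ma_scale_score):
    """Classify a Taylor MA scale score (0-50) using Taffel's cutoffs."""
    if not 0 <= ma_scale_score <= 50:
        raise ValueError("MA scale scores range from 0 to 50")
    if ma_scale_score <= 24:
        return "LA"   # low anxiety, 0-24
    if ma_scale_score <= 34:
        return "MA"   # medium anxiety, 25-34
    return "HA"       # high anxiety, 35-50


print([anxiety_group(score) for score in (24, 25, 34, 35)])
```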

Conditioning curves for the three anxiety 
groups of the present study are compared 
with those of the comparable groups in the 
Taffel and Buss-Gerjuoy experiments in Fig- 
ure 1, in which the conditioning data for the 
three studies were grouped in blocks of 20 
trials and the first trial block consisted of 
nonreinforced trials for establishing operant 
level. Sarason’s findings were not included 
because he did not report conditioning curves 
for his anxiety groups which, moreover, were 
not comparable to the HA, MA, and LA 
groups in the other studies. It may be observed
in Figure 1 that, although at least two
of the three anxiety groups in each of the
experiments showed performance gains, there
was little consistency between studies with
respect to the particular groups which improved.

When the data for the three anxiety groups
of the present study were evaluated by an
analysis of variance, with anxiety a between-subjects
variable and trials a within-subjects
variable (Lindquist, 1953, Type I design),
only the Trials effect (F = 11.48, df = 4/112,
p < .001) was significant. This finding and
the failure to obtain either an effect of Anxiety
or an Anxiety X Trials interaction, when
considered along with Figure 1, would seem
to indicate that when awareness was not
taken into account, all three anxiety groups
improved and differences in the magnitude
and rate of improvement among the three
anxiety groups could be attributed to chance.
The possibility that the significant conditioning
effects noted in Figure 1 were determined
by the performance of aware subjects
will be examined after the relationship between
anxiety and awareness is evaluated in
the next section.

his own pace, whereas in Taffel’s study the experimenter
presented each card for 15 seconds.


Fig. 1. Conditioning curves for the LA, MA, and HA groups in the present study compared with those of
the comparable anxiety groups in the Taffel and Buss-Gerjuoy experiments. (Panel shown: Present Study;
abscissa: Trials.)


Anxiety and Awareness


On the basis of the findings of Eriksen
et al. (1958) it was hypothesized that a larger
proportion of HA subjects would become
aware of correct or correlated response-reinforcement
contingencies. In order to test this
hypothesis and subsequently to evaluate the
relationship between anxiety and performance,
each subject in the experimental group
was independently classified as either aware
or unaware by two judges.3 The judges, who
had no actual contact with the subjects nor
any knowledge of their performance on the
conditioning task, were instructed to rate a
subject as aware if, in response to the postconditioning
interview, the subject clearly
stated that reinforcement, i.e., the experimenter’s
“Good,” approval, or encouragement,
was in some way related to his giving
I-WE sentences, and that he had been aware
of this relationship during the conditioning
trials. Thus, a subject was rated as aware if
he stated a correct contingency, i.e., that
“Good” followed sentences beginning with
either I or WE, or if he verbalized a correlated
contingency, i.e., any hypothesis relating reinforcement
to his verbal behavior which if
acted upon would, in the opinion of the
judges, result in his constructing more I-WE
sentences over trials. Subjects who, according
to the judges, did not verbalize correct or
correlated response-reinforcement contingencies
were classified as unaware.


3 [Illegible] of randomly reinforced controls
[illegible] judges and two control subjects
[illegible]. One of these indicated that he
[illegible] the experimenter said “Good”
when he talked about himself; the other stated that
he believed “Good” followed sentences beginning
with I and [illegible]. The mean number of I-WE sentences
given by these subjects on the operant and
reinforced trial blocks was 9.5, 12.5, 10.5, 11.0, and
12.0. Since these subjects received 40% random
reinforcement and had relatively high operant rates
for I-WE sentences, it might be speculated that the
reinforcing stimulus provided sufficient information
for them to develop and report correlated hypotheses,
but not enough for them to act upon their
hypotheses with consistency. We are grateful to Ira
H. Bernstein and Richard G. Ratliff, who served
as the judges for this experiment.

There was agreement between the judges in 
the dichotomous classification of 28 of the 31 
subjects in the experimental group. Eleven of 
the subjects on whom the judges agreed were 
rated as aware and 17 were rated as unaware. 
The classification of the 3 subjects on whom 
the judges disagreed was determined by the 
decision of a third judge, the experimenter 
who had conducted the postconditioning 
interview. On this basis, 2 of these subjects 
were assigned to the aware group. Of the 
13 aware subjects, only 3 verbalized correct 
or correlated contingencies in response to the 
four questions which corresponded to Taffel’s 
entire interview. With respect to awareness of 
the reinforcing stimulus, 20 subjects spon- 
taneously indicated in the postconditioning 
interview that they had noticed the “Good” 
during the conditioning trials, and 10 addi- 
tional subjects acknowledged that they 
noticed the “Good” in response to the inter- 
view question on which they were confronted 
with it (“Actually, he did occasionally say 
‘good.’ Thinking back now to when you were 
going through the cards, do you remember his 
saying ‘good’?”). Thus, 30 of the 31 subjects 
in the experimental group indicated that they 
had been aware of the “Good” during the 
conditioning trials. 

The number of aware and unaware subjects 
in the LA, MA, and HA groups is indicated 
in Table 1, in which it may be noted that 
the proportions of aware subjects in the three 
anxiety groups were not markedly different. 


TABLE 1


NUMBER OF AWARE AND UNAWARE SUBJECTS IN THE
LOW-, MEDIUM-, AND HIGH-ANXIETY GROUPS


Group     N    Aware    Unaware
HA        7      3         4
MA       11      3         8
LA       13      7         6


Fig. 2. Conditioning curves. (Part A: The
mean number of first-person pronoun responses of
the aware, unaware, and control groups on the
conditioning task. Part B: Conditioning curves for the
RPa and the RPu for the nine subjects who were
aware of the contingency of reinforcement for only
one of the reinforced pronouns. Ordinate: mean number
of first-person pronoun responses; abscissa: blocks
of twenty trials.)


For purposes of statistical analysis, the HA
and MA groups were combined. The nonsignificant
chi-square (χ² = .60) computed
for the resulting 2 X 2 contingency table indicated
that, contrary to expectation, there was
no relationship between anxiety and awareness.
The possibility that the conditioning effects
noted for the three anxiety groups in
Figure 1 were determined by the performance
of aware subjects is evaluated in the next
section.
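
As a check on the arithmetic, the following sketch recomputes the chi-square from the Table 1 frequencies with the HA and MA rows pooled. It assumes, since the text does not say so, that the reported value includes Yates's continuity correction; the function name `chi_square_2x2` is illustrative.

```python
def chi_square_2x2(a, b, c, d, yates=True):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] (1 degree of freedom).

    With yates=True the continuity correction subtracts N/2 from |ad - bc|
    before squaring (assumed here; the article does not name the correction).
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2.0, 0.0)
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * diff ** 2 / denom


# Table 1 with HA and MA combined; columns are aware, unaware.
high_aware, high_unaware = 3 + 3, 4 + 8   # HA + MA
low_aware, low_unaware = 7, 6             # LA

corrected = chi_square_2x2(high_aware, high_unaware, low_aware, low_unaware)
uncorrected = chi_square_2x2(high_aware, high_unaware, low_aware, low_unaware,
                             yates=False)
```

Under that assumption the corrected statistic rounds to the .60 given in the text; without the correction the same table gives a value of about 1.30, which is also nonsignificant at 1 df.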


Awareness and Verbal Conditioning 


The performance on the conditioning task
of the aware, unaware, and control groups of
the present experiment is presented in Figure
2, Part A. It may be observed that while the
mean number of I-WE sentences given by the
aware group increased markedly over trials,
the curves for the unaware and control groups
were relatively flat. These data were evaluated
by an analysis of variance (Type I
design) which yielded a significant Groups X
Trials interaction (F = 2.58, df = 8/160,
p < .05). This finding, when considered along
with the curves in Figure 2, Part A, would appear
to indicate that aware subjects gave increasingly
more I-WE sentences over trials,
whereas unaware and control subjects showed
little tendency to do so. The significant effects
of Groups (F = 3.64, df = 2/40, p < .05) and
Trials (F = 12.24, df = 4/160, p < .001)
obtained in this analysis would also appear
to be due primarily to the performance of
aware subjects.
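
The degrees of freedom quoted for these analyses follow from the layout of a groups-by-trials design with repeated measures on trials. The sketch below reproduces them; the helper `type1_df` is hypothetical, and the figure of 12 control subjects is inferred from the reported error df rather than stated in this section.

```python
def type1_df(n_subjects, n_groups, n_trial_blocks):
    """Numerator/denominator df for a groups x trials mixed design.

    Groups is tested against the between-subjects error; Trials and the
    Groups x Trials interaction are tested against the within-subjects error.
    """
    error_between = n_subjects - n_groups
    error_within = error_between * (n_trial_blocks - 1)
    return {
        "groups": (n_groups - 1, error_between),
        "trials": (n_trial_blocks - 1, error_within),
        "groups_x_trials": ((n_groups - 1) * (n_trial_blocks - 1), error_within),
    }


# Aware, unaware, and control groups over five trial blocks.
# Assumption: 31 experimental subjects plus 12 controls (inferred, not stated).
overall = type1_df(n_subjects=31 + 12, n_groups=3, n_trial_blocks=5)

# The 13 aware subjects split into high- and low-anxiety groups.
aware_only = type1_df(n_subjects=13, n_groups=2, n_trial_blocks=5)
```

With these inputs the sketch returns 2/40, 4/160, and 8/160 for the overall analysis, matching the df reported above, and 1/11 and 4/44 for the analysis restricted to aware subjects.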
The impression that the results in the overall
analysis of variance were determined
largely by the performance of aware subjects
was confirmed in further analyses of the conditioning
data in which the aware, unaware,
and control groups were compared with one
another. The results of these separate analyses
of variance (Type I design) are summarized
in Table 2. In these analyses, the
most important findings were the significant
Groups X Trials interactions for the aware
versus unaware and aware versus control
comparisons, which indicated that the proportion
of I-WE sentences given by the aware
subjects increased more rapidly over trials
than for subjects in the unaware and control
groups. The finding in the unaware versus
control analysis that only the Trials effect
was significant, when considered along with the
curves for these groups in Figure 2, Part A,
showed that although the unaware and control
groups did not differ from one another,
they evidenced a slight but highly
significant tendency to give more I-WE sentences
over the five trial blocks. Analyses of
linear trend (Winer, 1962, pp. 71-73) for
the unaware and control groups yielded F
ratios of 9.29 (df = 1/68, p < .01) and 2.84
(df = 1/44, p < .10), respectively, indicating
that the unaware group showed significant
performance gains over successive trial
blocks, and that the performance gains of the
control group approached significance.
Of the 13 aware subjects, 7 verbalized
response-reinforcement contingencies involving
only sentences beginning with I, and 2
subjects related the reinforcement only to
sentences beginning with WE. The conditioning
data for these 9 aware subjects were
evaluated in order to determine if they
responded differentially to the reinforced pronoun
for which they were aware of the reinforcement
contingency (RPa), as compared
to the reinforced pronoun for which they
were unaware of the reinforcement contingency
(RPu). The mean frequencies for these
subjects for sentences beginning with the RPa
and the RPu over the five trial blocks are
presented in Figure 2, Part B, in which it may
be observed that the number of sentences beginning
with the RPa increased markedly
over trials while the number of sentences
beginning with the RPu declined. Since there
was little overlap between the RPa and RPu
score distributions after the second trial block,
statistical evaluation of the relationship depicted
in Figure 2, Part B, was deemed unnecessary.
The specificity of conditioning in
this study to the pronoun for which subjects
were aware of correct or correlated contingencies
was consistent with the results obtained
in other recent verbal conditioning
experiments (Levin, 1961; Spielberger &
Levin, 1962; Spielberger et al., 1962). Only
the question of whether or not anxiety affected
the performance of aware subjects
remains to be evaluated.


TABLE 2


SUMMARY OF ANALYSES OF VARIANCE OF CONDITIONING SCORES FOR THE COMPARISONS
OF THE AWARE, UNAWARE, AND CONTROL GROUPS


                 Aware versus unaware     Aware versus controls    Unaware versus controls
Source            df     MS       F        df     MS       F        df     MS       F
Groups (A)         1   [illeg.] [illeg.]    1   [illeg.]  5.39       1   [illeg.] [illeg.]
Error (b)         29   [illeg.]            23   [illeg.]            28    74.09
Trials (B)         4   113.93   11.85       4    30.25    9.35**     4    29.27    4.19**
A X B              4    29.19    3.04*      4    38.10    3.99**     4     2.21     -
Error (w)        116     9.61              92     9.55             112     6.99


Fig. 3. The mean number of first-person pronoun
responses of the HA and LA groups on the
conditioning task. (Curves shown: Low Anxiety (LA);
High Anxiety (HA & MA). Ordinate: mean number of
first-person pronoun responses; abscissa: blocks of
twenty trials.)


Effects of Anxiety on the Conditioning of 
Aware Subjects 


Since there were only three aware subjects 
in each of the HA and MA groups (see 
Table 1), the data for these groups were 
pooled. For the aware subjects in the com- 
bined HA and MA groups, redesignated the 
high-anxiety group, performance on the con- 
ditioning task is compared in Figure 3 with 
that of the aware subjects in the LA group. 
These data were evaluated by an analysis 
of variance (Type I design) which yielded 
significant effects of Groups (F = 15.09, 
df = 1/11, p<.01) and Trials (F = 9.59, 
df = 4/44, p < .001). When considered along 
with the curves depicted in Figure 3, these 
findings showed that the high-anxiety group 
gave consistently fewer I-wE sentences over 
trials than the low-anxiety group, and that 
for both groups the number of I-WE sentences
increased in successive trial blocks.
These findings, along with the failure to find
a significant Groups X Trials interaction, indicated
that both anxiety groups gave more
I-WE sentences over trials, the low-anxiety
group gave more I-WE sentences than the
high-anxiety group, and the differences
in the slopes of the curves for the
two groups could be attributed to chance.


Discussion 


In this investigation, subjects given systematic
reinforcement who were aware that
reinforcement was in some way contingent
upon their beginning sentences with I or WE
gave significantly more I-WE sentences over
trials than did unaware or control subjects.
Although subjects judged to be unaware of
correct or correlated response-reinforcement
contingencies also gave significantly more
I-WE sentences over trials, the performance
of the unaware subjects did not differ from
that of the control subjects who received
random reinforcement. Thus, for the three
anxiety groups in the present study, performance
gains on the conditioning task were
determined primarily by subjects who were
aware of correct or correlated contingencies.
These findings, which were consistent with
the results of a number of recent verbal
conditioning experiments (DeNike & Spielberger,
1963; Dulany, 1961, 1962; [...],
1961; Spielberger, 1962; Spielberger, [...]
& Howard, 1963), suggest that previously
reported relationships between anxiety and
verbal conditioning (Buss & Gerjuoy, [...];
Sarason, 1958; Taffel, 1955) may have been
unduly influenced by the performance of
subjects whose awareness went undetected because
of insensitive interviewing procedures.

On the basis of previous findings (Eriksen
& Davids, 1955; Eriksen et al., 1958), it
was expected in the present study that a
larger proportion of HA subjects would
become aware. It was found, however, that
the proportion of aware subjects in the HA,
MA, and LA groups did not differ significantly.
Perhaps this is not too surprising
when it is noted that the anxiety scores of
the patients in the present study undoubtedly
averaged much higher than those of the
college students for whom Eriksen et al.
found a relationship between Pt scores and
awareness. At the higher anxiety levels which
characterized our psychiatric patients, it is
possible that there were competing response
tendencies produced by anxiety which neu- 
tralized the advantages of the obsessive and 
intellectualizing defenses generally attributed 
to HA patients. HA college students, on the 
other hand, might be expected because of 
their intellectualizing defenses to be more 
sensitive to experimental conditions and more 
inquisitive about them than LA students, and 
hence more likely to become aware of 
response-reinforcement contingencies. That 
obsessive and intellectualizing defenses are 
associated with HA for college students was 
clearly indicated by the strong positive relationship
between the MA scale and the Pt
scale reported by Eriksen and Davids. 

The finding in the present study that 
aware subjects with HA gave consistently 
fewer I-wE sentences over trials than aware 
subjects with LA was exactly opposite to the 
relationship between anxiety and verbal con- 
ditioning reported by Taffel (1955) and 
Sarason (1958). The fact that our patients 
with HA were less willing to talk about themselves
than were those with LA, as reflected
in the number of sentences which each group 
began with first-person pronouns, would seem 
to challenge Taffel’s suggestion that HA pa- 
tients are more amenable to psychotherapy 
because they are more responsive and ready 
to discuss their problems. However, our finding
is consistent with observations from
clinical practice that HA patients are often
[illegible] and harbor deep-seated feelings
[illegible]. On the assumption that such
feelings stem from life experiences of failure and
[illegible] interactions with other people, and
that the recounting of these experiences increases
anxiety, it might be expected that
HA patients would be more reluctant to talk
about themselves than patients with LA. This
interpretation is supported by the significant
negative correlation (r = -.42, p < .01) obtained
in the present study between MA scale
scores and the number of I-WE sentences
given on the operant trial block. It is also
supported by evidence reported by Rogers
(1960) that anxious college students talked
less in the operant period of a “quasi-therapy
interview” verbal-conditioning task than nonanxious
students, and referred to themselves
more unfavorably in what they said. Thus,
when they are not specifically encouraged by
reinforcement to express themselves, it would
appear that HA subjects talk less about
themselves and refer to themselves less
favorably than LA subjects.

HA patients may have been especially 
reluctant to talk about themselves in the 
relatively threatening context of the present 
study, which was introduced to them as a 
part of the Veterans Administration Hospi- 
tal’s testing program being carried out by 
the Psychology Department. Furthermore, 
the subjects’ initial contact with the experi- 
menter in the present study was at the time 
that the conditioning procedures were carried 
out. In Taffel’s study, the HA patients may 
have been more responsive to reinforcement 
because the experimenter had previous con- 
tact with them, and this might have served 
to reduce the threatening characteristics of 
the conditioning task. In a supportive context 
in which patients are encouraged to express 
themselves and especially where they are 
encouraged to express feelings of self-criticism 
and personal inadequacy (for example, in a 
psychotherapy relationship), HA patients 
might be more willing to talk about them- 
selves than LA patients. Evidence consistent 
with this hypothesis was recently reported by 
Sarason and Ganzer (1962). They found that 
college students with HA gave significantly 
more “negative self-references” than did LA 
students when this particular response class 
was reinforced in a verbal-conditioning
experiment. Sarason and Ganzer’s findings,
in conjunction with those of the present 
study, suggest that HA subjects may be 
generally more reluctant to talk about them- 
selves than LA subjects, but when they are 
encouraged to express negative things about 
themselves they will give more negative self- 
references than LA subjects. 

Although the unaware group in the present
study did not differ from the randomly reinforced
control group, the fact that it showed
a slight but significant tendency to give more
I-WE sentences over trials could be interpreted
as evidence of learning without awareness.
However, in a post hoc evaluation of
the interviews of unaware subjects, it was
discovered that the performance gains for the
unaware group were determined entirely by
eight subjects who verbalized response dispositions
that would lead them to construct
I-WE sentences. For example, two subjects
who regarded the experimental situation as a
personality test thought that they were supposed
to make up sentences about themselves
and their families; another took the instructions
to respond “as quickly as possible”
quite literally, and indicated that since he
thought he was supposed to respond quickly,
he began most of his sentences with I “because
it was faster that way.” Had these
unaware subjects stated that the experimenter’s
“Good” was contingent upon their constructing
sentences in accordance with their
hypotheses about the experiment, they would
have been classified as aware of correlated
response-reinforcement contingencies.

The tendency for subjects with response
dispositions to construct I-WE sentences to
give increasing numbers of I-WE sentences
over trials suggests either that these subjects
were influenced by the reinforcement to act
upon their response dispositions without
awareness that reinforcement was affecting
them, or that they were aware of correlated
response-reinforcement contingencies which
were not detected in the postconditioning
interview. In any event, the 10 subjects
who were unaware of correct or correlated
response-reinforcement contingencies, and
who did not verbalize correlated response dispositions,
showed no tendency to give more
I-WE sentences over trials. Clearly, reinforcement
did not produce automatic increments
in the performance of these subjects. If
learning without awareness occurred in this
experiment, it was limited to the eight subjects
who verbalized dispositions to give the
reinforced response class. However, for the
majority of the subjects who showed improved
performance on the conditioning task,
the data of this experiment seemed consistent
with the hypothesis that awareness of the
contingency of reinforcement mediated their
performance gains.

The finding in the present study that 
randomly reinforced subjects tended to give 
more I-WE sentences over trials suggests that 
a control group which receives no reinforcement
may not be appropriate in verbal-conditioning
experiments. With but a single
exception, all of the subjects in this study
noticed the reinforcing stimulus, and most
subjects interpreted the “Good” as a form
of encouragement. A control group which
receives random reinforcement permits an
assessment of the possible selective effects of
nonspecific reinforcement (general encouragement)
on preexistent response dispositions.


BEHAVIORAL DIFFERENCES BETWEEN PROCESS AND
REACTIVE SCHIZOPHRENICS IN A MONOTONOUS
REPETITIVE TASK 1


MARTIN ZLOTOWSKI


Veterans Administration Hospital, Coatesville, Pennsylvania


Process and reactive patients as well as nonpsychotic and normal Ss were
tested on a prolonged repetitive task which allowed for thousands of variations
in response. All Ss were tested under instructions to produce as many different
responses as they could. The data were analyzed in terms of type and manner
in which the responses were produced, and the responses were classified into
a number of categories which represented differing degrees of change. Process
patients differed from the other 3 groups in that their responses were more
stereotyped and more resistant to change. The reactives did not differ in their
performance from the nonpsychotics and normals.


When schizophrenic patients with poor premorbid
adjustment (process) are compared
with those whose premorbid adjustment is
good (reactives) several consistent differences
between these two groups are obtained.2 In
their responses on a variety of psychological
tests (Becker, 1956; Fine & Zimet, 1959;
Kantor, Wallner, & Winder, 1953; Kantor &
Winder, 1959) reactives generally exceed
process patients in level of personality development,
and the behavior of the two groups
differs in a variety of laboratory tasks
(Reisman, 1960; Smith, 1959; Zlotowski &
Bakan, 1963).

When task conditions are constant over
many trials, the performance of reactives
shows a decrement in later trials, but process
subjects continue to improve; when conditions
are changed, reactives show immediate improvement
but process subjects show an initial
decrement (Smith, 1959). In a prolonged
monotonous guessing task reactives spontaneously
varied their guesses more than did the
process group (Zlotowski & Bakan, 1963).
These findings suggest that willingness, or
ability to vary behavior is more characteristic
of reactive than of process patients. If
this is generally true, then the former should
manifest a greater ability in solving problems
when change in behavior is demanded.

The present experiment investigated the
differences in performance between process
and reactive schizophrenics on a prolonged
repetitive task which allowed for thousands
of variations in response under the following
conditions: When subjects were instructed to
find many different solutions and when they
were told to find a solution which was different
from the one they had just used.


1 The present study is based on a doctoral dissertation
submitted to Michigan State University
as part of the requirements for the PhD degree in
Psychology. The author would like to express his
gratitude to the members of his committee: M. Ray
Denny, Paul Bakan, Milton Rokeach, and Louis L.
McQuitty; to the staff of the Battle Creek Veterans
Administration Hospital where this study was carried
out as part of a larger project headed by
Stewart G. Armitage; and to James C. Diggory for
his editorial suggestions.

Part of this study was presented in summary
form at the 1961 American Psychological Association
Convention held in New York City.

2 A reactive patient is one who has shown a good
premorbid adjustment, generally with a well-defined
stress situation leading to the onset of psychotic
symptoms, and who consequently has a good prognosis
for recovery. A process patient has shown poor
premorbid adjustment, where the final psychotic
symptoms are an exaggeration of already existing
personality pattern rather than a product of a specific
stress situation, and who consequently has poor
prognosis for recovery.


METHOD 


Apparatus


The apparatus used (Armitage, Brown, & D[...],
1964) consists of a vertical panel containing a horizontal
row of eight green lights spaced at 4-inch intervals.
The table top, before which the subject sits,
contains eight vertical levers each of which is directly
below, and parallel to, one of the eight green lights.
A lever, when pulled toward the subject, lights the
green light above it which remains on until the lever
is released (see Figure 1).

An amber light in the upper right-hand corner of
the panel was lighted when all eight green lights
had been turned on in any order and could be
turned off by means of a foot switch operated by
the examiner. The amber light served as a signal
to tell the subject that he had solved the problem.
Turning off the amber light signaled the subject
that he could proceed with the next trial.

The sequence of lever pulls for each trial was 
recorded electrically by means of an Esterline Angus 
Recorder. Eight pens corresponded to the eight levers 
and a ninth pen wired to the amber light indicated 
termination of a trial. 


Subjects 


Case histories of 82 male schizophrenic patients
between the ages of 18 and 45 were rated by means
of Becker’s (1955) revision of the Elgin Prognostic
Scale. The scale consists of 20 items, 15 relating
to degree of premorbid adjustment, and 5 relating
to symptoms on admission. Low scores
indicate good premorbid adjustment and relative
absence of symptoms, while high scores indicate
poor premorbid adjustment and relative severity of
symptoms. The rating was done independently by
two raters with an interrater reliability coefficient
of .91. The reactive group was the 20 patients whose
scores ranged from 19 to 39 (M = 31.8); the 20
process patients had scores ranging from 61 to 74
(M = 64.5). All patients came from open wards,
showed no clinical evidence of organic pathology,
and had no shock or insulin coma therapy within
[illegible] prior to testing. Twenty-nine nonpsychotic
patients, diagnosed as neurotic or character disorders,
and [illegible] normal subjects, a cross-section of
hospital personnel, served as control subjects. For the
combined process, reactive, and nonpsychotic patient
groups mean age was 33.7 years, mean IQ was 104.9, and
mean years of education was 10.5. Differences among
the groups on these variables were not significant.
This information was not available for the normal
subjects. Process and reactive groups differed
significantly in length of hospitalization, but the
correlation between performance and length of
hospitalization was not significant (r = -.21).


Procedure


Each subject was tested on 5 consecutive days. On
Days 1-3, [illegible] trials were run each day. The subject
was told to light the amber light by turning on each
of the green lights. No mention was made of the
order in which the green lights were to be turned
on. Any sequential pattern of eight green lights
turned on lighted the amber light. Pulling a lever more
than once in any one trial did not penalize the subject;
when a lever which had already been
pulled was pulled for the second time in any one
trial, the green light above it did not go on.

On Day 4, the subject was told that he was to
find as many different ways of lighting the amber
light as he could by turning on the green lights
in different orders. Seventy-five trials were divided
into three sets of 25 trials each. After Trial 25, the
subject was reminded to find as many different ways
as he could to light the amber light, and not to
repeat any that he had done on the preceding 25
trials. After each trial following Trial 50, the subject
was told to make the next pattern of lever pulls
different from the one that just preceded it.

On Day 5, 60 trials were given, and the subject
was told only to light the amber light.


Fig. 1. Diagram of variability apparatus, showing the
amber light, the eight green lights, and the eight lever
switches. (The numbers in lever switches were not part
of the apparatus but were assigned to allow for statistical
analysis of patterns.)


Dependent Variables 


The dependent variables were the sequences of 
lever pulls used by the subjects and the frequencies 
with which they occurred. In discussing the results 
the following terms are used: A lever pair is any 
sequence of two levers. A pair score was derived 
from the distribution of pair frequencies produced 
by all normals throughout the experiment by assign- 
ing to each pair a z score from a distribution with 
a mean of 10, and a standard deviation of 2; the 
greater the frequency of any pair, the higher its
score.3 A pattern is any sequence of eight lever
pulls (for example, 1, 3, 5, 7, 2, 4, 6, 8 and
8, 6, 4, 2, 7, 5, 3, 1 are different patterns). A
pattern score is the sum of the seven pair scores for
the pattern.
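
The scoring scheme just described can be sketched as follows. Several details are assumptions made for illustration only: a "pair" is taken to be two consecutive pulls within a pattern, all 56 ordered lever pairs enter the reference distribution (unobserved pairs with frequency zero), the population standard deviation is used, and the reference patterns are invented.

```python
from collections import Counter
from itertools import permutations
from statistics import mean, pstdev

LEVERS = range(1, 9)


def adjacent_pairs(pattern):
    """Return the seven ordered pairs of consecutive lever pulls in a pattern."""
    return list(zip(pattern, pattern[1:]))


def pair_scores(reference_patterns):
    """Convert pair frequencies to standard scores with mean 10 and SD 2."""
    counts = Counter(pair for pat in reference_patterns
                     for pair in adjacent_pairs(pat))
    all_pairs = list(permutations(LEVERS, 2))      # 56 ordered pairs
    freqs = [counts[pair] for pair in all_pairs]
    mu, sigma = mean(freqs), pstdev(freqs)
    if sigma == 0:
        raise ValueError("pair frequencies have no variance")
    return {pair: 10 + 2 * (counts[pair] - mu) / sigma for pair in all_pairs}


def pattern_score(pattern, scores):
    """Sum the seven pair scores of one eight-lever pattern."""
    if sorted(pattern) != list(LEVERS):
        raise ValueError("a pattern must use each of the eight levers once")
    return sum(scores[pair] for pair in adjacent_pairs(pattern))


# Hypothetical reference data standing in for the normal group's patterns.
forward = (1, 2, 3, 4, 5, 6, 7, 8)
backward = forward[::-1]
alternating = (1, 3, 5, 7, 2, 4, 6, 8)
scores = pair_scores([forward] * 3 + [backward] * 2 + [alternating])
```

In this toy reference set the frequently used forward sequence receives a higher pattern score than the rarer alternating sequence, which is the sense in which a high pattern score indicates stereotyped responding.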

The patterns were assigned to a number of cate- 
gories which differ qualitatively in degree of change. 
Minimal change (Category 1—C:) is afforded by 
sequences of the two simplest patterns (1, 2, 3, 4, 
5, 6, 7, 8 and 8, 7, 6, 5, 4, 3, 2, 1). 

Systematic variations, other than the minimal 
ones, were also considered. For example, vari- 


8The choice of the normal group is arbitrary, 
since the pair frequency distributions of the normal 
and hospitalized patients were very similar and the 
rank-order correlations between them indicated that 
either one could be used for conversion to standard 


scores. 
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TABLE 1 


t-Test ANALysis witH Respect To NDP, NUP, C:, AND C; PATTERNS For PROCESS, 
Reactive, Nonpsycuoric, AND NORMAL SUBJECTS 


y 


Groups NDP t NUP d C: ? Ci 

P versus R 3.523 005 2.923 0058 3.136 005" 2.551 
P versus NP 3.631 -005* 2.387 018 1.929 038 2.220 
P versus N 3.593 005% 2.498 O01" 2.761 -005° 5.238 
R versus NP — ns — ns? — ns? 1.167 
R versus N` — ns? = ns> — ns? 1.468 
NP versus N _ ns» — ns? = ns? 3.698 

= One-tailed test. 

bt < 1,000, 

e Two-tailed test. 


ability within the first four and the last four 
levers (1, 2, 3, 4/8, 7, 6, 5; 1, 4, 35 2/5; 6, +7; 8; 
1, 2, 4, 3/5, 6, 8, 7); cyclic generation of one pat- 
tern from its predecessor (2, 3, 4, 5, 6, 7, 8, 1; 
3, 4, 5, 6, 7, 8, 1, 2; 4, 5, 6, 7, 8, 1, 2, 3). These will 
not be discussed further since they did not occur 
with: sufficient frequency and the groups did not 
differ significantly in their use of these patterns, 

Nonsystematic pattern variations (Category 2, C2)
comprise all sequences occurring more than once that 
could not be classified in any of the above cate- 
gories. A unique pattern is one that was produced 
by only one subject on one occasion. 
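
The scoring rule described above (pair frequencies converted to standard scores with a mean of 10 and a standard deviation of 2, then summed over the seven pairs of a pattern) can be illustrated with a minimal sketch. The function names `pair_scores` and `pattern_score` are hypothetical, and two details are assumptions not stated in the text: the standardization runs over the pairs actually observed in the reference data, and the population standard deviation is used.

```python
from collections import Counter
from statistics import mean, pstdev


def pair_scores(reference_patterns, target_mean=10.0, target_sd=2.0):
    """Return a standard score for every ordered lever pair in the reference data.

    Assumption: frequencies are standardized over observed pairs only,
    using the population standard deviation.
    """
    counts = Counter()
    for pattern in reference_patterns:
        # An eight-pull pattern contains seven consecutive ordered pairs.
        counts.update(zip(pattern, pattern[1:]))
    freqs = list(counts.values())
    mu, sigma = mean(freqs), pstdev(freqs)
    if sigma == 0:
        # All pairs equally frequent: every pair sits at the mean.
        return {pair: target_mean for pair in counts}
    return {pair: target_mean + target_sd * (f - mu) / sigma
            for pair, f in counts.items()}


def pattern_score(pattern, scores):
    """Sum the seven pair scores of one eight-pull pattern."""
    return sum(scores[pair] for pair in zip(pattern, pattern[1:]))


# Toy reference data: the ascending pattern occurs twice, the descending once.
ascending = (1, 2, 3, 4, 5, 6, 7, 8)
descending = (8, 7, 6, 5, 4, 3, 2, 1)
scores = pair_scores([ascending, ascending, descending])
print(pattern_score(ascending, scores), pattern_score(descending, scores))
```

In this toy case the more frequent pairs receive higher scores, so the ascending pattern scores higher than the descending one. A pattern containing a pair absent from the reference data raises a `KeyError` in this sketch, since the text does not say how such pairs were handled.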

On the basis of the preceding discussion, it was
predicted that process subjects would show less
change, when change is demanded by instructions,
than the other three groups. In terms of the dependent
variables this means that the process subjects
should produce fewer different, unique, and
C2 patterns; more C1 patterns; and higher pattern
scores. Reactives should not differ from normals and
nonpsychotics on the above measures.


RESULTS 


On Days 1-3 no significant differences were
found among the four groups on any of the
dependent variables. However, on Day 4,
when variability was demanded in the instructions,
the process group produced significantly
lower means for different, unique, and
C2 patterns than did the other three groups.
The reverse occurs for C1 patterns (see
Table 1).4 Furthermore, reactive subjects did
not differ significantly from nonpsychotic and
normal subjects. These results are all in the
predicted direction and thus support the
hypothesis.

The only other difference of note shows
that normal subjects produce significantly
fewer C1 patterns than the nonpsychotic
patients.

Analysis of patterns produced in successive
blocks of 25 trials on Day 4 further emphasizes
the effect of instructions on change in
behavior. As Table 2 shows, process and
reactive patients differ significantly on the
three sets of 25 trials and on all 75 trials;
specifically, process patients produced high
pattern scores, whereas reactives tended to
produce low pattern scores.

4 All tests were t tests. A difference at p ≤ .05 is
called significant.


TABLE 2

t-TEST ANALYSIS OF PATTERN SCORES FOR EACH 25 AND ALL 75 TRIALS
BETWEEN PROCESS AND REACTIVE GROUPS

        Trials 1-25         Trials 26-50        Trials 51-75        Trials 1-75
        Process  Reactive   Process  Reactive   Process  Reactive   Process  Reactive
M       86.75    81.60      84.41    78.44      82.86    80.16      84.68    [illegible]
SD      2.35     5.17       4.27     3.13       4.57     3.61       3.10     [illegible]
t       4.055**             5.059*              2.154*              5.000**

* p = .01, two-tailed test.
** p = .005, one-tailed test.


Since Day 4 was the first day on which
the subjects were instructed to produce different
patterns, and on Days 1-3 there were
no differences between process and reactive
groups, it appears clear that the instructions
just prior to the first 25 trials had an effect
on the type of patterns produced on these
trials. The following analysis is concerned
with determining the effect of the instructions
on successive sets of 25 trials for both the
process and reactive groups. The differences
between pattern score means obtained on
Trials 1-25, 26-50, and 51-75 were evaluated
by t tests for related measures. Table 3 shows
that both process and reactive subjects
produced different pattern scores on the
second set of 25 trials than they produced
on the initial set of 25 trials. Unlike the
process subjects, the reactive subjects also
produced different pattern scores on the third
set of 25 trials than they produced on the
second set of 25 trials. On the other hand,
the process group produced pattern scores on
the third set of 25 trials which were different
from those on the initial 25 trials,
whereas the reactives did not.

TABLE 3

COMPARISON OF MEAN PATTERN SCORE CHANGES
(MEAN DIFFERENCE) FROM TRIALS 1-25
TO TRIALS 26-50, TRIALS 26-50 TO TRIALS
51-75, AND TRIALS 1-25 TO TRIALS 51-75
WITHIN PROCESS AND REACTIVE GROUPS

Differences between     Process             Reactive
trials                  M a     t           M       t
1-25 and 26-50          2.33    3.530**     3.16    2.873**
26-50 and 51-75         1.55    1.666       1.72    2.820*
1-25 and 51-75          3.89    3.536**     1.57    1.427

a These are the differences between means presented in Table 2.
* p = .02, two-tailed test.
** p = .01, two-tailed test.


Pearsonian intercorrelations among the
three sets of 25 trials were computed for each
group separately. This describes the direction
of change in the successive sets of 25 trials.
As shown in Table 4, the pattern scores
produced by process subjects on the second
set of 25 trials were considerably more similar
to their pattern scores on the first 25 trials
than is the case for the reactive subjects.
When the two groups are compared with
respect to the second and third sets of 25
trials, the reverse relationship occurs. When
one compares the last set of 25 trials with
the initial 25 trials, both groups tended to
produce different pattern scores toward the
end than they did initially, even though the
reactives were more similar than the process.


TABLE 4

RELATIONSHIP OF PATTERN SCORES BETWEEN
TRIALS 1-25 AND 26-50, 26-50 AND 51-75
FOR PROCESS AND REACTIVE GROUPS

                    Correlation coefficient
Trials              Process   Reactive   z
1-25 and 26-50      .754      .504       3.96**
26-50 and 51-75     .523      .667       2.24*
1-25 and 51-75      .274      .397       1.29

* p = .05.
** p = .01.
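
The text does not say how the process and reactive correlations were compared. One common approach for two independent groups is Fisher's r-to-z transformation, sketched below as an illustration only. The function name `fisher_z_difference`, the correlation values, and the group sizes are all hypothetical; the group sizes in particular are not given in this section, so the result is not expected to reproduce any value in Table 4.

```python
import math


def fisher_z_difference(r1, n1, r2, n2):
    """z statistic for the difference between two independent correlations.

    Each r is transformed with atanh (Fisher's r-to-z); the difference is
    divided by its standard error sqrt(1/(n1 - 3) + 1/(n2 - 3)).
    """
    if n1 <= 3 or n2 <= 3:
        raise ValueError("each group needs more than 3 observations")
    z1, z2 = math.atanh(r1), math.atanh(r2)
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return (z1 - z2) / standard_error


# Illustrative values only: correlations of .75 and .50 with a
# hypothetical 28 observations behind each coefficient.
z = fisher_z_difference(0.75, 28, 0.50, 28)
print(round(z, 2))
```

Because the standard error shrinks as the number of observations grows, the same pair of correlations yields a larger z with larger samples.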

On Day 5 the performance of process and 
reactive subjects was similar to their per- 
formance on the first 3 days of the experi- 
ment; that is, their behavior on Day 5 ap- 
peared to be uninfluenced by their experience 
on Day 4. 


Discussion 


The results of this study consistently indi- 
cate that reactive schizophrenics varied their 
behavior in response to instructions more than 
did process schizophrenics. These findings are 
in line with the stated hypotheses. 

Since the so-called process schizophrenic 
has been ill for a long time, showing poor 
adjustment even when a child, it is reasonable 
to suppose that his acquired modes of dealing 
with problems are limited in number, stereo- 
typed, and resistant to change. 

On the other hand, the prepsychotic ad- 
justment of a reactive schizophrenic seems 
to have been fairly good throughout much of 
his life, more similar to that of the non- 
psychotic or the normal person than to that 
of the process schizophrenic. Consequently, 
he has probably acquired problem-solving 
behavior which is more flexible than that of 
the process schizophrenic. 

These assumptions are consistent with the 
present results that reactive patients produce 
more different patterns, more random or
complex patterns (C2), more unique patterns,
and fewer C1 patterns than do process pa-
tients; and that reactives do not differ sig- 
nificantly on the above variables from non- 
psychotics and normals, whereas the process 
patients do. Also, reactive patients use lever 
pairs which have a low frequency of occur- 
rence, whereas process patients use lever pairs 
which have a high frequency of occurrence. 
By all these criteria the behavior of the re- 
active patients is relatively flexible and 
amenable to change in accordance with in- 
structions, while the process patients tend to 
be considerably stereotyped in their behavior. 
Other evidence of stereotyped behavior in the 
process group is that the correlation between 
their first 25 and second 25 trials on Day 4 
was .75; the correlation was .52 for the reac- 
tive group. Similar evidence was presented by 
Zlotowski and Bakan (1963) who recorded 
the guesses made (heads or tails) for 300 
coin tosses. The correlation between the first 
150 and the second 150 guesses was .92 for 
a group of process patients and .50 for a 
group of reactives. 

However, this stereotyped behavior can be 
disrupted under certain conditions, so the 
ability to change is not altogether absent in 
these patients. During the last set of 25 trials 
on Day 4, the process group performed at 
about the same level as the reactive group. 
Process patients may take longer or require 
greater pressure to change, but the potential 
to change is still present (cf. Bleke, 1955; 
Peters, 1955; Rodnick & Garmezy, 1957; 
Smith, 1959).


THE EFFECT OF DIFFERENT VERBAL REINFORCEMENT
COMBINATIONS ON THE VERBAL DISCRIMINATION
PERFORMANCE OF SCHIZOPHRENICS


JANET TAYLOR SPENCE 1 AND CHARLES V. LAIR 2


Schizophrenics were given a 2-alternative verbal discrimination task under a
wrong-nothing (Wn), right-nothing (Rn), or right-wrong (RW) verbal reinforcement
procedure. For open-ward Ss (11 per group) RW and Wn performance
was similar but both were significantly superior (p < .01) to Rn, due
largely to “nothing” having less reinforcement (information) value than “wrong”
in the latter group. These results were comparable to those previously found
with nonpsychiatric Ss. For closed-ward Ss (9 per group), however, “nothing”
had less reinforcement value than an overt response from E for Ss in both
the Rn and Wn conditions, leading to a significantly inferior performance
(p < .05) of Ss in both of these conditions to that of the RW group.


In a series of studies employing conceptual
tasks with two response alternatives, Buss and
his colleagues (e.g., Buss, Braden, Orgel, &
Buss, 1956) have found among groups of
college students, nurses, and psychiatric patients
that performance is poorer under a
reinforcement condition in which the experimenter
informs the subject of correct choices
by saying “Right” but remains silent after
incorrect ones (right-nothing or Rn combination)
than under either a wrong-nothing
(Wn) or right-wrong (RW) combination. The
results of these studies, in all of which subjects
had not been given any previous explanation
of the reinforcement procedures, led
Buss et al. to hypothesize that “nothing” is
nonreinforcing and that the superiority of
the RW and Wn combinations over Rn is
therefore due to “Wrong” being a stronger
negative reinforcer than “Right” is a positive
reinforcer.

The results of a recent study by Spence
(1964) in which nonpsychiatric patients were
tested on a two-choice verbal discrimination
task replicated these findings but suggested
a different explanation of the superiority
of the RW and Wn combinations over Rn.
Analyses of the percentages of correct responses
following previously correct and incorrect
choices indicated that “nothing” was
as effective in producing correct
responses in the Wn combination as was the
experimenter’s “Right” in the RW and Rn
combinations. In the Rn combination, however,
“nothing” was less effective in producing
correct responses than was “Wrong.” These
data suggest that while subjects in the Wn
condition quite easily grasp the information
being given by the experimenter’s failure to
respond, subjects in the Rn condition are less
able to do so and that it is this confusion
about the meaning of “nothing” following
incorrect choices that is responsible for the
inferior performance of these subjects. This
interpretation was further supported in a
second study (Spence, Lair, & Goodstein,
1963) in which subjects were instructed about
the information value of “nothing” prior to
learning the list; under these conditions, there
was no significant performance difference between
the Rn and Wn combinations in several
nonpsychiatric groups and “nothing” was
found to be as effective as an overt response
from the experimenter in both combinations.


1 Now at the University of Texas.
2 Now at St. Louis University.


In the latter investigation, using informed 
subjects, the performance of schizophrenic 
groups was also studied. Unlike the non- 
psychiatric subjects, a slight tendency was 
noted in the schizophrenics for “nothing” to 
be less effective than an overt response from 
the experimenter in both the Rn and Wn 
combinations, in spite of the fact that its
information value had previously been ex- 
plained. These results suggested a deficiency 
in cue utilization among schizophrenics, a 
phenomenon which has been reported in in- 
vestigations using other types of tasks (e.g., 
Pishkin, Smith, & Lundy, 1962). They further 
suggested that schizophrenics would be even 
less able to utilize the information being con- 
veyed by the experimenter’s failure to respond 
when its meaning is not pointed out in pre- 
liminary instructions. This hypothesis implies 
that while schizophrenics in the Wn condition 
might exhibit better performance than those 
in Rn, as has typically been found with 
uninformed nonschizophrenic subjects, they 
should also exhibit less adequate performance 
than those in the RW condition. The present 
study was designed to test these hypotheses. 


METHOD 
Subjects 


Sixty male schizophrenics, all patients at the 
Veterans Administration Hospital, Knoxville, Iowa, 
were selected according to the following criteria: 12 
months or less of current hospitalization, 20-45 years 
of age, minimum of seventh-grade education, no 
indication of CNS pathology or history of alcoholism 
or drug addiction, and ability to cooperate in the 
experimental procedure, as estimated by ward per- 
sonnel. The subjects were divided into three sub- 
groups of 20 each, 11 in each group being patients 
housed on open wards and 9 on closed wards. The
subgroups, and within each of these, open- and
closed-ward subjects, did not differ in means and
sigmas on age and education. The means for the
total group were 35.18 and 10.75 years of age and 
education, respectively. 


Materials and Apparatus 


The verbal discrimination list, made up of 16
pairs of familiar words, is described in detail elsewhere
(Spence et al., 1963). The list was presented
by means of a Dunning Animatic projector
using a film strip in which each pair of words was
followed by a blank frame at a 4:2-second exposure
rate, with 2 seconds between trials. The subjects
were given 16 presentations of the list in four
serial orders, the right-left position of the words
in each pair being counterbalanced across orders.


TABLE 1

MEANS OF TOTAL NUMBER OF CORRECT
RESPONSES PER SUBJECT (TRIALS 2-16)

Condition     M         SD
RW            183.55    37.18
Wn            176.80    41.52
Rn            147.85    33.32


Procedure 


The preliminary instructions stated that the subject
was to learn to call out the “correct” member of
each pair but did not explain how he was to obtain
the information that would allow him to do so.
Rather the subject was told: “it will become clear
to you as we go along which word in each pair
is correct.” On the first (guessing) trial, prearranged
schedules were employed so that half of the subject's
initial choices determined the “correct” word on that
and all subsequent trials and half the “incorrect”
word. The subjects were given their information by
the experimenter in one of three ways: “Right” after
each correct choice but no response (nothing) after
incorrect ones (Rn), “Wrong” after each incorrect
choice but nothing after correct ones (Wn), or
“Right” or “Wrong” after each choice (RW).


RESULTS 


As may be seen from Table 1, the performance
of the groups was in the predicted order,
RW being superior to Wn and the latter to
Rn. An analysis of variance of the total
number of correct responses per subject indicated
that the differences among the groups
were significant (F = 4.91, p < .05). However,
as determined by Tukey’s method of
multiple comparisons (Ryan, 1959), this
result was primarily due to the deviant performance
of the Rn subjects; while these
subjects were significantly inferior to each of
the other groups (p < .01), the difference
between the means of the RW and Wn groups
did not approach significance.

In a more refined analysis of the data, the
percentage of correct responses on each trial
for those pairs that had been correct on the
previous trial was first determined for each
subject and then the percentage of correct
responses for those pairs that had been
incorrect on the previous trial. Inspection of
the means of these successive percentage
curves for previously incorrect items revealed that
the Rn subjects were markedly lower than the
other groups; that is, “nothing,” used to indicate
an incorrect response for the Rn subjects,
had less effect in producing correct
responses on the subsequent trial than
“Wrong” for the RW and Wn subjects.
However, the curves for previously correct
items indicated that except for Trial 2,
the Wn subjects were doing about as well as
the other groups; that is, “nothing,” used to
indicate a correct response, had an effect
equivalent to “Right.”
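
The successive percentages described above can be computed for a single subject along the following lines. This is a minimal sketch, not the authors' procedure: the function name `successive_percentages` and the data layout (one list of right/wrong flags per trial, in a fixed pair order) are assumptions made for illustration.

```python
def successive_percentages(responses):
    """Percentage correct on each trial, split by outcome on the previous trial.

    `responses` holds one list per trial; each inner list has one boolean per
    word pair (True = correct), with pairs in the same order on every trial.
    Returns two lists covering Trial 2 onward: percentages for pairs that were
    previously correct and for pairs that were previously incorrect. An entry
    is None when no pair falls in that class on that trial.
    """
    after_correct, after_incorrect = [], []
    for previous, current in zip(responses, responses[1:]):
        prev_right = [now for before, now in zip(previous, current) if before]
        prev_wrong = [now for before, now in zip(previous, current) if not before]
        after_correct.append(
            100.0 * sum(prev_right) / len(prev_right) if prev_right else None)
        after_incorrect.append(
            100.0 * sum(prev_wrong) / len(prev_wrong) if prev_wrong else None)
    return after_correct, after_incorrect


# Hypothetical subject: four pairs, three trials.
trials = [
    [True, False, True, False],
    [True, True, False, False],
    [True, True, True, True],
]
print(successive_percentages(trials))
```

Returning None for an empty class keeps a subject who has no previously incorrect pairs left from being scored as 0%, which is the situation that limits how far the group curves can be carried across trials.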
These results thus provided only minimal
evidence supporting the hypothesis that schizophrenics
would be unable to utilize the information
being conveyed by “nothing” as
effectively as an overt response from the
experimenter in the Wn as well as in the Rn
combination, and that overall performance
would therefore be best under the RW
combination. Further inspection of the data
indicated, however, that separate consideration
of the performance of the 9 closed-ward
and 11 open-ward patients from each group
would be of value in assessing this hypothesis.

Parallel analyses for the open-ward patients
to those described above led to similar results.
The performance of the RW and Wn groups
did not differ significantly, neither of the
groups being consistently superior to the
other over trials. Both groups, however, were
significantly superior (p < .01) to Rn, the
successive percentages analyses suggesting
that this result was due to the lesser effectiveness
of “nothing” in the latter group than
of “Wrong.” That is, a comparison of the
curves for the percentage correct following
previously incorrect revealed that the Rn
group was consistently lower than RW and
Wn, while for previously correct items, all
three curves were highly similar.

Since somewhat different results were obtained
from the analyses of the closed-ward
patients, the data from these patients will be
presented in greater detail. As may be seen
from the learning curves for the closed-ward
groups presented in Figure 1, the performance
of the RW subjects, unlike that of the open-ward
patients and of the nonpsychiatric patients
in the previous study (Spence, 1964),
was markedly superior to the performance
of the Wn group. The latter, as in previous
investigations, continued to be superior to Rn.
An analysis of variance of the total number
of correct responses per subject, followed
by comparisons of pairs of means,
indicated that all three groups differed
significantly from each other (p < .05).
These overall performance differences suggest
that for the closed-ward subjects,
“nothing,” whether used to indicate correct
or incorrect responses, had less reinforcement
(information) value than an overt response
from the experimenter. More direct support
for this hypothesis was found in the successive
percentages analysis, the results of which are
plotted in Figure 2.3 On the earlier trials, at
least, the performance of the Wn subjects on 
pairs that had previously been correct was 
poorer than that of either the RW or Rn 
subjects. A similar comparison of the Rn 
group with RW and Wn on items that had 
previously been incorrect indicated an even 
more marked difference in favor of the latter. 

Except for Trial 2, the mean percentages
reported in Figure 2 are based on different
numbers of items between and within groups;
only the Trial 2 data were therefore analyzed
statistically. An analysis of variance for
repeated measures indicated that the observed
interaction between Wn and Rn groups and
previously correct and incorrect items was
significant (F = 5.73, p < .05). Similarly, t’s
comparing the means of the combined Rn
and RW groups with Wn on previously correct
items and of the combined Wn and RW
groups with Rn on previously incorrect items
also proved to be significant (p < .05). Thus,
in both the Rn and Wn conditions, “nothing”
was shown to have significantly less effect in
producing correct responses than an overt
response from the experimenter.


3 These analyses could not be carried out beyond
Trial 6 for the RW group and Trial 8 for Wn due
to the presence of subjects making 0 correct out of
0 previously incorrect responses after these points. It
will be noted that in the RW- and Wn- curves there
is a tendency towards an initial drop in percentage
correct followed by an increase (which continues,
for the subjects remaining, beyond the points
plotted). These U-shaped curves, which were also
found with the open-ward subjects and with nonpsychiatric
groups (Spence, 1964), appear to reflect
the effects of item selection, the pairs represented in
the previously incorrect curves being increasingly
restricted in number and to those that are more
“difficult” (that is, pairs for which the subject has a
response bias towards the incorrect alternative). As
subjects eventually master all or most of these items,
the curves rise after their initial drop.


FIG. 1. Mean percentage of correct responses per
trial under the three reinforcement combinations for
the closed-ward groups. (Axes: Mean % Correct by Trials.)


FIG. 2. Successive percentages of correct responses
for items that were correct (+) and incorrect (-)
on the previous trial for the closed-ward groups.
(Axes: Mean % Correct by Trials.)


Although the closed-ward subjects in the 
Wn group were inferior in performance to 
the RW group, the upward trend of their 
learning curve, shown in Figure 1, indicates 
that they nonetheless were mastering the list. 
For the Rn subjects, however, there was little 
indication of any systematic increase in 
correct responses after Trial 2, the mean 
number of correct responses being only 
slightly above chance throughout. A trend 
analysis of these Rn data (Lewis, 1960) indi- 
cated that the hypothesis of zero slope could 
not be rejected (F < 1.00). Inspection of the 
successive percentages curves in Figure 2 
reveals, however, that the Rn subjects were 
not simply making random choices but rather 
were tending to repeat their previous re- 
sponses whether right or wrong; the repetition 
rates were, in fact, quite comparable for
previously correct and incorrect items, averaging
about 72% for the former and 67% for
the latter.

Failure to improve over trials was not
found to be true of the open-ward Rn subjects.
This further difference between the two
types of patients is reflected in the results of
an analysis of variance of number of correct
responses in which the effects of trials, conditions
(Rn-Wn) and groups (open-closed)
were taken into account. The Trials × Conditions
and the triple interaction were both
significant (p < .05), due largely to the lack
of improvement in the closed-ward Rn
group. All main effects were also significant
(p < .01), indicating the superiority of Wn
over Rn and of open- over closed-ward subjects
and the improvement over trials of the
total group.


Discussion 


The results obtained with the open-ward
schizophrenics were essentially the same as
those found in a parallel study employing
nonpsychiatric groups (Spence, 1964): no
significant performance differences occurred
between the RW and Wn conditions but both
were significantly superior to Rn. The successive
percentages analyses further suggested
that contrary to the interpretation of Buss
et al. (1956) concerning the relative potency
of “Right” and “Wrong” as reinforcers, these
results were primarily due to “nothing”
having less reinforcement (information) value
in the Rn condition than “Wrong.” The patients
from closed wards deviated from the
open-ward subjects in that they were unable
to utilize the information being conveyed by
“nothing” as effectively as an overt response
from the experimenter in both the Rn and Wn
combinations, leading to poorer overall performance
in these conditions than with RW.
Their performance under the Rn condition
was particularly poor, showing no evidence of
improvement over trials. Thus the hypothesis
presented earlier concerning a deficiency
in utilizing the information conveyed by
“nothing” in schizophrenics in general was
confirmed only with the closed-ward group.

In the hospital from which the patients
were selected, ward assignment is closely
associated with current mental status, the symptoms
of open-ward patients tending to be in
full or partial remission and those of closed-ward
patients in nonremission. One interpretation
that therefore could be given to the
fact that a deficiency in cue utilization was 
exhibited only by the closed-ward subjects is 
that this deficiency is simply one aspect of 
schizophrenic symptomatology, disappearing 
when other psychotic symptoms are in a state 
of remission. A similar conclusion concerning 
the reversibility of intellectual deficits ex- 
hibited by schizophrenics has recently been 
reached by Haywood and Moelis (1963) who 
studied longitudinal IQ changes in improved 
and unimproved patients. 

Many of the studies comparing the per- 
formance of schizophrenics and nonschizo- 
phrenics on intellectual tasks have been con- 
ducted to test hypotheses about personality 
characteristics of the schizophrenic that play 
a role in the development of the disorder. 
Typically the schizophrenics have been 
selected without regard to their current mental 
status. The findings reported above suggest
the dangers inherent in ignoring the mental
status variable in tests of etiological hypotheses:
a behavioral characteristic which may
simply be one of the overt manifestations of
the active, ongoing psychotic process may
erroneously be described as reflecting a premorbid
characteristic of those who become
schizophrenic which distinguishes them from
those who do not and thus may be of etiological
significance. Ideally, premorbid comparisons
should be made to avoid this type
of confusion. This procedure is in most instances
impractical but one can at least determine
whether or not differences between
schizophrenics and nonschizophrenics that
are found when actively psychotic subjects are
tested can also be found when their symptoms
have remitted.

The present results also suggest that considerable
caution should be exercised in interpreting
the results of studies attempting to
assess hypotheses that state that schizophrenics
are less motivated than nonschizophrenics
by positive social reinforcers or
rewards or that they are more disturbed by
negative social reinforcers or censure. A num- 
ber of these investigations, such as studies of 
verbal conditioning (e.g., Cohen & Cohen, 
1960) have employed reinforcing operations 
that are essentially parallel to the Rn and 
Wn combinations of the present study. 
It is quite possible that the verbal re- 
inforcers used in these studies did not 
have the motivational or effective proper- 
ties that were assigned to them but instead, 
along with “nothing,” acted primarily as car- 
riers of information. Thus the performance 
differences that have been found among 
the several reinforcement conditions in these 
studies, or the interactions between conditions 
and diagnostic groups, may have had little 
relevance to the hypotheses being tested but 
more to do, as in the present study, with 
the kind and amount of information that 
subjects extract from a given reinforcement 
combination. 


NEGRO PERCEPTION OF NEGRO AND WHITE PERSONALITY TRAITS


JAMES A. BAYTON, LETTIE J. AUSTIN, AND KAY R. BURKE


Howard University 


Male and female Negro Ss answered the Guilford-Zimmerman Temperament 
Survey in terms of how they thought “the average Negro male,” “the average 
Negro female,” would reply to the items. The results show that 3 dimensions are 
involved in the perceptions of racial personality—the trait being “stereotyped,” 
sex of the group being assessed, and the sex of S. For 6 of the 10 traits, the 
means for whites, in contrast to those for Negroes, were in the direction 
indicative of better personality adjustment. 


There have been several studies of the atti- 
tudes and stereotypes which minority group 
members hold with respect to their own races. 
These studies consistently show that members 
of minority groups tend to have attitudes and 
stereotypes concerning their respective groups 
which are similar to those held by the white 
majority. Using the adjective list procedure de- 
veloped by Katz and Braly (1933), Bayton 
(1941) and Meenes (1943) found that Negroes 
stereotype “the Negro” as being lazy, super- 
stitious, musical, loud, and very religious. These 
items also appear in the stereotype which white 
Americans have of the Negro. Sarnoff (1951) 
found Jewish college students with the same 
anti-Semitic attitudes observed in majority-group 
bigots. Engel, O'Shea, Fischl, and Cummings 
(1958) reported that an “outwardly loyal” group 
of Jewish subjects showed the same pattern of 
negative attitudes toward their own group as that 
found among white Protestants. In these latter 
studies, the Jewish Anti-Semitism Scale developed 
by Sarnoff was used. 

From these investigations arises the question of 
what dynamics would lead minority group mem- 
bers to possess negative attitudes toward their 
own race, attitudes which are quite similar to 
those held by the majority group. Sarnoff (1951) 
regards anti-Semitic attitudes in Jews as being a 
symptom of identification with the aggressor. 
These anti-Semitic attitudes are viewed as being 
expressions of hostility toward one’s own social 
group and appropriation of the attitudes of those 
in the white majority who aggress against them. 
At the level of personality characteristics, this 
symptom was related to negative attitudes toward 
the self and one’s parents and passivity in the 
face of interpersonal hostility. This same con- 
cept—identification with the aggressor—might 
provide an explanation of the negative stereotype 
of the Negro found among Negro subjects. 
The identification-with-the-aggressor theory, as
stated by Sarnoff, includes only one direction of
influence: incorporation on the part of the
minority group members of the negative attitudes


Since the fundamental process is supposed to
be identification, another direction of influence
could be operating, namely, a tendency to idealize
the aggressor. The two influences could be operating
simultaneously. In the latter case, the
attitudes and stereotypes which the minority
group holds toward the majority should be
characterized by highly positive components. Evidence
for this point of view is found in the
stereotypes which Negroes have of “white
Americans”: intelligent, industrious, ambitious,
progressive, etc. (Bayton, 1941; Meenes, 1943).

Utilizing several approaches that differ from
those in other studies, the present research attacks
the problem of how a minority group perceives
itself and the majority group. Whereas
earlier investigators have used either adjective
lists to determine stereotypes or scales to assess
attitudes, in this research the instrument was a
standardized personality test (Guilford-Zimmerman
Temperament Survey, GZTS) on which
Negro subjects answered the items as they
thought both Negroes and whites would. This
basic design permits investigation of the data to
see whether there are indications of the two
directions of influence in the identification-with-the-aggressor
concept.

Bayton, McAllister, and Hamer (1956) have
shown that stereotypes are a function of dimensions
other than race. In their research, stereotypes
were obtained in terms of race-class differentiation
of the groups being stereotyped:
upper-class Negroes and whites; lower-class
Negroes and whites. The results indicated that
stereotypes as held by white and Negro subjects
were more class linked than race linked. In the
current research, perceptions of racial characteristics
were investigated in the dimension of sex,
as well as race. The subjects (all Negroes) an- 
swered the personality inventory in terms of 
“the average Negro male,” “the average white 
male,” “the average Negro female,” and “the 
average white female.” Moreover, since male 
and female subjects were used, the interaction 
of the sex of subject and the perception of the 
personalities of race-sex groups was investigated. 


METHOD 


Subjects. The subjects were 240 Negro students at 
Howard University (120 male, 120 female). 

Design. The subjects were randomly assigned to 
one of four groups. Each group of 60 subjects was 
given the GZTS. The instructions given to the 
respective groups were: “How do you think the 
Average Negro Male [Average Negro Female, Aver- 
age White Male, Average White Female] would 
answer these items?” (It should be noted that a 
given subject responded to only one of the above 
instructions.) 


RESULTS 


A three-way classification analysis of variance
was used to evaluate the data of each of the
10 traits in the GZTS. (The within-groups variance
was used as the error term.) The results
for five of the traits (G, R, A, S, E) are given
in Table 1.
The Negro subjects gave whites a significantly 
higher score on general activity than they did 
Negroes (white M = 15.63, Negro M = 13.01). 
They “saw” whites as being more restrained than 
Negroes (white M = 14.22, Negro M = 11.87). 
However, the Sex Stereotype × Race Stereotype
interaction for restraint was significant: the
Negro subjects saw white males as being more
restrained (M = 15.27) than Negro males (M
= 10.55). There was no difference in the stereotypes
for this trait for the two groups of females
(white females M = 13.18, Negro females M
= 13.18). The original racial difference for restraint
is created by the difference in the stereotype
for Negro and white males on this trait
(restraint).
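
As a consistency check on the reported means, the race main effect for restraint can be recovered from the four cell means. The sketch below is illustrative only. It assumes equal cell sizes (the design gives 60 subjects per instruction group), uses hypothetical variable and function names, and takes the Negro-male mean as 10.55, the reading consistent with the reported marginal.

```python
# Cell means for restraint, keyed by (race stereotype, sex stereotype).
# Values are those reported in the text; equal cell sizes are an assumption.
restraint = {
    ("white", "male"): 15.27,
    ("white", "female"): 13.18,
    ("Negro", "male"): 10.55,
    ("Negro", "female"): 13.18,
}


def race_marginal(cells, race):
    """Unweighted mean over the sex-stereotype cells for one race stereotype."""
    values = [mean for (r, _), mean in cells.items() if r == race]
    return sum(values) / len(values)


white_mean = race_marginal(restraint, "white")
negro_mean = race_marginal(restraint, "Negro")

# Race gap within each sex stereotype: the interaction shows up as a gap
# for males and no gap for females.
male_gap = restraint[("white", "male")] - restraint[("Negro", "male")]
female_gap = restraint[("white", "female")] - restraint[("Negro", "female")]

print(f"white marginal = {white_mean:.3f}, Negro marginal = {negro_mean:.3f}")
print(f"male gap = {male_gap:.2f}, female gap = {female_gap:.2f}")
```

Under these assumptions the marginals agree with the reported race means (14.22 and 11.87) to within rounding, and the whole race difference sits in the male cells, which is the pattern the text describes.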

Whites were seen as having more ascend- 
ance than Negroes (white M = 17.46, Negro 
M = 13.55). For the trait of sociability, the 
stereotype is a function of sex rather than of 
race; females (M = 19.31) were seen as being 
more sociable than males (M = 16.63). 

For emotional stability, significant differences 
were found for the sex stereotypes and the race 
stereotypes. Males (M = 13.65) were perceived 
as being more emotionally stable than females 
(M = 12.07); whites (M = 13.68) were stereo- 
typed as being more emotionally stable than 
Negroes (M = 12.00). 

The results for the remaining five traits 
(O, F, T, P, M) are shown in Table 2. Whites 
were scored as having a greater degree of objectivity
than Negroes (M = 12.08 and 10.51,
respectively).

For friendliness, two of the interactions showed 
significance—Sex of Subject X Sex Stereotype 
and the triple interaction. Male subjects saw 
greater friendliness in males (M = 9.12) than in 
females (M = 7.42). For female subjects the 
reverse was true; they perceived greater friendli- 
ness in females (M = 9.92) than in males 
(M =8.07). 

Inspection of the data involved in the triple 
interaction for friendliness indicates that the 
male Negro subjects made their basic differentia- 
tion on this trait in terms of racial stereotyping. 
For them the group with the highest degree of 
this trait was white males (M = 10.90), second 
was white females (M = 8.17), third was Negro 
males (M = 7.33), and last was Negro females 
(M = 6.67), the first two groups being white, 
the last two being Negro. In contrast, the female 


TABLE 1 
ANALYSIS OF VARIANCE: RACE-SEX PERSONALITY TRAIT STEREOTYPING FOR FACTORS
G, R, A, S, E 
F 
Source 
General activity Restraint Ascendance Sociability Emotional stability 

Sex of subj 

ject (A 2 12 0.56 .25 0.22 
Sex stereotype 8) 338 bis tois 10.71** 4.55% 
A y gcteotype (C) 14.44 10.30** 27.17** 110 bir 

0.41 0.71 3.02 x ; 
BLS 0.47 0.19 1.95 2.03 136 
AX k 2.29 10.30** 0.11 0.86 0.03 
BXC 035 0.01 031 1.57 3.06 


Note. Respective within-group variances used as error terms. N = 240 (120 male, 120 female).
TABLE 2
ANALYSIS OF VARIANCE: RACE-SEX PERSONALITY TRAIT STEREOTYPING FOR FACTORS
O, F, T, P, M

                                                    F
Source                 Objectivity   Friendliness   Thoughtfulness   Personal relations   Masculinity
Sex of subject (A)         2.68          1.03            0.23               1.69
Sex stereotype (B)         0.79          0.01            0.01               0.02
Race stereotype (C)        4.57*         0.91            1.86               5.61*
A × B                      0.35          6.02*           0.59               0.48
A × C                      3.40          1.37            0.63               3.40
B × C                      1.69          1.41            1.74               0.12
A × B × C                  0.01          5.23*           0.70               0.32

Note. Respective within-group variances used as error terms. N = 240 (120 male, 120 female).

* p < .05.
** p < .01.


Negro subjects made their basic differentiation 
on the basis of sex stereotyping. Their highest 
friendliness score went to Negro females (M 
= 10.87), next to white females (M =8.97), 
third to Negro males (M =8.27), and last to 
white males (M = 7.87). For the female sub- 
jects, the two highest means were assigned to 
females; the two lowest means were for males. 
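
The two friendliness interactions can be read directly off the eight cell means. The following sketch is a hedged illustration, not the authors' computation: it assumes equal cell sizes, and the dictionary layout and function names are hypothetical. It averages over race to recover the Sex of Subject × Sex Stereotype means and ranks the four stereotyped groups separately for male and female subjects.

```python
# Friendliness cell means keyed by
# (sex of subject, race stereotype, sex stereotype), as reported in the text.
friendliness = {
    ("male", "white", "male"): 10.90,
    ("male", "white", "female"): 8.17,
    ("male", "Negro", "male"): 7.33,
    ("male", "Negro", "female"): 6.67,
    ("female", "Negro", "female"): 10.87,
    ("female", "white", "female"): 8.97,
    ("female", "Negro", "male"): 8.27,
    ("female", "white", "male"): 7.87,
}


def mean_for(subject_sex, stereotype_sex):
    """Average over race for one subject sex and one stereotyped sex."""
    values = [
        mean
        for (s, _, t), mean in friendliness.items()
        if s == subject_sex and t == stereotype_sex
    ]
    return sum(values) / len(values)


def ranking(subject_sex):
    """Stereotyped groups ordered from highest to lowest mean friendliness."""
    cells = [
        ((race, sex), mean)
        for (s, race, sex), mean in friendliness.items()
        if s == subject_sex
    ]
    return [label for label, _ in sorted(cells, key=lambda kv: kv[1], reverse=True)]


for subject_sex in ("male", "female"):
    print(subject_sex, "subjects:",
          "males", round(mean_for(subject_sex, "male"), 2),
          "females", round(mean_for(subject_sex, "female"), 2),
          "ranking", ranking(subject_sex))
```

The ranking for male subjects splits by race (both white groups first), while the ranking for female subjects splits by sex (both female groups first), which is the contrast drawn in the triple interaction.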

No significant results were obtained for the 
analysis of the data for the trait of thoughtful- 
ness. For personal relations, these Negro subjects 
assigned a greater degree of “tolerance and 
understanding of people” to whites (M = 10.58) 
than to Negroes (M = 9.07). 

The validity of our general procedure is seen 
in the data on masculinity. If the subjects were 
responding differentially in accordance with the 
differences in instructions, the sex stereotype 
for masculinity should be significant. The results 
satisfied this demand. The subjects gave males 
a mean of 15.96 on Masculinity and females a 
mean of 8.73 on this trait. 

For masculinity, the Sex of Subject X Race 
Stereotype was significant. The male Negro sub- 
jects attributed greater masculinity to whites (M 
= 13.18) than to Negroes (M = 10.75). The fe- 
male Negro subjects, however, saw Negroes (M 
= 12.63) as having more masculinity than whites 
(M = 12.05). Although the triple interaction 
missed being significant at p = .05, the data are
strongly suggestive of such an effect. Negro male 
subjects gave white males a mean masculinity 
score of 17.63; they gave Negro males a mean of 
13.90. In contrast, Negro female subjects gave 
Negro males a mean masculinity score of 16.43; 
they gave white males a mean masculinity score 
of 15.87. Finally, both groups of subjects, male 
and female, gave slightly higher femininity scores 
to Negro females than to white females. 


DISCUSSION


These results demonstrate that members of a
minority group do differentiate in the perception
of the personality characteristics of their own
group and of the majority group, as measured by
a standardized personality test. However, at least
three dimensions are involved in this process:
the trait being “stereotyped,” the sex of the
members of the group being assessed, and the
sex of the individuals doing the “stereotyping.”
Race was the sole differentiating factor for only
4 of the 10 traits: general activity, ascendance,
objectivity, and personal relations. Sociability
was differentiated solely on the basis of sex.
Interaction of race and sex was found for
restraint, emotional stability, friendliness, and
masculinity. Thoughtfulness showed no differentiation
on the dimensions used in this research.

In terms of basic impressions, some of these
data seem to be in the same direction as was
obtained in earlier studies (Bayton, 1941;
Meenes, 1943). With an adjective list, Negro
subjects characterized Negroes as being “lazy”
and whites as being “industrious.” In the present
research the Negro subjects gave whites a higher
score on general activity (having more pep,
vitality, and productivity). (True, they did not
say specifically that Negroes are “lazy.”) In
another instance, the present Negro subjects saw
Negroes as having less restraint (less serious-minded;
more happy-go-lucky). “Happy-go-lucky”
is a trait that is nearly always assigned
to Negroes when an adjective check list is used.
The present data, however, indicate that, for
Negro subjects, this particular stereotype is more
associated with Negro males (as against white
males) than with Negro females (as against
white females).

Although we have no explicit data on this
point, there seems to be some evidence of the
operation of the two directions of influence in
the identification-with-the-aggressor theory. In
every instance where differentiation of racial
groups, per se, occurred (general activity,
restraint, ascendance, emotional stability, objectivity,
and personal relations), the means for
whites were in the direction representing better
personality adjustment. This might result from
the combined tendencies to idealize the aggressor
and to incorporate his negative views toward
the minority group.
RECALL EFFICIENCY AND MEDIATING COGNITIVE EVENTS
IN “EXPERIMENTAL REPRESSION” 1


THOMAS J. D’ZURILLA 2


University of Illinois 


This study investigated the effects of ego-threatening instructions upon recall
efficiency and Ss’ cognitive experiences during a 5-min. period beginning with
the introduction of threat and ending with the recall trial. The latter was
approached by examining Ss’ verbal responses to certain postexperimental
interview questions. The results supported previous studies which have found a
decrement in recall due to ego threat; however, an interpretation of these
results in terms of “repression” was not supported. More of the threatened Ss
than of the nonthreatened Ss reported having thoughts about a previous task
which was associated with threat. The findings were discussed in terms of a
response competition hypothesis.


Recent studies concerning the role of awareness
(i.e., thoughts, ideas, hypotheses) as an
intervening variable in verbal conditioning (for
example, Dulany, 1962; Spielberger, 1962) have
suggested a promising method for a better
evaluation of studies purporting to demonstrate
“experimental repression.” A close examination
of subjects’ verbal responses to interview
questions concerning their cognitive experiences
during the experiment could serve to clarify
much of the ambiguous research in this area.
According to Zeller (1950a), the following three 
requirements are necessary for an adequate test 
of repression in a laboratory memory experiment. 
First, the study must demonstrate that the 
material in question has been learned by the 
individual. Second, it must show that the intro- 
duction of an inhibiting factor (i.e., ego threat) 
causes inability to recall or a significant decrease 
in recall of the material. Third, the experiment 
must show that the removal of the inhibiting 
factor results in the reinstatement of the ability 
to recall the material. Studies satisfying these 
criteria have been reported by Zeller (1950b) 
and Flavell (1955). 

Although Zeller’s criteria may be necessary
for an adequate laboratory test of repression,
they are not sufficient, since the changes in recall
efficiency described in these criteria can be explained
equally well by processes other than
repression (for example, response competition,
response suppression). Dollard and Miller
(1950) state that repression refers to the automatic
tendency, which is not under verbal
control, to stop thinking about and avoid remembering
the threat-associated cues. However,
the experimental design which Zeller advocates
for a test of repression does not include an
evaluation of the subject’s thoughts, following
the introduction of threat, to determine whether
or not this involuntary blocking of thinking and
remembering actually occurs. The only variable
under investigation is overt recall performance
(i.e., the subject’s report of the amount of material
he remembers at the time of the recall
trial), which cannot alone provide sufficient evidence
to justify an interpretation of repression.

The objectives of the present study were to 
verify Zeller’s findings regarding the relationship 
between ego-threatening instructions and overt 
recall performance and to explore the subject’s 
cognitive behavior during a 5-minute period be- 
ginning with the introduction of threat and 
ending with the recall trial. The latter objective 
was approached by examining the subject’s 
verbal responses to interview questions concern- 
ing his thoughts and ideas during the 5-minute 
period. On the basis of a repression hypothesis, 
the following predictions would be made: ego 
threat will produce a significant decrement in 
the recall of previously learned material associ- 
ated with the threat, and subjects who are 
threatened in connection with a certain task 
will tend to avoid remembering and thinking 
about things related to that task to a significantly
greater degree than will subjects who are not 
threatened. 


METHOD 
Subjects and Apparatus 
Procedure


The subjects were seen individually for a 1-hour
session. Upon their arrival for the experiment, subjects
were alternately assigned to an experimental
group (Group A) and a control group (Group B).

Initial “learning” and inkblot test. It was decided
to employ “incidental learning” of meaningful words
in the present study, rather than rote memorization
of nonsense syllables, in an attempt to approximate
more closely than did previous studies those conditions
under which repression is presumed to occur.
The 20 experimental words were first projected on
a screen, 1 at a time, for 2 seconds each. The
instructions to the subject were that he should
study the words to make sure that he was familiar
with all of them. Following the presentation of the
final word, the subject was then given 2 minutes
to write down as many of the words as he could
recall “so that a measure of his familiarity with the
words could be obtained.”

At this point the procedures diverge somewhat
for the two groups. Each subject in Group A was
told that he was participating in the development of
a projective inkblot test designed to detect latent
homosexual tendencies. The 10 inkblot slides were
then projected on the screen, 1 at a time, for 10
seconds each. The subject was instructed to study
each inkblot for the 10-second period and then to
report which of the two words on the slide better
described the inkblot. Prior to this task, however,
each subject was informed that one of the two
words on each slide was a response which homosexuals
tend to give whereas the second word was
a “normal” response. Since it was important to
avoid instructions which would arouse a great deal
of anxiety in Group B, these subjects were simply
told that they were participating in the development
of a new psychological inkblot test. Otherwise,
the procedure in taking the inkblot test was the
same as in Group A. Immediately following the
inkblot test in both groups a second 2-minute recall
trial was conducted. The explanation for this and
all other recall trials was that the nature of the
inkblot test materials was being evaluated. A recall
trial at this point was necessary because the inkblot
test was an additional “incidental learning” trial
and the prethreat level of learning had to be
obtained.

Introduction of ego threat. Each subject of
Group A was informed that he had chosen 9 of
the 10 “homosexual” words on the inkblot test. He
was also told that no student previously tested had
chosen more than 1 of them. The subject was then
given 5 minutes to work on a “neutral” task which
included a “general information form” (for example,
age, sex, college major, etc.) and a rating form on
which he was asked to rate the words “psychology”
and “experiment” on 10 scales taken from Osgood’s
semantic differential. The experimenter left the room
during this 5-minute period ostensibly to talk with
the director of the research project. In Group B,
on the other hand, subjects were simply told that
they had done very well on the inkblot test. They
were also given the 5-minute “neutral” task while
the experimenter was out of the room. A third
2-minute recall trial was conducted immediately
following the 5-minute period in both groups.

Fig. 1. Trends in mean number of words recalled over four recall trials by a group which was given threatening
information (Group A) and a control group (Group B).

Removal of threat and postexperimental interview. 
In this portion of the experiment something 
analogous to the removal of repression was at- 
tempted by confronting the subjects with the threat- 
associated words and then eliminating threat by 
revealing the “hoax.” The subjects were instructed
to rate each of the experimental words on 10 scales 
taken from Osgood’s semantic differential. Following 
this task, the true nature of the experiment was 
explained and a 2-minute “cathartic” discussion 
was held. The subjects of Group B also performed 
the semantic differential task, followed by a 2-minute 
discussion about the experiment. The final recall 
trial was then conducted and a structured interview 
was held in an attempt to identify the subject’s 
thoughts, ideas, and feelings at various points during 
the experiment. 


RESULTS 
Effects of Threat upon Recall Efficiency 


The trends in mean words recalled over the
four recall trials for the two groups are illustrated
in Figure 1. A Lindquist Type I analysis
of variance (Lindquist, 1953) was performed
on the recall data. A summary of this analysis
is presented in Table 1. Of primary concern
here is the Instructions × Recalls interaction
(I × R), which was significant beyond the
.01 level (F = 4.191, df = 3/120). It is apparent
from Figure 1 that this interaction was primarily
the result of the difference between Group A
and Group B in the change from R2 to R3.
Group A showed a decrement in recall from
R2 to R3 (following threat), whereas an increase in
recall, resembling a reminiscence effect, occurred
in Group B. A t was calculated on the mean
difference scores for the two groups which proved
to be highly significant (t = 4.523, p < .001).
The F associated with recalls (R) was also
significant, indicating a significant increase in
words recalled over trials due to practice effects
(F = 30.663, df = 3/120). The F associated
with instructions (I), however, was not
significant, which indicates that the two groups
were essentially equal with regard to the
number of words recalled at the conclusion of
the experiment, after threat had been removed.
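
The F ratios reported above can be recomputed from the sums of squares and degrees of freedom printed in Table 1. The sketch below is a minimal check under one assumption, which follows the usual layout of a Lindquist Type I (mixed) design: the between-subjects effect is tested against the between-subjects error, and the within-subjects effects against the within-subjects error. The dictionary keys and function name are illustrative only.

```python
# Sums of squares and degrees of freedom as printed in Table 1: (SS, df).
between = {
    "Instructions (I)": (0.007, 1),
    "Error (between)": (656.595, 40),
}
within = {
    "Recalls (R)": (256.923, 3),
    "I x R": (35.112, 3),
    "Error (within)": (335.215, 120),
}


def mean_square(ss, df):
    """Mean square is the sum of squares divided by its degrees of freedom."""
    return ss / df


ms_error_between = mean_square(*between["Error (between)"])
ms_error_within = mean_square(*within["Error (within)"])

# Each effect is divided by the error term of its own stratum.
f_instructions = mean_square(*between["Instructions (I)"]) / ms_error_between
f_recalls = mean_square(*within["Recalls (R)"]) / ms_error_within
f_interaction = mean_square(*within["I x R"]) / ms_error_within

# The component rows should add up to the printed totals.
total_ss = sum(ss for ss, _ in between.values()) + sum(ss for ss, _ in within.values())
total_df = sum(df for _, df in between.values()) + sum(df for _, df in within.values())

print(f"F(I) = {f_instructions:.3f}")
print(f"F(R) = {f_recalls:.2f}, F(I x R) = {f_interaction:.2f}")
print(f"total SS = {total_ss:.3f}, total df = {total_df}")
```

Computed this way, the ratios match the published values to within rounding of the mean squares, and the component sums of squares and degrees of freedom add up to the totals in the table.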


Effects of Threat upon Mediating Cognitive
Events

In answer to a question in the postexperimental
interview concerning performance during
recall trials, all 42 subjects indicated that they
had tried their best on all recall tasks and had
reported all the words that they were able to
remember. Only 1 subject reported that he had
recognized the true nature of the study (i.e., the
testing of recall ability) and had “practiced”
the experimental words between recall trials.
This subject was from Group A and the
“recognition” occurred during the 5-minute retention
interval between R2 and R3. Only 2 of
the 21 Group A subjects said they were not
affected by the threat, although overt signs of
anxiety were present in all subjects to some
degree (for example, blushing, muscle tremors,
perspiration, etc.). The other 19 subjects indicated
that the threat was quite effective, describing
their immediate reactions with such
terms as “shocked,” “surprised,” “dismayed,”
“embarrassed,” “anxious,” “worried,” “frightened,”
“jolted,” and “disturbed.”


TABLE 1
SUMMARY OF THE ANALYSIS OF VARIANCE

Source                  df         SS         MS         F
Between subjects        41      656.602     16.015
  Instructions (I)       1        0.007      0.007      <1
  Error                 40      656.595     16.415
Within subjects        126      627.250      4.978
  Recalls (R)            3      256.923     85.641    30.663*
  I × R                  3       35.112     11.704     4.191*
  Error                120      335.215      2.793
Total                  167     1283.852

* p < .01.


TABLE 2
NUMBER OF SUBJECTS IN EACH GROUP REPORTING
THOUGHTS ASSOCIATED WITH THE INKBLOT TEST

                       Thoughts
Group     Associated     Not associated     Total
A             13                8             21
B              5               16             21
Total         18               24             42

In order to investigate whether the threatened 
subjects tended to avoid thinking and remembering
about the threat-associated material, subjects
of both groups were classified into two
subgroups on the basis of their responses to
the following questions concerning their thoughts
during the 5-minute retention interval (between
R2 and R3).

1. Describe what you were thinking about
during the period when I was out of the room. 

2. Did you think about anything else related 
to the experiment during this period? 

Those subjects who reported having thoughts 
about either the inkblots or the experimental 
words were classified into an inkblot test associ- 
ated category. Those subjects who indicated that 
they thought only about the “neutral” task or 
about anything else except the inkblots or the 
experimental words were classified as not ink- 
blot test associated. Table 2 shows the number 
of subjects from Group A and Group B that fell 
into each of the two categories. The results 
were in the opposite direction from the predic- 
tion made on the basis of a repression hypothe- 
sis. Thirteen (62%) of the Group A subjects 
were inkblot test associated whereas in Group B 
only five (24%) fell into this category, A chi- 
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square test performed on the data proved to be 
significant at the .05 level (x? = 4.764, df =1). 
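
The reported chi-square can be reproduced from the frequencies in Table 2. The sketch below is illustrative; the function name is hypothetical, and the use of Yates' continuity correction is an inference from the arithmetic (the text does not say which formula was used, but the corrected statistic is the one that matches the published value).

```python
def yates_chi_square(a, b, c, d):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] with Yates' correction."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Rows: Group A, Group B. Columns: associated, not associated (Table 2).
chi2 = yates_chi_square(13, 8, 5, 16)

# Tabled critical value of chi-square for df = 1 at the .05 level.
CRITICAL_05 = 3.841

print(f"chi-square = {chi2:.3f}, significant at .05: {chi2 > CRITICAL_05}")
print(f"Group A associated: {13 / 21:.0%}, Group B associated: {5 / 21:.0%}")
```

The same frequencies give the 62% and 24% quoted in the text.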


DISCUSSION 


The results of this study support Zeller’s and 
Flavell’s findings concerning the effects of ego- 
threatening information and the removal of 
threat upon recall efficiency. However, the in- 
formation reported by the subjects in the post- 
experimental interview indicates that an inter- 
pretation of these results in terms of something 
analogous to repression is not justified. In con- 
trast with the expectation on the basis of a 
repression hypothesis, there appears to have been 
a tendency for the threatening stimulation to 
sensitize subjects with regard to the threat- 
associated material (i.e., the inkblots and experi- 
mental words). This effect occurred despite the 
fact that the threat seemed to have been quite 
disturbing to most subjects and was more 
clearly analogous to the kinds of threat presumed 
to invoke repression than the conditions most 
previous investigators have employed. 

A more likely explanation for the present 
findings is response competition. Although the 
effects of competing response tendencies have 
been discussed primarily in connection with the 
relationship between anxiety and learning effi- 
ciency (see Jones, 1961), they could also apply 
to performance on recall trials. In this con- 
nection the hypothesis discussed by Child (1954) 
seems most suitable. Following from this view- 
point, the interpretation of the obtained recall 
decrement is that certain irrelevant cognitive 
response tendencies (thoughts)—elicited by the 
threatening stimulation—increased the conflict 
between various competing response tendencies 
already involved in the recall task, thereby
causing a disruption in performance on that task.
The postexperimental interview data concerning 
mediating cognitive events in Group A and 
Group B tend to support this interpretation. 
In addition to thoughts about the experimental 
words during the 5-minute retention interval, 
the Group A subjects also reported having more 
thoughts during this period which were irrelevant 
to the task of recalling as many of the words as 
possible in 2 minutes (for example, thoughts 
about the inkblots, past memories, what to do 
about the “homosexual tendencies”). This in- 
crease in amount of conflicting cognitive events 
could have reduced the efficiency of recall in 
Group A below the level of Group B. The removal
of threat served to eliminate these conflicting
cognitive tendencies and, therefore, performance
was again unaffected.




BRIEF ARTICLES 


REFERENCES 


CHILD, I. L. Personality. Annual Review of Psychology,
1954, 5, 149-170.

DOLLARD, J., & MILLER, N. E. Personality and
psychotherapy. New York: McGraw-Hill, 1950.

DULANY, D. E. The place of hypotheses and intentions:
An analysis of verbal control in verbal
conditioning. In C. W. Eriksen (Ed.), Behavior
and awareness. Durham: Duke Univer. Press, 1962.
Pp. 102-129.

FLAVELL, J. H. Repression and the “return of the
repressed.” Journal of Consulting Psychology,
1955, 19, 441-443.

JONES, H. J. Learning and abnormal behaviour. In
H. J. Eysenck (Ed.), Handbook of abnormal
psychology. New York: Basic Books, 1961. Pp.
488-528.

LINDQUIST, E. F. Design and analysis of experiments
in psychology and education. Boston: Houghton
Mifflin, 1953.

SPIELBERGER, C. D. The role of awareness in verbal
conditioning. In C. W. Eriksen (Ed.), Behavior
and awareness. Durham: Duke Univer. Press, 1962.
Pp. 73-101.

ZELLER, A. F. An experimental analogue of repression:
I. Historical summary. Psychological Bulletin,
1950, 47, 39-51. (a)

ZELLER, A. F. An experimental analogue of repression:
II. The effect of individual failure and success
on memory measured by relearning. Journal
of Experimental Psychology, 1950, 40, 411-422. (b)

(Received July 19, 1963)


Journal of Personality and Social Psychology
1965, Vol. 1, No. 3, 257-261


A COMPARISON OF PERSONALITY FACTOR STRUCTURE IN
AMERICAN AND ITALIAN SUBJECTS 1


ANDREW L. COMREY
University of California, Los Angeles


LUIGI MESCHIERI, RAFAELLO MISITI, AND RODOLFO NENCINI
Consiglio Nazionale delle Ricerche, Rome


Earlier investigations have been devoted to developing a set of factors to
describe personality in American Ss. The present study was an attempt to
determine if similar results would be obtained using Italian Ss. An Italian-language
version of an English-language factored personality test was administered
to 507 volunteer Italians. Factor analysis was used to select items best
defining each of 28 personality dimensions. Total scores over these item
groups were intercorrelated along with 2 validation scales, age and sex. Factor
analysis of this 32 × 32 correlation matrix produced a factor solution with
substantial similarity to that for American Ss.


Several recent investigations by the authors
have been devoted to the task of providing a set
of personality factors which could be duplicated
from one study to the next (Comrey, 1960, 1961,
1962b, 1964; Comrey & Nencini, 1961; Comrey
& Schlesinger, 1962). Much of this work, however,
has been concerned only with subjects from
the United States. With these subjects, a certain
degree of stability from study to study has been
achieved for some of the most important factors.
The present paper reports the results of a study
to determine to what extent these same factors
could be duplicated in a sample of Italian subjects.
The plan was to repeat as nearly as possible
the most recent study carried out with American
subjects (Comrey, 1964) except that Italian
subjects and an Italian language inventory would
be used.


1 This research was carried out while the first
author was a National Science Foundation Senior
Postdoctoral Research Fellow. Support for a portion
of the work came from a grant by the University
of California. The authors are especially indebted to
S. Chiari of the Centro di Orientamento Scholastico-
Professionale di Pesaro, B. Celli, G. Cruciani, and
other members of the Istituto Nazionale di Psicologia
for their assistance in preparing the Italian language
inventory and finding subjects.

PROCEDURE 


Sample 

Five hundred and seven volunteers were tested,
284 males and 223 females. The male subjects were
divided as follows: 120 University of Rome students,
91 persons known to members of the Istituto
Nazionale di Psicologia and the Istituto Superiore
di Sanità in Rome, 48 students from an agricultural
trade school in Pesaro, and 25 miscellaneous. The
female subjects were divided as follows: 60 school
teachers and prospective school teachers from Pesaro
and Urbino, 35 social-work school students, 35 nurses
and nursing students, 30 dietetics students, 45 University
of Rome students, and 18 miscellaneous
volunteers. Except where noted, the subjects were
residents of Rome. The great majority were in their
late teens and early twenties. Well over half the
males were university students or university graduates.
About half the females fell in this category.
Each subject was promised an explanation of his
test results as an inducement to participate. The
corresponding sample of American subjects differed
in having a higher proportion of females, married
persons, and nonstudents.


Inventory 


The previous study with American subjects was
based upon 30 personality dimensions and two validation
scales. Two of the personality dimensions,
Sex Concern and Sex Drive, were omitted from this
study because some advisors felt that the social
climate in Italy did not favor inquiry into this
particular subject matter. Age and Sex, accordingly,
were substituted for these two variables in the present
analysis. For the remaining 28 personality dimensions
and two validation scales, multiple-choice personality
items were prepared in Italian, attempting to capture
the flavor of the English-language variable without
resorting to literal translation of items.

Applying a technique described elsewhere (Comrey,
1961), the first author initially prepared six items
in Italian to measure each of the 28 personality
dimensions and two validation scales, attempting to
maintain homogeneity of item content within dimension
subsets. These items were rewritten in good
Italian by the other authors and members of the [...]
were rechecked by the first author and modified
where necessary [...]
by an X or a Y to indicate to the subject which
response scale he should use.

Each of the 32 variables is listed below,
illustrated by a translated sample item:

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1. Need to Excel (23X). If someone is better 
at something than I am, I try with all my might 
to do better than he does. 

2. Succorance (24X). I like to discuss my plans 
with my friends in order to hear their opinions, 

3. Need for Order (3X). I keep all my things 
in their proper place so that I will know exactly 
where to find them. 

4. Sensitivity (4X). Criticism, even if minor, 
annoys me. 

5. Cynicism (SY). The majority of public em- 
ployees would accept bribes if they were large 
enough. 

6. Shyness I (6X). It is difficult for me to 
converse with a person I have just met. 

7. Depression (73X). I think that there are more 
disappointments than satisfactions in life. 

8. Conformity (30X). I avoid appearing to be 
different from the others in the group of which 
I am a part. 

9. Hostility II (31X), The majority of people 
nauseate me. 

10, Ascendance (32X). If someone tries to take 
advantage of me, I am quick to react. 

11. Friendliness (11X), I try to enlarge my circle 
of friends, 

12, Truth Scale (34X). I have stolen something. 

(This was a validation scale.) 

13. Drive to Finish (14X). When I start a job,
I finish it.

14. Self-Control (15X). No matter how angry
I get, I manage to control myself.

15. Reaction to Authority (82X). I have had
incompetent people over me.

16. Psychopathic Personality (39X). If I found 
a wallet, I would keep the money in it. 

17. Self-Sufficiency (18X). I like to be alone. 

18. Social Desirability (20X). I give what I can
to the poor. (This was a validation scale.)

19. Rigidity (43X). I have the courage to express
and support my opinions even when they
are unpopular.

20. Cautiousness (44X). I avoid running risks.

21. Age. Only the first digit of the age was used,
i.e., “1” or “2” for most subjects.

22. Inferiority Feelings (133Y). Most of the
people I know are more intelligent than I am.

23. Need for Approval (134X). It is important
to me to be approved by the community.

24. Hostility I (135X). If someone makes a
social blunder, I like to tell everyone about it.

25. Personal Grooming (136X). I try to be 
well groomed. 

26. Aggression (137Y). I can become enraged
to the point of striking someone.

27. Shyness II (138X). It would be difficult for
me to make a speech.

28. Agitation (161Y). I think that my nerves
are very sensitive.

29. Defensiveness (163X). To be trusting is
good, but not to be trusting is better.

30. Talkativeness (142X). I am a talkative
person.

31. Sex. A “1” was scored for males and a “0”
for females.

32. Impulsiveness (144X). I make decisions
without having thought much about them.


Description of the Analyses of Items for the
e Study 


Items were included in the booklet to measure
additional dimensions besides those used in the
analysis comparing results with a previous study.
These 14 additional dimensions did not appear in the
earlier study, hence were not included in the present
comparison analysis. The total of 44 dimensions was
used to obtain correlations between the six-item
dimensions as originally prepared. The correlations
among these dimensions provided the basis of
dividing up the dimensions into smaller groups for
analysis of items. In selecting the dimensions to go
in a particular group, care was taken to include in
the same group dimensions which were substantially
correlated. This was done to see if their items
defined a single item factor, thereby requiring the
two dimensions to be combined or one of them to
be dropped.
In Item Analysis I, then, six items were included


II, Defensiveness, Psychopathic Personality,
Truth Scale, Social Desirability, Rhathymia, Help-
Self-Control, Cautiousness, Aggression, Impulsiveness.
In Item Analysis II, the following
dimensions were included: Shyness I, Shyness II,
Talkativeness, Sensitivity, Depression, Satisfaction
with Life, Inferiority Feelings, Agitation, Pessimism,
Fear of Responsibility. In Item Analysis III, the
following dimensions were included: Need to Excel,
Ascendance, Adequacy, Drive to Finish, Activity, Need
for Order, Personal Grooming, Rigidity, Calmness,
Impatience, Welfare of Loved Ones, Thoughtfulness.
In Item Analysis IV, the following dimensions were
included: Hostility I, Reaction to Authority,
Succorance, Friendliness, Need for Approval,
Self-Sufficiency, Conformity, Culture, Need for Freedom.
In each of these item analyses, Pearson correlation
matrices were computed for the items, six items
included per dimension. This resulted in 72-, 60-,
72-, and 54-variable matrices in the four analyses.
Factors were extracted by the minimum residual
method and rotated analytically by the Kaiser
normal varimax. Care was taken to extract at least
more factors than the number of dimensions
included in the analysis. It was necessary to break
the items into groups this way because all items
could not have been analyzed in one
matrix.


Each of the 30 dimensions described in the
paper emerged as a factor in one of these analyses
of items, although in most instances not all six
items had loadings high enough to warrant their
retention. Sensitivity split into two factors, only the
more prominent of which was retained. Items for
the Depression factor came both from the Depression
dimension and the Satisfaction with Life dimension.
Personal Grooming and Agitation also split
into two factors. Again, only the most prominent
factors were retained.

From inspection of these item factor analysis 
results, items were selected for each of the 30 
dimensions described in the paper which had loadings
of at least .40, and in the great majority of cases
.5 or more, on that item factor without having a
major loading on any other item factor. In some 
cases only two items were used, due to a splitting 
off of some of the items on a weaker second factor, 
or because only two of the items had sufficiently 
high factor loadings to be used. Other dimensions 
had more items and in several cases all six of the 
original items were retained. 
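
The selection rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the paper gives .40 as the minimum loading on an item's own factor, but it does not say what counted as a "major" loading on another factor, so the `max_other` cutoff below is hypothetical, as are the item names, loadings, and responses.

```python
def select_items(loadings, own_factor, min_own=0.40, max_other=0.40):
    """Keep items that load on their own factor and on no other factor.

    loadings maps item id -> {factor name: loading}.
    min_own follows the .40 minimum stated in the text.
    max_other is an ASSUMED cutoff for a "major" loading elsewhere.
    """
    kept = []
    for item, row in loadings.items():
        own = abs(row.get(own_factor, 0.0))
        others = [abs(v) for name, v in row.items() if name != own_factor]
        if own >= min_own and all(o < max_other for o in others):
            kept.append(item)
    return kept


def total_score(responses, items):
    """Sum one person's responses over the retained items only."""
    return sum(responses[item] for item in items)


# Hypothetical loadings for four items written for factor "F1".
example_loadings = {
    "item_a": {"F1": 0.69, "F2": 0.10},
    "item_b": {"F1": 0.71, "F2": 0.05},
    "item_c": {"F1": 0.35, "F2": 0.12},  # too low on its own factor
    "item_d": {"F1": 0.55, "F2": 0.48},  # major loading on another factor
}
kept_items = select_items(example_loadings, "F1")

# Hypothetical responses from one subject.
example_total = total_score(
    {"item_a": 5, "item_b": 3, "item_c": 6, "item_d": 2}, kept_items
)
print(kept_items, example_total)
```

With a rule of this kind a dimension can end up with as few as two items, which is the situation the text reports for some dimensions.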

Elsewhere, in the ADI² materials, the items used
to measure each dimension are listed. The major
factor loading for each item used is shown in
Table 1, except for the items in the Truth Scale,
Social Desirability Scale, and the Validity Scale.
Since each of the scales used did define its own
factor in the analyses of items, the loadings given
for those items on a given scale are for the same
factor. In a few cases, items had loadings less than
the one shown here on some other factor but still 
of some importance. 


Analysis 


Several factor analyses of items were carried out 
to determine if the items written to measure a 
particular factored homogeneous item dimension 
(Comrey, 1961) were, in fact, doing so. A separate 
item factor emerged for each of the 28 personality 
dimensions listed above, although Sensitivity and 
Agitation each split into two factors. In these two
cases, the larger factor was chosen for inclusion in
the analysis of total dimension scores. For each of
the personality dimension factors, the items with
loadings of .47 or more were retained. Total scores
over these retained items were computed for each 
person on each of the 28 personality dimensions. 
Total scores over all six items were obtained for 
each of the two validation scales. Dimensions 4, 8, 
15, and 19 had only two items each; Dimensions 
2, 9, 13, 28, and 29 had three items each; the 
remaining dimensions had at least four items. 


² The computations for this research were carried
out on the IBM 7090 operated by the UCLA com- 
puting facility. The inventory and answer sheet, a 
list of the items used for each dimension analyzed, 
the matrix of correlations among the dimensions, the 
means and standard deviations of the dimension 
scores, the matrix of minimum residual factor 
loadings, the matrix of 14 varimax factors, the matrix 
of final rotated factor loadings, and the transforma- 
tion matrix to get the final loadings from the 
varimax factor loadings have been deposited with 
the American Documentation Institute, Order Docu- 
ment No. 8183 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress, Washington, D. C. 20540. Remit in advance 
$1.75 for microfilm or $2.50 for photocopies and 
make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 
TABLE 1
MAJOR FACTOR LOADING FOR EACH ITEM USED

Item  Loading    Item  Loading        Item  Loading
 23     .69      119     .69          133     .53
 45     .71       10     .67          155     .53
 67     .11       32     .14          177     .46
 89     .71       54     .70          199     .71
 24     .69       76     .60          221     .70
 46     .78      120     .45          243     .64
 90     .14       11     [illegible]  134     .65
  3     .84       33     .72          156     .63
 25     .87       55     .84          178     .55


Total scores on the dimensions and scales were 
intercorrelated, together with the Age and Sex variables,
to give a 32 × 32 matrix of Pearson correlation
coefficients. Nineteen factors were extracted by the
minimum residual method 


RESULTS 


For each factor, loadings of .3 or more will 
be presented. Following each factor loading will 
be given in parentheses the factor loading for
the same name variable in the previous analysis 
on the factor which matches most closely with 
the one in the current study. In the case that a 
variable had a loading of .3 or more on the 
factor in the previous analysis, the factor loading 
for the present study will be given even if it is less
than .30. If the variable does not appear in one
of the two studies, a dash (—) will be entered 
instead of a factor loading. If no loadings are 
given in parentheses, the factor is one for which 
no corresponding factor appeared in the previous 
analysis. 

Hostility. 5. Cynicism, .63 (.77); 9. Hostility
II, .68 (.63); 29. Defensiveness, .55 (.49); 16.
Psychopathic Personality, .51 (.50); 15. Reaction
to Authority, .39 (.08); 24. Hostility I, .32 (.08);
and 18. Social Desirability, −.30 (−.18).

Neuroticism. 22. Inferiority Feelings, .67
(.79); 7. Depression, .67 (.70); 6. Shyness I,
.48 (.48); 27. Shyness II, .45 (.59); 28. Agitation,
.40 (.63); 17. Self-Sufficiency, .33 (.02);
and 2. Succorance, .08 (.35).

Compulsion. 3. Need for Order, .63 (.78); 13.
Drive to Finish, .53 (.63); 25. Personal Grooming,
.49 (.60); 29. Defensiveness, .35 (.04);
1. Need to Excel, .30 (.11); and 20. Cautiousness,
.13 (.36).

Dependence. 11. Friendliness, .64 (.28); 2.
Succorance, .59 (.47); 17. Self-Sufficiency, −.50
(−.48); 23. Need for Approval, .46 (.54); 30.
Talkativeness, .34 (.21); 8. Conformity, −.06
(.64); and 24. Hostility I, .11 (.31).

Social Desirability. 16. Psychopathic Personality,
−.60 (−.55); 18. Social Desirability, .58
(.48); 12. Truth Scale, −.57 (−.67); 31. Sex,
−.36 (—); 21. Sex Concern, — (−.38); and
24. Hostility I, −.22 (−.32).

Talkativeness. 30. Talkativeness, .58 (.61);
6. Shyness I, −.35 (−.35); 27. Shyness II,
−.34 (−.31); and 19. Rigidity, .20 (.63).

Impulsiveness. 32. Impulsiveness, .69 (.71);
20. Cautiousness, −.56 (−.46); 14. Self-Control,
−.49 (−.02); and 26. Aggression, .44 (.25).

Self-Control. 14. Self-Control, .52 (.82); 26.
Aggression, −.42 (−.27); and 28. Agitation,
−.41 (−.34).

Sensitivity. 4. Sensitivity, .44 (.45); and 23.
Need for Approval, .24 (.53).

Friendliness. 6. Shyness I, −.46 (−.44); 27.
Shyness II, −.35 (−.04); and 11. Friendliness,
.03 (.62).

Dominance. 19. Rigidity, .58 (.12); 10.
Ascendance, .47 (.68); and 15. Reaction to
Authority, .13 (.53).

Age. 21. Age, .69.

Sex. 31. Sex, .56; and 25. Personal Groom-


Discussion 


Hostility, Compulsion, Neuroticism, Dependence,
and Social Desirability were the most
prominent factors in the study with American
subjects. They were also the most prominent
factors in the present analysis based upon Italian
subjects. Although the correspondence between
these factors is by no means perfect, the similarity
of factor structure in the two investigations
is unmistakable. Perhaps the most important
discrepancy in the findings among these
major factors occurred with the total test score
variables Friendliness and Conformity. Friendliness
had a high loading on the Dependence factor
in this study but not in the previous one. On
the other hand, Conformity had a negligible loading
here and a high loading in the previous study.
Inspection of the Friendliness items reveals no
obvious difference in item content between the
two studies. There were five items in the current
study, however, as opposed to three for the
earlier analysis. This may have accounted to
some extent for the higher correlations obtained
here for Friendliness with Succorance and
Self-Sufficiency, i.e., .36 and −.45 as compared with
.12 and −.27. Rotational solution differences may
also be involved since Friendliness defined a
separate factor in the previous study.

Conformity was defined by two items here and
by five items in the previous study. The group
of items from the previous study suggested to a
greater extent, upon retrospective examination,
a dependence upon the group for support. The
two items for the Italian subjects, on the other
hand, concern the extent to which the individual
tries to avoid standing out in a group. These
two items may have assessed primarily the extent
to which the individual was concerned about
making a bad social impression, a matter of
considerable importance in the Italian culture. If so,
this could account for the failure of this pair of
items to produce a substantial loading on the
Dependence factor with Italian subjects.

Among the factors of secondary importance,
Impulsiveness, Talkativeness, Self-Control, and
Sensitivity are readily identified as corresponding
to a certain extent in the two studies. The
Friendliness and Dominance factors match
poorly, however, sharing only one substantially
loaded variable each. Age and Sex each defined
a factor in the current study, but not in the
earlier investigation, where these two variables
were not included. Conformity was the only
variable with a loading over .30 on one factor 
in this study, but no such factor emerged in the 
other investigation. The study with American 
subjects also had two factors which did not 
appear here, Need to Excel and Sex Drive. The 
latter was a singlet defined by a variable omitted 
from this analysis. Need to Excel was defined 
by loadings of .54 and .41, respectively, for Need
to Excel and Drive to Finish. The correlation
between these two variables with American
subjects was .29. The corresponding correlation 
for Italian subjects was .24. Since this dif- 
ference is small, the emergence of such a factor 
in one study and not the other may be due at 
least partially to the particular rotations carried 
out. 

Two earlier investigations using Minnesota 
Multiphasic Personality Inventory items showed 
substantial similarity in personality factor struc- 
ture between American and Italian subjects 
(Comrey, 1960; Comrey & Nencini, 1961). 
These preliminary findings together with those 
from the present study lend support to the 
supposition that if a well-substantiated factor 
structure of personality can be worked out for 
American subjects, it probably will apply reason- 
ably well to Italians and perhaps other western 
Europeans.
2 hypotheses about resistance to influence were tested: (a) warning Ss
that they are going to be exposed to a communication with which they 
disagree will increase their resistance to influence; (b) distracting Ss from 
the content of such a communication will decrease resistance to influence. 
The results support the 1st hypothesis, but provide only minimal support for
the 2nd. In addition the study provides data on the mechanisms by which 
forewarning increases resistance, and suggests that an active rehearsal of 
supporting arguments is the major process by which resistance is increased. 


It has generally been assumed that subjects 
who are forewarned about the content of a 
communication with which they disagree will be 
less influenced by it than subjects who are not 
forewarned. This has occasionally been stated 
explicitly (for example, Lazarsfeld, Berelson, 
& Gaudet, 1948, p. 152), but more often has 
been an implicit assumption built into research 
on attitude change, and into actual attempts to 
influence people’s opinions. Although this assump- 
tion appears quite plausible, there is little 
experimental evidence supporting it. 

The only experimental study that provides 
directly relevant data is one conducted by Allyn 
and Festinger (1961) in which they report that 
subjects who were forewarned are influenced less 
by a persuasive communication than subjects 
who were not warned. Unfortunately, an impor- 
tant source of ambiguity in the design of the 
study makes this result difficult to interpret. 
The forewarned subjects were told that they 
would be asked for their opinions after the talk, 
while the nonwarned subjects were told that 
they would be asked for their impressions of the 
speaker's personality. Festinger and Maccoby
(1964) point out this confounding of warning 
and orientation and contend that the difference 
between the groups in amount of opinion change 
was due primarily to the difference in orientation. 
They suggest that the personality instructions 
tended to distract subjects from the content 
of the speech, made it more difficult for them 
to defend themselves against the speaker's argu- 
ments, and consequently produced more change 
in the direction advocated by the talk. They
present additional evidence to support this
distraction hypothesis.

Although this explanation by Festinger and
Maccoby is quite compelling, the original
hypothesis, that forewarning increases resistance to
influence, remains both plausible and interesting.
Because of the confounding in the Allyn and
Festinger study, however, it remains essentially
untested. The primary purpose of the present
study is to provide an effective test of this
hypothesis.

¹ This study was supported in part by a grant
from the National Science Foundation.

The study also attempts to specify in more 
detail the mechanisms by which the forewarning 
increases resistance, if such an increase is found. 
One possible reason forewarning may increase 
resistance is that source derogation may occur 
when the subject expects a highly discrepant 
communication from a previously unknown 
source. Telling the subject that the speaker is 
going to disagree with him may tend to make 
the subject suspicious of and hostile to the 
speaker, and thus cause him to derogate him. 
To the extent that the speaker is derogated, the 
communication will have less effect. To test this 
possibility, all subjects in the present study are 
given a neutral description of the speaker prior 
to the speech, and measures of rejection of the 
speaker are taken after the speech. 

Another mechanism which might decrease the
effectiveness of the talk is decreased attentiveness
on the part of the subject. If the subject
does not listen at all, presumably he will not be
influenced. Telling the subject ahead of time
what position the speaker will take may cause
him to avoid listening in order to avoid exposure
to discrepant opinions. A recall test was given 
after the speech to test this possibility. 

In addition to these mechanisms, the warning 
may activate a more active defensive process. 
During the period between the warning and the 
communication subjects may rehearse, recall, or 
even construct arguments supporting their initial 
position, and may attempt to refute arguments
against their position. This would strengthen their 
position by making them more confident and 
giving them additional information with which 
to counter the arguments presented in the 
communication. An important implication of this
type of defensive mechanism is that a longer
warning should produce a greater increase in
resistance since it would enable the subjects to
go over more arguments. Therefore an indirect
test of this active process would be a comparison
of the effect of different lengths of warning on
resistance to influence. If increased resistance
stems only from increased suspiciousness or
from lack of attentiveness, a 2-minute warning
should be as effective as a 10-minute warning;
but if this active defensive process occurs, the
longer warning should produce greater resistance
than the shorter one.

A final purpose of the study is to provide
additional evidence on the idea advanced by
Festinger and Maccoby that mild distractions
increase the effectiveness of a persuasive
communication. In the present study some subjects
are told to pay attention to the content of the
talk and some are instructed to pay
attention to extraneous stimuli. If distraction
does reduce resistance to influence, there should
be more change in the distraction condition.

To summarize, the study is designed to test 
two main hypotheses: 


1. Subjects who are forewarned about the
content of a talk that disagrees with them will
change less than those who are not forewarned.
2. Subjects who are told to pay attention to
extraneous stimuli will change more than those
who are told to pay attention to the content
of the talk.


In addition, an attempt is made to specify
how the forewarning increases resistance. Three
possible defensive mechanisms are considered:
increased derogation of the communicator,
decreased attentiveness to the talk, and rehearsal
of supporting arguments before the
countercommunication.




METHOD 


Measures of attitudes toward teen-age driving were
obtained from high-school students several weeks
before and immediately after these students listened
to a talk² which was strongly opposed to teen-age
driving. The design included four main conditions.
Some subjects were warned 10 minutes before the
talk that it would be against teen-age driving; some
subjects were given no warning. Half of each of
these groups were instructed to pay attention to the
content of the talk and half were instructed to pay
attention to the personality of the speaker. In
addition, a fifth group was given the content
instruction, and was warned only 2 minutes before the
talk. This group was included specifically to test
the hypothesis that earlier warning would produce
greater resistance; hence no comparable personality
condition was required.

² The authors are grateful to Vernon Allen and
Alan Gross for giving these talks in the various
administrations.

The subjects were 198 seniors from Cubberley High
School in Palo Alto, California, and 143 seniors from 
Mountain View High School in Mountain View, 
California. Of these, 299 both took the prequestion- 
naire and were present at the talk. The data from 
7 of these subjects were discarded because they 
arrived late to the final testing session or did not 
fill out the second questionnaire. 

The experimental sessions were conducted in large 
classrooms or auditoriums at the high schools. A 
total of four sessions were run, two at each school.
All instructions and questionnaires were included in 
a booklet that was handed to each student after 
all were seated. The booklets were handed out in 
a random order, and subjects in all five groups were 
run in each session. 

All subjects went through the following steps. 
Ten minutes before the beginning of the talk, they 
read the initial instructions; then they took a 
brief, irrelevant questionnaire which was included to 
make the delay seem reasonable and to occupy some 
of that time; 2 minutes before the talk they read 
a second set of instructions; and then the speaker 
was introduced. Immediately after the talk, which 
lasted for about 12 minutes, the subjects filled out 
a questionnaire containing questions on attitudes 
toward teen-age driving and toward the speaker. 

The initial instructions for all subjects began with 
the following: 


This is part of a survey of students’ opinions on
a wide variety of topics. We are going to ask
you a number of questions and would appreciate
it if you would answer them all as honestly as
possible.

They also included these two sentences:

The speaker is Dr. Vernon Allen, a noted expert on
automobiles, highway administration and driving
in general, who has kindly agreed to come here
today and talk to you. [and] Please pay close
attention to him and listen carefully to what he
has to say.

The rest of the initial instructions differed for the
various groups. 


Warning Manipulation 


For the 10-minute warning group the initial 
instructions also included the following warning: 


The title of his [Dr. Allen’s] talk is “Why Teen-Agers
Should Not Drive.” Dr. Allen will tell you
why he thinks that teen-agers should not be
allowed to drive.

For the no-warning and 2-minute warning groups, 
the initial instructions simply omitted this statement 
of the title and content of the talk. However, the 
statement was included in the second set of instructions
for the 2-minute group; whereas for all other
subjects the second instructions merely stated:


You are now going to hear the talk by Dr. Allen. 
Pay close attention during the talk. 


In other words, the warning was given either in 
the initial instructions which were read 10 minutes
before the talk, in the second instructions which 
were read 2 minutes before the talk, or only im- 
mediately before the talk when the speaker was 
introduced and the title of his talk was announced. 
The presence or absence of the two-sentence warning 
is the only difference between warning conditions. 


Distraction Manipulation 


The orientation of subjects was varied in the 
initial instructions. Content orientation instructions 
included the following sections: 


In particular we are interested in your reactions 
to a talk which you are going to hear... . You 
will be asked to give your opinions on this problem 
after he has finished his talk. 


The corresponding parts of the personality orienta- 
tion instructions read: 


In particular we are interested in your impressions 
of a speaker you are going to hear... . You will
be asked to give your impressions of his personal- 
ity after he has finished his talk. 


This was the only difference between orientation 
conditions.

The data consisted of responses to the two questionnaires.
The prequestionnaire contained two questions
on teen-age driving for Cubberley subjects, and
four for Mountain View subjects. For both schools 
the critical items were imbedded in irrelevant items 
to make a total of 12 questions. The postquestion- 
naire contained 4 questions on teen-age driving, 
ratings of rejection of the speaker, a number of 
other evaluations of the speaker, and (for Cubberley 
subjects) a recall measure. All questions were 
answered on undivided scales labeled at both ends. 


RESULTS 


Scores on the prequestionnaire indicated that 
before the talk virtually all subjects were 
against any limitations on teen-age driving. The 
overall mean was 3.75 with 14 being maximum 
agreement with the limitations. There were no
appreciable differences among the groups on these
ratings. All results are presented in terms of
change scores from the initial ratings. For subjects
from Cubberley the mean change scores
for each subject are based on change on two
questions; for Mountain View subjects they are
based on four questions. Nevertheless, there are
no differences between corresponding groups from
the two schools that even approach significance;
and a preliminary analysis indicated that no
appreciable amount of variance was attributable
to the different schools. Therefore, the analysis
was done combining subjects from the two
schools.


TABLE 1
MEAN CHANGE FOR ALL CONDITIONS

                          Warning (a)
Orientation          0            2            10
1. Content       1.54 (68)     .94 (57)     .55 (56)
2. Personality   1.83 (53)                 1.31 (58)

Note.—Group N appears in parentheses.
a In minutes.


TABLE 2
ANALYSIS OF VARIANCE OF FOUR MAIN GROUPS IN TABLE 1

Source               MS        df      F
Warning (A)          503        1     4.88
Orientation (B)      201        1     1.95
A × B                 98        1     <1.0
Error             102.88
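
The change scores described above can be sketched as follows. This is a minimal illustration, not the authors' procedure: it assumes each subject's change score is the mean of (post minus pre) over whichever driving questions appeared on both questionnaires, two for Cubberley and four for Mountain View, and the ratings shown are hypothetical.

```python
def mean_change(pre, post):
    """Mean of (post - pre) over the questions a subject answered twice.

    pre and post are equal-length lists of ratings on the same questions,
    in the same order. Averaging puts two-question and four-question
    subjects on the same scale (an assumption about the scoring).
    """
    if len(pre) != len(post) or not pre:
        raise ValueError("pre and post must be non-empty and equal in length")
    return sum(after - before for before, after in zip(pre, post)) / len(pre)


# Hypothetical subject with two matched questions.
two_question_change = mean_change([3.0, 4.0], [5.0, 5.0])

# Hypothetical subject with four matched questions.
four_question_change = mean_change([2.0, 3.0, 4.0, 5.0], [3.0, 3.0, 6.0, 5.0])

print(two_question_change, four_question_change)
```

Positive values indicate movement toward agreement with limitations on teen-age driving, the direction advocated by the talk.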

The major results are presented in Table 1, 
which shows the differences among groups in 
amount of change from initial to final question- 
naire in attitudes toward teen-age driving. The 
first hypothesis to be tested is that forewarning 
increases resistance to influence. As may be seen 
in the table, both 10-minute warning groups
changed less than the corresponding no-warning
groups. The overall effect of warning is significant
(F = 4.88), although due to extremely
large variances the separate differences within 
content and personality orientation conditions 
are not significant (F = 3.82 and 1.23, respec- 
tively). In addition, 67% of the subjects in the 
no-warning groups changed in the advocated 
direction compared to 52% in the 10-minute 
warning groups. This difference is significant 
(χ² = 5.63, p < .02).
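
As a check on the arithmetic, each F ratio in Table 2 should equal the effect mean square divided by the error mean square. The sketch below assumes the scanned table is read as mean squares of 503, 201, and 98 for Warning, Orientation, and their interaction, with an error mean square of 102.88; that reading of a damaged table is an assumption, and the variable names are illustrative.

```python
def f_ratio(ms_effect, ms_error):
    """F ratio for a fixed effect: effect mean square over error mean square."""
    if ms_error <= 0:
        raise ValueError("error mean square must be positive")
    return ms_effect / ms_error


# Values as read from Table 2 (assumed reading of the scan).
MS_ERROR = 102.88
mean_squares = {
    "Warning (A)": 503.0,
    "Orientation (B)": 201.0,
    "A x B": 98.0,
}

f_values = {source: f_ratio(ms, MS_ERROR) for source, ms in mean_squares.items()}
for source, f in f_values.items():
    print(f"{source}: F = {f:.2f}")
```

Under this reading the ratios come out near 4.89, 1.95, and 0.95, which agrees with the reported 4.88, 1.95, and <1.0 to within rounding of the printed mean squares.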

The second main hypothesis is that distracting 
subjects by telling them to pay attention to the 
personality of the speaker rather than to the 
content of the talk will reduce resistance to 
influence. The results are consistent with this 
hypothesis, but are not very strong. Both dis- 
traction (personality orientation) groups changed 
more than the corresponding no-distraction 
groups, but neither the overall difference between
means nor either of the separate differences
is significant (F = 1.95 for combined groups,
F < 1.0 for no-warning groups, and F = 2.49 for
10-minute warning). The results in terms of
number of subjects who changed in the advocated
direction are somewhat stronger. In the
content orientation conditions the figure is 54%
and in the personality condition it is 66%
(χ² = 3.34, p < .10).
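
Comparisons of the proportion of subjects who changed in the advocated direction are 2 × 2 chi-square tests. A minimal sketch of that computation follows, using the standard shortcut formula without a continuity correction; whether the authors applied a correction is not stated, and the counts in the example are hypothetical rather than taken from the study.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for the table

                 changed   did not change
    group 1         a            b
    group 2         c            d
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be positive")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Hypothetical counts: 60 of 100 subjects changed in one condition,
# 45 of 100 in the other.
chi2 = chi_square_2x2(60, 40, 45, 55)
print(round(chi2, 2))

# For 1 degree of freedom the .05 critical value is 3.84.
print(chi2 > 3.84)
```

A statistic above 3.84 is significant at the .05 level with one degree of freedom, which is why a value such as 3.34 is reported only as p < .10.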

Although these differences are in the expected 
direction, they cannot be considered as sup- 
porting the distraction hypothesis. The weakness 
of this effect, as contrasted with that reported 
by Festinger and Maccoby (1964), may be due 
to a number of factors. The most obvious pos- 
sibility is that telling subjects to pay attention 
to the personality of the speaker is not as 
strong a distraction as showing an amusing, ir- 
relevant film during a persuasive speech. It may 
also be that distraction reduces resistance to 
influence only under certain limited conditions 
involving the degree of commitment to the initial 
position, strength of the persuasive communica- 
tion, etc., and that the present situation did not 
meet these special conditions. 

Since the forewarning does appear to increase 
resistance, the question of interest becomes how 
it produces this effect. The study provides data 
on three different mechanisms which might have 
been activated by the forewarning. The first 
is increased derogation of the communicator, 
due presumably to increased suspiciousness and 
hostility on the part of the warned subjects. 
Data relevant to this is provided by responses 
to the question “How biased do you think the 
speaker was?” which was answered on an un- 
divided scale ranging from not at all biased to 
extremely biased. Table 3 presents the means for 
the five groups. As may be seen, all groups 
reject the communicator equally except the 
10-minute warning content group which has a 
somewhat higher rejection score. However, not 
even the largest difference approaches significance
(t < 1.0). Therefore, although there is a
slight suggestion that the warning resulted in 
greater rejection, it seems unlikely that this 
factor accounted for the quite sizable differences 
in degree of resistance. 

The second possibility was that warning 
caused the subject to attend less carefully to 
the talk. To check on this possibility subjects 
from Cubberley were given a recall test after 
the talk. Presumably if some subjects paid less 
attention they would recall less of the content 
of the talk. There were no appreciable differences 
among any of the groups in recall scores, 
thus making it appear that lack of attention 
was not a major cause of increased resistance. 
It might be added that it would have been
quite surprising if the group that changed less
had paid less attention. If the distraction
hypothesis is correct, as evidence from both this
and the earlier study seems to suggest, lack of
attention would, if anything, produce greater
change in the direction advocated.


TABLE 3
MEAN RATING OF COMMUNICATOR BIAS

                       Warning (a)
Orientation        0        2        10
Content          8.20     8.25     9.48
Personality      8.77              8.24

Note.—A score of 14 indicates that the speaker is rated
extremely biased.
a In minutes.

Finally, forewarned subjects might have spent 
the time between the warning and the talk 
rehearsing arguments in favor of their position, 
and perhaps refuting counterarguments. Only 
an indirect test of this notion is provided in the 
study. If increased resistance is due to this type 
of active process, a longer warning should give sub- 
jects more time to defend themselves and should 
produce greater resistance than a briefer warning.
As is shown in Row 1 of Table 1, the
results are consistent with this expectation. The
10-minute warning group changed less than the 
2-minute group which in turn changed less than 
the no-warning group. Although the linear trend 
across these conditions is fairly strong (F = 3.65, 
p<.10), the 2-minute warning group is not 
significantly different from either of the others. 
Thus, these data provide only weak support for 
the idea that an active defensive process occurs. 

A second source of relevant data is the inter- 
action of warning and orientation. It could be 
argued that the more concerned the subject 
is about the content of the talk, the greater 
effect the warning should have. The warning 
allows the subject to defend himself by re- 
hearsing arguments, but the extent to which he 
will do this depends in part on how threatened 
he feels by the anticipated talk. If he has been 
told to pay attention to the personality of the 
speaker, presumably the content becomes less 
important and he should feel less threatened. 
Therefore, the warning should have less effect in 
the personality conditions than in the content 
conditions. The data give some support to this 
analysis. The difference between 10-minute and 
no-warning groups is twice as large in the content 
condition as in the personality condition. This
difference does not approach significance, but is 
in the expected direction. Thus, although the 
present study does not provide any evidence 
which directly indicates the presence of the
active defensive process described above, two
different sources of data provide some indirect
evidence for such a process. This evidence is
admittedly very weak, and it is hoped that
subsequent studies will allow a more exact
specification of the mechanisms involved.

Experimental psychology students formed into teams of controlled composition
worked on 10 operant conditioning problems: 9 problems under routine conditions
and 1 problem under stress. Prior to formation of the teams, 8 of the
appointed leaders gave favorable ratings to their least preferred co-worker
(LPC) and 6 gave unfavorable ratings. Teams with leaders giving unfavorable
ratings to LPC were superior to other teams on performance on the stress
problem but equal on the routine problems. The 2 kinds of leaders showed
little difference in observed behavior for the stress problem as a whole, but
when the problem was analyzed according to phases, significantly different
patterns appeared. Interpersonal attraction did not differ in the 2 kinds of teams.


Group productivity can be predicted from the 
way the group leader describes the personality 
of his least preferred co-worker (LPC), accord- 
ing to a number of investigations by Fiedler 
(1958, 1962). In several natural groups, the 
leader who described the LPC in unfavorable or 
socially undesirable terms tended to have a 
more productive group. The greater production 
secured by the leader giving unfavorable ratings 
to the LPC has been explained by this leader’s 
ability to maintain the necessary social distance 
between himself and his workers (Fiedler, 1958). 
(For the remainder of this report an individual 
who gives generally favorable ratings to the least 
preferred co-worker will be called “favorable 
LPC” while the term “unfavorable LPC” will 
denote someone who rates LPC in a generally 
unfavorable direction.) The support for the 
social distance interpretation is based for the 
most part on the co-worker ratings. 

In the present research, natural groups were
studied while performing their regular task under
stress. The team composition was controlled:
some groups had a favorable LPC leader while
others had an unfavorable LPC leader. A major
objective was to provide a description of the
behavior within the two kinds of groups, and in
particular, to see if the favorable LPC leader
would respond to his group in a more positive
socioemotional manner than would the unfavor-
able LPC leader.

Productivity of the natural groups was deter- 
mined under stress and also under routine condi- 
tions. On the basis of Fiedler’s findings, one 
would expect superior performance of the un-
favorable LPC groups under stress. An additional
aim of the present investigation was to examine
interpersonal attraction expressed between group
members in order to determine whether the 
unfavorable LPC leader generates more social 
distance between himself and his followers. 


METHOD 


The independent variable was the LPC score of 
the appointed leader. Subjects rated their most pre-
ferred co-worker and their least preferred co-worker
on 24 adjective pairs, such as friendly-unfriendly, 
quitting-persistent, on 8-point scales (Fiedler, 1958).

The results of the experiment are based on the data
from 14 groups: 8 with favorable LPC leaders and
6 with unfavorable LPC leaders. The results are
based on teams with endorsed leaders only. Endorse-
ment was determined by the questionnaire given at
the end of the investigation period, and a leader was
classified as endorsed if all members agreed that they
would select him to take charge of the team on a
future problem. Two groups with favorable LPC
leaders and four with unfavorable LPC leaders did
not meet the criterion of endorsement.


Nature of Task and Group 


The natural groups under study were four-man 
sets of students from the undergraduate experimental 
psychology laboratory course. As part of the course 
work, each team of students was assigned a Skinner 
box and a rat and required to perform several 
operant conditioning exercises according to the 
Homme and Klaus (1961) manual. Such laboratory
teams have been a regular part of the experimental
psychology instruction in recent years.

Each group had an appointed male leader who met 
the criteria of having a good academic record and 
a score at least 1 standard deviation from the mean 
on LPC. Between groups, the followers were 
equated on LPC, grades, and prior acquaintance. 

The role of the appointed leader was to assign 
tasks, supervise the work of the other students, 
lead the discussion of questions based on the operant 
conditioning problem, and write a general report 
after the problem had been completed. The leader 
acted as the channel of information between the 
laboratory instructor and the group. For each operant 
conditioning exercise, the followers performed the jobs 
of experimenter, recorder, and timer. The tasks were 
rotated each week. 

Each laboratory team carried out 10 operant con- 
ditioning problems over a 2-month period. The eighth 
assignment in the series was a laboratory test, and 
this exercise was completed under somewhat stressful 
conditions in the small-groups laboratory. The test 
was assumed to be stressful because the students did 
not have the help of their laboratory manual, the 
90 minutes allowed for the test was 15 minutes less 
than the time allowed for routine assignments, and 
furthermore this assignment had more weight than 
others in determining final grade. No group left
before the end of the allotted time.

The grades earned by each individual in
the laboratory were determined from the grades given 
his team and not from scores of assignments prepared 
by that individual. 


Measures 


Data collected included observer scoring of be- 
havior during the stress problem, the scoring of the 
laboratory reports, and a questionnaire completed by 
the students at the end of the academic term. Ob- 
servers, who did not know the composition of the 
groups or the rationale of the experiment, scored 
behavior using the 12 categories of the Bales inter-
action process analysis (Bales, 1950). Scoring was
performed for a 10-minute period followed by a 
5-minute rest period. For each 10-minute period the 
observer noted the nature of the group’s activity in 
one of three divisions: planning, running (rat in 
Skinner box), or paper work. An act was scored 
as leader-to-member, member-to-member, or member- 
to-leader. 

For each exercise, the graph, the animal behavior 
notes, and the recording sheet were scored by a key. 
The key was a check list comparing these products 
to a model; judgments of presence or absence were 
required and some judgments of neatness. 

One week following the completion of the 10 
conditioning problems, the team members were given 
questionnaires to determine endorsement of the 
leader, judgment of the roles various individuals had 
performed in the group, social distance between 
individuals in the group, and perceived group 
atmosphere. 


Rationale Given to Subjects for Manipulations 
and Measures 


All rationales and instructions given to students 
concerning the laboratory teams, the observation, and 
the questionnaires followed outlines prepared in 
advance. The observation of the laboratory teams
was explained as a means of providing practice 
for observers of group behavior. It was made to 
seem that the laboratory instructor (JS) was merely 
acquiescing to a professor’s demand, and the 
instructor dissociated himself completely from the 
observation. All evidence indicates that students
accepted the rationales. 


RESULTS 
Group Productivity 


The scores for each of the nine laboratory 
exercises are summarized in Table 1. Note that 
each score is based on the work of the followers 
and does not include the score given to the 
leader’s report. No objective scoring key was 
developed for the leader’s report; the reports 
were randomly assigned high grades. The teams
are separated into fall and winter terms to ex- 
amine a possible scoring bias for the fall teams; 
the scoring of the fall groups was performed 
by one of the authors (JS) while the winter 
groups were scored by someone who had no 
knowledge of the experiment. A close corre-
spondence is noted between fall and winter
scorings. Favorable and unfavorable LPC teams 
perform equally well on the routine assignments, 
but on the stress problem, the favorable-LPC- 
led team scores below the unfavorable; the dif- 
ference is significant for the nine winter groups 
(p < .05).¹

¹ All reported probabilities for t are based on two-
tailed tests.

TABLE 1
MEAN SCORES OF LABORATORY ASSIGNMENTS FOR FALL AND WINTER TERMS
ACCORDING TO LEADER TYPE

                          Fall term                      Winter term
Assignment        Unfavorable (a)  Favorable (b)  Unfavorable (a)  Favorable (b)
2                       80              84              84              90
3                       87              84              85              86
4                       88              87              92              88
5                       86              86              79              74
6                       84              84              79              73
7                       87              88              91              85
8 (stress)              90              80              81              66*
9                       86              84              84              87
10                      92              90              87              88

Number of groups         2               3               4               5

Note. Score is based on the percentage of possible points earned by followers on the graph, recording sheet, and animal be-
havior notes and does not include the percentage score given to the leader’s report.
a Team with unfavorable LPC leader.
b Team with favorable LPC leader.
* Difference between unfavorable and favorable for winter term, p < .05.
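
The fall and winter columns of Table 1 can be pooled by weighting each term's mean by its number of groups. The sketch below is only an illustration of that arithmetic, not an analysis reported by the authors; the names `TABLE_1`, `pooled_mean`, and `routine_mean` are hypothetical, and because the table entries are already rounded the pooled values are approximate.

```python
# Table 1 means, keyed by assignment number.
# Column order: fall unfavorable, fall favorable, winter unfavorable, winter favorable.
TABLE_1 = {
    2: (80, 84, 84, 90),
    3: (87, 84, 85, 86),
    4: (88, 87, 92, 88),
    5: (86, 86, 79, 74),
    6: (84, 84, 79, 73),
    7: (87, 88, 91, 85),
    8: (90, 80, 81, 66),  # stress problem
    9: (86, 84, 84, 87),
    10: (92, 90, 87, 88),
}
GROUPS = (2, 3, 4, 5)  # number of groups behind each column
STRESS_ASSIGNMENT = 8
COLUMNS = {"unfavorable": (0, 2), "favorable": (1, 3)}


def pooled_mean(assignment, leader_type):
    """Group-weighted mean across fall and winter for one assignment."""
    cols = COLUMNS[leader_type]
    row = TABLE_1[assignment]
    total = sum(row[c] * GROUPS[c] for c in cols)
    return total / sum(GROUPS[c] for c in cols)


def routine_mean(leader_type):
    """Average of the pooled means over the eight routine assignments."""
    routine = [a for a in TABLE_1 if a != STRESS_ASSIGNMENT]
    return sum(pooled_mean(a, leader_type) for a in routine) / len(routine)


if __name__ == "__main__":
    for leader in ("unfavorable", "favorable"):
        print(leader,
              "stress:", pooled_mean(STRESS_ASSIGNMENT, leader),
              "routine:", round(routine_mean(leader), 2))
```

Under this weighting the pooled stress-problem means are 84 for unfavorable LPC teams and 71.25 for favorable LPC teams, while the pooled routine means differ by less than one point, which matches the pattern described in the text.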


Behavior during Stress Problem 


The two kinds of leaders showed little dif- 
ference in their observed behavior for the stress 
problem as a whole. For example, mean per- 
centage of leader-to-member acts classified as 
positive socioemotional was 24% for favorable 
LPC and 21% for unfavorable LPC teams.

Significantly different patterns do appear for 
the two types of leaders when their behavior is 
analyzed for planning, running, and paper work. 


In Table 2 behavior of the two types of leaders 
and their followers is described for each of the 
three problem phases. Phase-by-phase comparison 
of the behavior of the favorable and unfavorable 
LPC leaders discloses clear differences in positive 
acts and attempted answers. During planning, 
positive reactions account for an average of 30% 
of the favorable LPC leader’s behavior as com- 
pared to 11% for the unfavorable LPC leader 
(p < .01). During the running phase the favorable
LPC leader drops to 14% and the unfavorable
LPC leader increases to 31% positive acts
(p < .01 for the difference between types of
leaders). During the paper-work phase the
favorable leader again has a higher proportion
of positive acts than the unfavorable supervisor,
but the t is only 1.3. The phase-by-phase
changes in attempted answers are inverse to the
changes in positive reactions; the percentage
of attempted answers from planning to running
increases for the favorable LPC leader and de-
creases for the unfavorable LPC supervisor.
Analysis of mean number of positive acts and
attempted answers for each phase yielded es-
sentially the same picture of leader difference as
did the analysis of mean percentages.

TABLE 2
MEAN PERCENTAGE OF ACTS IN CATEGORIES DURING EACH PROBLEM PHASE
FOR LEADERS AND THEIR FOLLOWERS

                                          Problem phase
Observer                       Planning        Running       Paper work
behavior category            L (a)  F (b)    L (a)  F (b)    L (a)  F (b)

Favorable LPC groups
  Positive socioemotional     30     28       14     --       --     --
  Attempted answers           48     39       62     --       --     --
  Questions                   15     24       11     --       --     --
  Negative socioemotional      6     --       12     --       --     --
Unfavorable LPC groups
  Positive socioemotional     11     12       31     --       --     --
  Attempted answers           71     52       52     --       --     --
  Questions                   15     25       --     --       --     --
  Negative socioemotional      3      8       10     10        5      4

Note. Percentages are based on six unfavorable LPC groups and eight favorable LPC
groups except during the planning phase, when one of the favorable LPC groups could
not be scored due to equipment failure.
a L = leader to follower.
b F = follower to leader.

When leaders are compared to their followers 
in Table 2 the follower profile compared to the 
corresponding leader profile is found to have a 
smaller percentage of acts in attempted answers 
and a larger percentage in questions. These
leader-follower profile differences in the task 
area are not related to leader type. The emo- 
tional behavior of the members is matched to 
that of the leader except during the running 
period of the favorable-LPC-led groups; the fol- 
lowers show a higher proportion of positive 
acts than their leader (p<.01) and a lower 
proportion of negative units (p <.05). 

The proportions of suggestions given by leaders 
during the problem phases are given in Table 3. 
From planning to paper work the favorable LPC 
supervisor increases his proportion of suggestions 
while the unfavorable LPC individual gives fewer 
suggestions relative to his team (p < .01).


Subjects’ Descriptions of Their Team 


The measures of interpersonal attraction failed 
to yield differences associated with LPC. On 
the Guttman scale measuring desire for social 
contact with team members, the two types of 
leaders were equal. No difference was discovered
when members of favorable and unfavorable
LPC teams were compared with regard to
attraction to their leader.

TABLE 3
PROPORTION OF SUGGESTIONS GIVEN BY LEADER BY PROBLEM PHASE

                           Problem phase
Team leader          Planning   Running   Paper work
Favorable LPC          .56        .17        .69
Unfavorable LPC        .79        .67        --

Note. Suggestions are acts recorded by observers in “gives suggestion.”
For number of groups, see Table 2.

On a “guess-who” test, the favorable LPC 
leader received an average of 2.2 votes (of 3 
possible votes) for “stimulates ideas” compared 
to .8 votes for the unfavorable LPC leader 
(p< .05). No differences were found for the 
other items: was constructive critic, attempted 
to increase cooperation, attempted to keep the 
group from getting off the subject, contributed 
new ideas. 


DISCUSSION 


Distinctly different behavior patterns emerge 
for the two types of leaders when the stress 
problem is broken into phases. The unfavorable 
LPC leader, who is initially task oriented with 
a minimum of emotional response, gives a high 
proportion of positive acts during the time when 
the team is conditioning its rat. The favorable 
LPC individual, however, plays these roles in 
reverse order. He is task oriented during the 
running period, when his followers are giving 
a high proportion of positive socioemotional acts. 
The different patterns of behavior on the stress 
problem are associated with differences in group 
production; the unfavorable LPC teams are 
found to be superior. 

One interpretation of the above trends is that 
the unfavorable LPC leader firmly and quickly 
structures the groups’ procedures during the 
planning phase and then is able to play a less 
dominating, and even jovial, role. On the other 
hand, the favorable LPC leader holds a group 
discussion during the planning period and the 
work is not clearly organized at that time. This 
leader then attempts to organize the work during 
the later phases, with only partial success. 

The failure of the favorable LPC groups is 
conjectured to come from two sources: from the 
use of a group discussion procedure which is 
workable for the routine problem solving but un- 
adaptable to the demands of the test problem 
and from the inability of the favorable LPC 
leader to control the positive socioemotional 
behavior of his followers. Perhaps the problem 
of socioemotional control was made more dif- 
ficult by the fact that during the paper work 
the group was obliged to remain together in 
the small room. It was observed that in the 
classroom, team members often moved apart to 
perform their individual paper-work chores. 
Spatial separation is a technique for gaining 
control of positive social-emotional behavior (as 
any grade school teacher knows). 

The investigation supports findings from 
Fiedler’s research on LPC. The production dif-
ferences under stress occurred as expected from
Fiedler’s (1958) studies on task groups. The find- 
ing that the favorable LPC leader was described 
by his followers as “stimulating ideas” is con- 
gruent with the results (Fiedler, 1962) that 
groups with favorable LPC leaders are better able
to produce at creative tasks than are groups with
unfavorable LPC leaders, provided interpersonal
stress is minimal.

The present investigation, however, does not 
support the notion that the unfavorable LPC 
individual is generally socially distant and un- 
friendly. The study reveals no difference in the 
total amount of positive socioemotional behavior 
between favorable and unfavorable LPC indi- 
viduals, nor was any difference found in the inter- 
personal attraction in the groups with different 
types of leaders. The investigation demonstrates
that both favorable and unfavorable LPC leaders
are friendly to group members but at different
times during the task. It is not the occurrence
of positive behavior which distinguishes the two
kinds of leaders, but the timing of this behavior. 


REFERENCES 


Bales, R. F. Interaction process analysis: A method
for the study of small groups. Cambridge, Mass.:
Addison-Wesley, 1950.

Fiedler, F. E. Leader attitudes and group effective-
ness. Urbana: Univer. Illinois Press, 1958.

Fiedler, F. E. Leader attitudes, group climate, and
group creativity. Journal of Abnormal and Social
Psychology, 1962, 65, 308-318.

Homme, L. E., & Klaus, D. J. Laboratory studies in
the analysis of behavior. Albuquerque, N. M.:
Teaching Machines, Incorporated, 1961.


(Received July 11, 1963)


Personality and Social Psychology
Vol. 1, No. 3, 270-


ASSOCIATION OF NEUTRAL OBJECTS WITH REWARDS: 
EFFECT ON VERBAL EVALUATION, REWARD EXPECTANCY, 
AND SELECTIVE ATTENTION 1 


JUM C. NUNNALLY, ALBERT J. DUCHNOWSKI, AND RONALD K. PARKER
Vanderbilt University 


The purpose of the study was to investigate the effects on 3 dependent
measures when neutral objects are associated with rewards. The dependent ...
nor loss), 30 Ss received conditioning ... dependent measures indicated that,
relative ... stimulus (a) received a higher score on positive verbal evaluation,
(b) was looked at more in a measure of selective attention, and (c) led to
a greater “expectation of reward” in a different game.


An experiment will be reported concerning the
effects of pairing neutral objects with rewards.
A hypothesis that guided this research is that,
when a subject encounters a rewarding object
(for example, candy for a child), reliable effects
occur on verbal evaluation, perception, instru-
mental responses, and autonomic activity. If
such responses are made to rewarding objects,
it may be possible to condition some of those
responses to neutral objects. If so, to the extent
to which the neutral object comes to elicit the
same types of responses as the rewarding object,
it can be said that the neutral object acquires
“conditioned reward value.” A series of experi-
ments has been undertaken to investigate the
conditioning of reward value in children (Hall,
1964; Nunnally, Stevens, & Hall, in press;
Parker, 1964; Stevens, 1964). The purpose of
the experiment discussed in this article was
to demonstrate the conditionability of verbal
evaluation, reward expectancy, and selective
attention.


1 This research was supported, in part, by Research
Grant No. RD-846p from the Vocational Rehabilita-
tion Administration.

In this series of studies, simple conditioning 
procedures were employed with children. In each 
study the conditioning apparatus was some type
of game whereby neutral stimuli (conditioned
stimuli, CSs) were paired with rewards (uncon-
ditioned stimuli, UCSs). Different games have
been slot machines, roulette wheels, and various
types of “search games.” The CSs have been 
either nonsense syllables or geometrical forms. 
The UCSs in various studies have been candy, 
money, and social approval. 

In each of the studies in this series some 
measure of verbal evaluation has been employed 
to index the extent to which an originally 
neutral stimulus became positive after a series 
of conditioning sessions. Prior to this experi- 
ment, the measure has been a modified semantic 
differential, in which the subject was asked to 
assign positive and negative adjectives to stimuli 
(see Nunnally et al., in press, for typical
measurement methods and results). For this
experiment a more subtle measure of verbal
evaluation was developed.

The use of a measure of “reward expectancy” 
is based on the hypothesis that, after a neutral 
stimulus has achieved cue value for rewards in 
one situation, it will serve as a cue for rewards 
in a different situation. 

In employing a dependent measure concerning 
selective attention it was assumed that, when 
given a choice, subjects tend to look at reward- 
ing objects more frequently than at neutral ob- 
jects. The effort in this study was to determine 
whether or not this selective attention is condi- 
tioned when neutral objects are associated with 
rewards. This is relevant to the controversy as 
to whether or not in studies of secondary re- 
inforcement the CS acquires reward value or 
only serves as a cue for the receiving of rewards. 

Prior to performing the experiment to be 
described, a pilot study was conducted on 16 
children. The only major difference in the two
studies was in the spacing of conditioning ses-
sions. In the pilot study the subjects were seen
only once a week; in the major study the
subjects were seen twice a week. Because the
results for all three dependent measures were 
very clear in the pilot study and in the major 
study, only the major study will be described. 


METHOD
Subjects 


The subjects were 12 boys and 18 girls from the 
second grade of a Catholic elementary school in 
Nashville, Tennessee. The school mainly serves neigh-
borhoods that would best be described as lower-
middle class.


Apparatus 


Stimuli. The stimuli were three nonsense syllables:
OJ; mvv, and yg. Each was determined by Archer
(1960) to have a similar low level of meaningful-
ness. In order to control for any inherent prefer-
ences, the stimuli were counterbalanced in their
roles as payoff, neutral, or punishing stimulus, but 
they remained constant in their role for each subject. 
Five subjects were randomly assigned to each of 
the six possible permutations of the three nonsense 
syllables. To subjects, nonsense syllables were 
referred to as “make-believe words.” 
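
The counterbalancing described above (five subjects randomly assigned to each of the six orderings of three syllables across the payoff, neutral, and punishing roles) can be sketched as follows. This is a minimal illustration and not the authors' procedure: the function name `counterbalance`, the placeholder syllable labels `S1`, `S2`, `S3`, and the seeded shuffle are all assumptions.

```python
import random
from itertools import permutations

ROLES = ("payoff", "neutral", "punishing")


def counterbalance(subject_ids, syllables, per_order=5, seed=0):
    """Randomly assign subjects to every ordering of syllables over roles.

    Each ordering of the three syllables receives exactly `per_order`
    subjects, and each subject keeps the same syllable-to-role mapping
    throughout.
    """
    orders = list(permutations(syllables))  # 3! = 6 orderings
    if len(subject_ids) != per_order * len(orders):
        raise ValueError("subject count must equal per_order x number of orderings")

    rng = random.Random(seed)  # seeded only so the sketch is reproducible
    shuffled = list(subject_ids)
    rng.shuffle(shuffled)

    assignment = {}
    for index, subject in enumerate(shuffled):
        order = orders[index // per_order]
        assignment[subject] = dict(zip(ROLES, order))
    return assignment


if __name__ == "__main__":
    plan = counterbalance(list(range(1, 31)), ("S1", "S2", "S3"))
    for subject in sorted(plan)[:3]:
        print(subject, plan[subject])
```

With 30 subjects and three syllables, each syllable serves as the payoff stimulus for 10 subjects, as the neutral stimulus for 10, and as the punishing stimulus for 10, so any inherent preference for a particular syllable is balanced across roles.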

Spin wheel. The conditioning apparatus consisted 
of a circular board 28 inches in diameter with a 
circle of 18 upright pins 4 inches from the outside. 
An 18 X 1 inch diameter aluminum tube with a 18
inch flexible pointer attached to one end was affixed 
to a pivot in the center of the board so that it 
could be spun easily by hand. In the 18 equal spaces 
between the pins, cards bearing one of three nonsense 
syllables were placed in such a manner that each 
particular nonsense syllable appeared in every third 
space. 


Dependent Measures 


Guess-who game. This verbal evaluation measure 
was a variation of the “guess-who” technique of 
Hartshorne and May (1929). Fifteen questions, 5 
positive, 5 neutral, and 5 negative were read in
random order. The 15 questions were as follows:
Positive: Who has the most friends? Who would
you like to sit next to in school? Who do you like
the best? Who helps Mom and Dad around the
house? Who would you like to play with? Negative:
Who is mean to animals? Who steals pennies from
other children? Who don’t you like the most? Who
tells lies? Who always fights and picks on smaller
children? Neutral: Who walked into the yard? Who
has a blue hat? Who walked down the street? Who
lives in a green house? Who walked into the school
yard? The subject answered to whom he thought
each question applied by indicating one of three
stick figures placed before him. These stick figures,
all identical, were 3 inches high, drawn with black
ink on separate pieces of white poster board 5 X 8
inches. The faces were blank. Beneath each figure,
one of the nonsense syllables appeared. Each syl-
lable was printed in letters 1.5 inches high with
black ink. Care was taken to make the cards 
identical, the only difference being the nonsense 
syllable. The nonsense syllable in this case was 
referred to as the stick figure’s name. 

Looking box. This device was enclosed in a 
36 X 25 X 16 inch blue metal box. The side presented 
to the subject had six 3 X 4 inch translucent plastic 
windows in two parallel rows of three windows 
each. Beneath each window there was a button the
subject could press to light a bulb behind the 
window and make visible a stimulus projected onto 
the face of the window. The stimuli were placed 
on slides in a metal frame inside the box. New
frames containing different arrangements of the
stimuli were easily inserted from the side of the box. 
A stimulus could be seen only when the bulb was 
lighted. The bulb stayed lit for 4 second after the 
subject pressed the button and then automatically
cut off. If the subject wished to view the stimulus
longer he had to release the button and press it 
again. An additional apparatus connected electrically 
but out of the subject’s sight recorded the number 
of times each button was pressed and therefore gave 
a record of how long each stimulus was viewed. 
Each of the three nonsense syllables appeared in two 
of the six windows. Two random orders of the 36 
possible combinations were used to reduce location 
effects. 

Treasure hunt. There were 21 identical white boxes 
2 X 3 X 1 inch. On the top of each a nonsense syl-
lable was printed. Each of three nonsense syllables
appeared on seven different boxes. The subject was 
given 14 selections to find one box containing a 
quarter ($.25). 


Procedure 


The first day of contact with the subjects was 
used to establish rapport and to acquaint them 
with the apparatus and the tasks they were to 
perform. They were shown the three cards which, 
respectively, contained the three nonsense syllables 
printed below three identical stick figures. They 
were told that each nonsense syllable was the name 
of a boy. Subsequently, three other stick figures with 
three other nonsense syllables (RUX, KEP, DAK) were 
used to give the subjects practice in employing the 
measure of verbal evaluation. One face was smiling, 
the second frowning, and the third had a bland 
expression. For each of three negative, three neutral, 
and three positive questions, the subject was asked 
to point to one of the figures. None of the questions 
in this familiarization task was the same as those 
subsequently employed to measure the effects of 
conditioning. The subject was introduced to the 
spin wheel by having him make three spins, no 
pennies being given.

On conditioning days the subject was escorted 
from his classroom and led to a small room where 
the spin wheel was kept. The room was not specifi- 
cally prepared for the experiment. He was told that 
he could spin the wheel 30 times. If the pointer 
stopped on one “make-believe word” he could take 
2 pennies from the pile; on another word nothing 
would happen; on the third word he must return 
1 penny to the pile of pennies on the table. He was 
told that he could keep all the pennies he had won 
after 30 spins. Each subject was given 5 pennies to
start him off. This was necessary to prevent the
subject from “going broke” during the early trials.
The expected reward on each day was $.15, the 
actual amount varying in terms of the binomial 
distribution. After playing the game the subject 
returned to class. 
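
The stated expected reward of $.15 per session can be checked with a short calculation. The sketch below rests on two assumptions that the text implies but does not state outright: the pointer is equally likely to stop on each of the 18 spaces (so each syllable has probability 6/18), and the 5 starter pennies count toward the amount kept. It also ignores the possibility of a subject running out of pennies. All names are hypothetical.

```python
from fractions import Fraction

SPINS = 30
STARTER_PENNIES = 5
SPACES_PER_SYLLABLE = 6   # each syllable fills every third of the 18 spaces
TOTAL_SPACES = 18
# Pennies gained or lost when the pointer stops on each kind of stimulus.
PAYOFF_PENNIES = {"payoff": 2, "neutral": 0, "punishing": -1}


def per_spin_mean():
    """Expected pennies won on a single spin."""
    p = Fraction(SPACES_PER_SYLLABLE, TOTAL_SPACES)
    return sum(p * value for value in PAYOFF_PENNIES.values())


def per_spin_variance():
    """Variance of the pennies won on a single spin."""
    p = Fraction(SPACES_PER_SYLLABLE, TOTAL_SPACES)
    mean = per_spin_mean()
    return sum(p * (value - mean) ** 2 for value in PAYOFF_PENNIES.values())


def expected_session_total():
    """Expected pennies held after one session, starter pennies included."""
    return STARTER_PENNIES + SPINS * per_spin_mean()


def session_variance():
    """Variance of the session total, treating spins as independent."""
    return SPINS * per_spin_variance()


if __name__ == "__main__":
    print("expected pennies:", expected_session_total())
    print("standard deviation:", float(session_variance()) ** 0.5)
```

Each spin is worth one third of a penny on average, so 30 spins yield 10 pennies, and the 5 starter pennies bring the expected total to 15, in agreement with the figure given in the text.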

On days when dependent measures were taken,
the subject was told he would “play different games 
today.” For the verbal task the subject was shown 
the three stick figures. The experimenter said, 


I am going to ask you some questions; you tell 
me which one of these little boys you think would 
be the answer.

For the looking box, the experimenter said,


This is called a looking box. If you press a button,
a light will go on and you will be able to see
something. The light will only stay on for a short
time and then go out. If you want to look at
something longer, you will have to keep pressing
the button.


After looking for 2 minutes, the subject was given
a rest while a new randomized pattern of slides
was placed in the box. The subject was allowed to
look for 2 more minutes.

For the treasure hunt, 21 boxes (7 for each 
stimulus) were placed on a table. For each subject, 
positions of boxes were shuffled to achieve a partial 
randomness of locations. The subject was told that 
a quarter was hidden in one. If he found it he 
could keep it, but he could only look in 14 of the 
boxes. Actually, in order to obtain an equal number 
of responses from the subjects, none of the boxes 
contained a quarter. After the subject failed to find 
the quarter, the experimenter gave him one. This 
game was the last activity in the experiment. The 
experimenter made sure no child told any of those 
who had not yet played the game what happened. 
The experimenter recorded the order in which the 
boxes were selected. 

Each subject was seen on nine occasions, twice 
each week for 4 weeks and once during the fifth 
week. Half the subjects were on a Monday- 
Wednesday schedule, and the other half were on a 
Tuesday-Thursday schedule. The first session was 
used to familiarize the subjects with the two experi- 
menters and with the experimental materials. This 
was followed by three sessions of conditioning with 
the spin wheel. On Day 5, measurement was made 
of verbal evaluation (“guess-who game”) and selec- 
tive attention (“looking box”). Three more condi- 
tioning sessions were held on Days 6-8. On Day 9 
all three dependent measures were applied (the 
above two plus the “treasure hunt”). Because of 
absences, some of the subjects got off schedule by 
1 or more days.


RESULTS 
Verbal Evaluation 


After three sessions of conditioning, the mean
number of positive questions attributed to the
positive stimulus was 4.77 out of a possible 5.
A mean of .23 positive questions was attributed
to the neutral stimulus. In no instance did a
subject attribute a positive question to the negative
stimulus. For the neutral questions the
means were 1.02 for the positive stimulus, 3.2?
for the neutral, and .68 for the negative stimulus.
For negative questions a mean of 4.33 was
obtained for the negative stimulus and .67 for
the neutral. In no case did a subject attribute
a negative question to a positive stimulus. No
significant differences were found between the
first and second administrations of the verbal
measure.


Looking Box


For the “looking box” three comparisons were 
made. After three conditioning sessions the sub- 
jects looked longer at their positive stimulus 
as compared to their negative; longer at their 
positive as compared with their neutral; and 
longer at their neutral as compared with their 
negative. A sign test yielded a z greater than 4 
for all three comparisons (p < .001). No sig- 
nificant differences were found between the first 
and second administrations. 
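
As an illustration of how a sign test can produce z values of this size, the following minimal sketch computes the continuity-corrected normal approximation to the sign test. The counts are hypothetical (this section does not report how many subjects favored each stimulus), and the function name `sign_test_z` is introduced here only for illustration.

```python
import math


def sign_test_z(n_plus, n_minus):
    """Normal approximation to the two-sided sign test.

    n_plus and n_minus are the numbers of subjects whose difference
    favored each direction; tied pairs are assumed to be dropped already.
    """
    n = n_plus + n_minus
    if n == 0:
        raise ValueError("need at least one non-tied pair")
    larger = max(n_plus, n_minus)
    # Continuity-corrected deviation of the larger count from chance (n / 2),
    # divided by the binomial standard deviation sqrt(n) / 2.
    z = max(0.0, (larger - 0.5 - n / 2) / (0.5 * math.sqrt(n)))
    # Two-sided p value from the standard normal tail.
    p = math.erfc(z / math.sqrt(2))
    return z, p


# Hypothetical counts: 40 subjects looked longer at the positive stimulus,
# 5 looked longer at the negative stimulus.
z, p = sign_test_z(40, 5)
print(round(z, 2), p < 0.001)
```

With a split as lopsided as the hypothetical one above, z exceeds 4, which is the order of magnitude reported for all three comparisons.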


Treasure Hunt 


Over the 14 choices, the subjects had a mean
of 6.04 for their positive stimulus, 3.87 for
neutral, and 4.09 for negative. A sign test
showed that the positive-negative and
positive-neutral differences were significant
(p < .01, z = 2.9 in both cases).
The neutral-negative comparison was not
significant.

In the first seven choices, a mean of 3.60
positive boxes was chosen, 1.43 neutrals, and
1.97 negatives. The positive-neutral and positive-negative
differences were both significant (p
< .001, z = 5.00 and 4.43, respectively). The
neutral-negative difference was not significant.
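
For reference, the chance baseline against which these means can be read is easy to work out. The sketch below assumes that a subject choosing purely at random opens boxes without replacement, so the expected number of picks per stimulus is the mean of a hypergeometric distribution. The function name is hypothetical; the observed means are those reported above.

```python
from fractions import Fraction


def chance_expected_picks(n_boxes_total, n_boxes_per_stimulus, n_picks):
    """Expected picks from one stimulus category under random choice
    without replacement (hypergeometric mean)."""
    return Fraction(n_picks * n_boxes_per_stimulus, n_boxes_total)


# 21 boxes, 7 per stimulus; 14 boxes opened in all, and the first 7 analyzed separately.
chance_all_14 = chance_expected_picks(21, 7, 14)
chance_first_7 = chance_expected_picks(21, 7, 7)

# Observed means reported in the text.
observed_14 = {"positive": 6.04, "neutral": 3.87, "negative": 4.09}
observed_7 = {"positive": 3.60, "neutral": 1.43, "negative": 1.97}

for label in observed_14:
    print(
        label,
        round(observed_14[label] - float(chance_all_14), 2),
        round(observed_7[label] - float(chance_first_7), 2),
    )
```

Under these assumptions chance is 14/3 (about 4.67) boxes per stimulus over all 14 choices and 7/3 (about 2.33) over the first seven, so only the positive stimulus was chosen more often than chance in both analyses.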


DISCUSSION


All three measures were strongly influenced by 
the experimental treatments. The new measure 
of verbal evaluation logically is more subtle 
than the adjective check list that was used in 
previous studies, yet the new measure was very 
sensitive to the treatment effects in this study. 
The results obtained in the “treasure hunt” lend 
support to the contention that the experimental 
procedures lead to a general “expectation of 
reward” from stimuli that have been conditioned 
to reward value. 

Results from the looking box suggest that 
pairing of a neutral object with a reward subse- 
quently makes the subject “want” to look at 
the formerly neutral object. Since the subjects 
were told that no pennies would be given 
regardless of which buttons were pushed, choices 
should not have been influenced by that factor. 
Even if there had been some expectation of 
receiving pennies for pushing particular buttons, 
experience in the first testing (after three
conditioning sessions) should have reduced the
marked preference for the payoff stimulus on 
the final testing session, which was not the case. 
Further evidence for the conditionability of
selective attention comes from three studies of
the effects on eye movements of pairing neutral
objects with rewards (Nunnally et al., in press).
After conditioning, there were more fixations on
rewarded stimuli than on neutral or punished 
stimuli. 

For want of a better word, the treatments in 
this series of studies have been referred to as 
“conditioning”; but conditioning in these studies 
may differ importantly from that in studies of 
classical conditioning. In classical conditioning 
supposedly what occurs is the linkage of a 
discrete stimulus to a discrete response (for 
example, salivation to a tone), although that 
view has been challenged on numerous occasions. 
In our studies, a stimulus-stimulus association 
is formed, for example, the sight of a nonsense 
syllable with the sight of pennies. Some of the 
types of overt responses that are affected by 
the association do not occur at all in the condi- 
tioning situation. For example, in the condition- 
ing situation children speak very little, and only 
rarely do they say “nice” things about the UCS, 
such as “I like pennies.” Yet later, when called 
on to make choices between pleasant and un- 
pleasant adjectives to describe the CS, the 
subjects predominately chose the former. Also, 
it is hard to see how a stimulus-response condi- 
tioning model could explain the results found for 
reward expectancy and selective attention. Such 
effects might be understandable in the context 
of “mediation theory,” as it has been presented 
by Osgood (1953) and others. To account for 
the conditioning effects, a three-stage process is 
hypothesized in which: before the conditioning 
sessions a rewarding stimulus typically elicits 
a variety of responses; during conditioning an
association is formed between a neutral stimulus 
and a rewarding stimulus; and in the post- 
conditioning test phase, fractional components of 
some of the responses typically elicited by the 
rewarding stimulus are now elicited by the 
neutral stimulus, such responses being mediated 
by the association. 
EFFECTS OF ANCHORING ON WEIGHT JUDGMENTS OF
NORMALS AND SCHIZOPHRENICS * 


SIEGFRIED A. WURSTER ? 


University of Houston 


2 anchor techniques were utilized to differentiate the weight judgment processes
of 30 normal white males and 30 white male patients with a single diagnosis
of chronic schizophrenia. It was predicted that, compared to a group of
normals, schizophrenics’ weight judgments would: (a) show more disruptive
anchor effects when an extra-series (unjudged) heavy anchor was paired with
each standard series stimulus; (b) show less disruptive anchor effects and
hence greater judgment “accuracy” when an irrelevant anchor was interjected
in the standard stimulus series; and (c) show a more rapid rate of decay of
residual anchor effects, regardless of the anchoring condition. All predictions
were significantly verified with the exception of a more rapid rate of decay
of residual anchor effects for schizophrenics, using the extra-series, heavy
anchor. The results were discussed in terms of the schizophrenic’s extreme
passivity to contextual stimuli and his “ahistoric frame of reference.” It was
also suggested that the results of the intraseries anchor effect in schizophrenic
Ss might be interpreted as evidence for a lack of internal consistency of their
conceptual frame of reference.


Our success in dealing with the world around 
us is predicated upon the accurate interpretation 
of the information afforded us by our various 
sensory receptor systems. We are in the continual 
process of making judgments involving time, 
space, and mass in terms of the standard physical 
units ascribed to them. Gross disruptions in the 
ability to make veridical judgments related to 
these physical dimensions have long been cited 
as clinical evidence of impaired contact with
reality and of mental illness. Schizophrenia is
generally considered to involve the impairment 
of normal conceptual and judgment processes. 
Experimental approaches utilizing psychophysical 
techniques and requiring judgments involving 
conceptual standards have provided an empirical 
basis for differentiating between normals and 
individuals with a schizophrenic disorder (Board- 
man, Goldstone, Reiner, & Himmel, 1964; Dunn, 
1954; Rosenzweig, 1956; Salzinger, 1957). 
Individuals have a tendency to categorize their
experiences in terms of some kind of dimensional
or quantitative order (Helson, 1947). Eriksen
and Hake (1957) have offered the “subjective
standard hypothesis” to define, operationally, this
categorization process, stating that judgments are
always made relative to a standard or reference
level that is subjectively present. Thus, an absolute
judgment task is transformed into a relative
judgment procedure, using the recalled value of
the standard as a comparison stimulus. One of
the characteristic features of data obtained by
methods of absolute or relative judgment is the
prominent anchor effects obtained by use of
stimuli on the ends of the stimulus continuum.
Investigators have also applied the term
“anchor” to those stimuli added within the range
of the experimental stimuli, and it has even been
suggested that every stimulus serves the
function of an anchor, whether intended as such
or not (Guilford, 1954, p. 312).
The effects of anchor stimuli on judgment
processes of psychiatric patients have been
studied by several investigators. Goldstone,
Lhamon, and Boardman (1957), and Weinstein,
Goldstone, and Boardman (1959) studied the
[...] concept of an inch unit of linear measure were
investigated by Boardman, Goldstone, Reiner,
and Fathauer (1962) and were not found to be
different between normals and schizophrenics
under standard conditions. However, anchoring in
that particular study did differentiate the two
groups, with the schizophrenics showing the
greater anchor effects. A recent study of anchoring
in the weight judgments of normals and
schizophrenics was conducted by Salzinger
(1957). Significantly greater alteration of judgments
due to the influence of a heavy, unjudged,
paired anchor was found for the patient group.
These studies all attested to the schizophrenic’s
greater anchor vulnerability, i.e., greater responsiveness
to contextual stimuli, as compared to
normals. The findings have been explained in part
as a function of the patients’ “ahistoric frame
of reference” (Weinstein et al., 1959) which
minimizes the effects of past experience in the
judgment process. One of the general inferences
arising from these anchoring studies is that the
stability of schizophrenics’ judgments, as compared
with normals, appears grossly and consistently
impaired when they are confronted with
specific, examiner-manipulated anchor stimuli.
However, these studies were all conducted in
such a way as to force a shift in all subjects’
frame of reference, and the relatively greater
shift observed in schizophrenic subjects’ judgments,
in accordance with their ahistoric frame
of reference, was assumed to represent a basic
[...] increased vulnerability to anchor [...]

If the ahistoric nature of the schizophrenics’
frame of reference is in fact the key variable in
producing these differences, it should be possible
to specify a condition under which this variable 
would act to minimize anchor effects and allow 
schizophrenics to maintain a more stable frame 
of reference than would normals. This condition 
would be one in which the range of the judged 
stimulus series is extended by the addition of a 
stimulus of such a magnitude as to be an 
extension of the original series. Classically, this
produces a shift in the subject’s frame of refer- 
ence to incorporate the new stimulus and results 
in a modification of the relative judgments of 
the entire series. This shift depends upon the 
new stimulus having an effect over time in 
accordance with Helson’s concept of pooling. 
This process, having an important temporal 
factor, should be extremely limited in the schizo- 
phrenic, and the effects of immediate anchor 
stimuli which depend on pooling to produce a 
frame of reference shift should be less than 
for normals. Normals, who should be able to
utilize the pooling of past experience to a far 
greater extent than schizophrenics in making 
judgments, should show greater anchor vulnerability,
manifested by greater judgment distortions
and consequent shifting of adaptation level.
As a consequence it could be predicted that 
schizophrenics would exhibit a greater stability 
or rigidity of judgment under this type of anchor- 
ing condition. Also, because of the greater effect 
of past experience on immediate performance for 
normal subjects, the rate of decay of any residual 
anchor effects could be expected to be slower 
than for schizophrenics, regardless of the type 
of anchor condition. 

In the present study, the effect of the size- 
weight illusion was used to provide an intraseries 
anchor. In the size-weight illusion, a larger
standard feels lighter and may be said to function 
as a light anchor (Furth, 1960). Conversely, a 
smaller standard feels heavier and acts as a 
heavy anchor. “The anchor affects the scale to
the degree of its illusory or perceived weight,
rather than by its physical weight [Brown, 1953].”


METHOD 


Five identical-appearing weights were used, fash- 
ioned out of 8-ounce polyethylene bottles, weighted 
with BB shot, lined with a paper cuff to produce 
a uniform appearance, and weighing 100, 150, 200, 
250, and 300 grams and referred to as 1, 2, 3, 4, 
and 5, respectively. A weight of 250 grams was the 
intraseries anchor, and differed from the original
series weights only in its smaller size, being a
2-ounce bottle. This anchor stimulus is always judged
heavier than either its series-sized counterpart or the
300-gram weight. An 800-gram anchor for paired
presentation was similarly fashioned from an 8-ounce
polyethylene bottle. Presentation of the stimuli was 
done in the standard manner on a screened turntable. 
The experimental room was kept dark, with the 
exception of a small 7.5-watt light behind a shield. 

Subjects were volunteers selected from among the 
inmate populations of the Wynne Farm and the 
Wynne Treatment Center of the Texas Department 
of Corrections. All were of average intelligence,
between 18 and 50 years of age, and without sus- 
pected or demonstrated organic pathology. Group I 
consisted of 30 normal, white males without a 
history of mental illness. Group II was composed
of 30 white male patients with a single diagnosis
of chronic schizophrenia.

Weight judgments were made according to a
9-point scale: 1, extra light; 2, very light; 3, light;
4, medium light; 5, medium; 6, medium heavy;
7, heavy; 8, very heavy; and 9, extra heavy. After
the initial familiarization and practice, each subject
was presented with six runs of a weight series
(Condition I). A run consisted of 25 judgments of
the individual, randomized, series weights. The first
three runs served as controls, the fourth constituted
the anchor run, while the fifth and sixth runs comprised
the postanchor controls. After a brief rest
period, a second series of six runs comprised experimental
Condition II. In Condition I, the Standard
Series Weight 4 (250 grams) was replaced by the
smaller “illusory” anchor stimulus (250 grams)
during the fourth run. In Condition II, the 800-gram
weight was used as a paired unjudged anchor
in the fourth run.

Fig. 1. Comparison of anchor run with the preanchor
control run for normal and schizophrenic
groups (Condition I).


RESULTS 


The numerical judgments obtained from each 
subject for each of the series stimuli were totaled 
for the run and then averaged to obtain the 
subject’s mean scale judgment for each series 
weight. The Mann-Whitney U test and the 
Wilcoxon matched-pairs signed-ranks test (Siegel,
1956) were used to test intergroup and intragroup
differences, respectively.
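
The scoring step described above can be sketched as follows. This is only an illustration under stated assumptions: each run is taken to contain 5 judgments of each of the 5 series weights (the text gives only the total of 25), all ratings shown are hypothetical, and the helper names are introduced here. The second function computes the Mann-Whitney U statistic directly from its definition; the Wilcoxon matched-pairs test used for intragroup comparisons is not shown.

```python
from collections import defaultdict


def mean_scale_judgments(run):
    """run: list of (series_weight, rating) pairs from one run.
    Returns {series_weight: mean rating on the 9-point scale}."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for weight, rating in run:
        totals[weight] += rating
        counts[weight] += 1
    return {w: totals[w] / counts[w] for w in totals}


def mann_whitney_u(group_a, group_b):
    """Smaller of the two U statistics; tied pairs count one half."""
    u_a = 0.0
    for a in group_a:
        for b in group_b:
            if a > b:
                u_a += 1.0
            elif a == b:
                u_a += 0.5
    u_b = len(group_a) * len(group_b) - u_a
    return min(u_a, u_b)


# Hypothetical run: each of the five series weights judged five times.
run = [(1, 2), (2, 4), (3, 5), (4, 7), (5, 8)] * 5
means = mean_scale_judgments(run)

# Hypothetical per-subject mean judgments of Series Weight 1 in each group.
normals = [2.0, 2.2, 1.8, 2.4]
schizophrenics = [1.6, 1.4, 1.8, 1.5]
print(means[1], mann_whitney_u(normals, schizophrenics))
```

A small U relative to the product of the two group sizes indicates little overlap between the groups; the significance level would then be read from tables such as those in Siegel (1956).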

In Condition I the substitution of a subjec- 
tively heavier stimulus in the judged series pro- 
duced shifts in the judgments of both subject 
groups. The new weight was assigned the same 
position on the scale by both groups. The scale 
value assigned to Weight 5 by both groups di- 
minished to an approximately equal degree, and 
the value assigned to Weight 3 remained un- 
changed. However, the scale values of Series 
Weights 1 and 2 revealed significant differences
(p < .01) between the two groups. The lower
end of the normals’ subjective scale shifted
upward, producing a contraction of the overall
range of their judgments. The results are depicted
graphically in Figure 1.

By comparing the subjects’ preanchor control
run with each of the subsequent runs under
experimental Condition I, it was found that
the normals manifested a significant shifting
(p < .01) of their judgments for Series Weights
1, 2, and 5, whereas schizophrenics showed a
significant (p < .01) judgment shift only on
Series Weight 5. These data are presented in
Table 1. The data also indicate a more rapid
decay of residual anchor effects for the schizophrenic
group than for normals, when each of the
postanchor control runs was compared with the
preanchor control run (Table 1).

Statistical comparison of mean scale values for
the two groups in experimental Condition II revealed
significant differences (p < .01) between
the judgments of the two groups under the
paired, unjudged, heavy anchor condition for
every stimulus. With this anchoring condition,
schizophrenics revealed significantly greater anchor
vulnerability through greater shifting of
their subjective judgments toward the lower end
of the scale than did normals. The prediction
that with a paired unjudged anchor, schizophrenic
subjects would show a greater shift in their
weight judgments than normals, as previously
found by Salzinger (1957), was confirmed. The
mean scale values for Condition II are presented
in Table 2.

Significantly more rapid decay of residual
anchor effects for the schizophrenic group was
not found under this anchoring condition.


TABLE 1

INTRASUBJECT COMPARISON OF CONDITION I PREANCHOR JUDGMENTS OF EACH WEIGHT
WITH ANCHORED AS WELL AS POSTANCHOR JUDGMENTS BY MEANS OF WILCOXON
MATCHED-PAIRS SIGNED-RANKS TEST

                                              Mean judgments, by weight
Condition                      1            2            3            4            5
                               100 grams    150 grams    200 grams    250 grams    300 grams

Normals
  Preanchor control run (3)    1.79         3.58         5.29         6.92         7.93
  Anchor run (4)               2.06*        3.84**       5.43         8.48**       7.54**
  Postanchor control run (5)   [illegible]  3.95*        5.46         6.91         7.87
  Postanchor control run (6)   2.33*        4.15**       5.71*        7.18**       7.94
Schizophrenics
  Preanchor control run (3)    1.62         3.42         5.25         7.02         8.03
  Anchor run (4)               1.61         3.30         5.19         8.47**       7.69**
  Postanchor control run (5)   1.75         3.54         [illegible]  7.12         7.98
  Postanchor control run (6)   1.75         3.63*        5.37         7.30*        8.22

Note. The level of significance for the anchor run (4) under each experimental condition was obtained for a one-tailed test.
All other runs utilized a two-tailed test.
** p < .01.


TABLE 2

INTRASUBJECT COMPARISON OF CONDITION II PREANCHOR JUDGMENTS OF EACH WEIGHT
WITH ANCHORED AS WELL AS POSTANCHOR JUDGMENTS BY MEANS OF WILCOXON
MATCHED-PAIRS SIGNED-RANKS TEST

                                              Mean judgments, by weight
Condition                      1            2            3            4            5
                               100 grams    150 grams    200 grams    250 grams    300 grams

Normals
  Preanchor control run (3)    [illegible]  4.20         5.73         [illegible]  [illegible]
  Anchor run (4)               [illegible]  3.10*        [illegible]  5.46**       5.94
  Postanchor control run (5)   2.18         3.80**       5.25         6.52**       [illegible]
  Postanchor control run (6)   2.49         4.10         5.59         6.80         7.58
Schizophrenics
  Preanchor control run (3)    [illegible; legible fragments: 7.28, 3.17]
  Anchor run (4)               [illegible]  2.36**       3.68**       4.85**       5.32**
  Postanchor control run (5)   1.64         3.22**       4.97**       6.51**       7.48**
  Postanchor control run (6)   1.80         3.57         5.32         7.06         7.16

Note. The level of significance for the anchor run (4) under each experimental condition was obtained for a one-tailed test.
All other runs utilized a two-tailed test.


DISCUSSION

By comparing relative weight judgments obtained
under two distinctly different anchoring
procedures, the hypothesis that, under a condition
of anchoring which depended on stimulus
pooling over time for its effect, schizophrenics
would perform with greater judgment stability
and show less anchor effects than normals was
confirmed. In contrast, under the condition of 
a paired, unjudged heavy anchor, the schizophrenic’s
excessive vulnerability to contextual
stimuli caused him to be unable to ward off the
effects of the immediate anchor stimulus. The
results are consistent with the concept of the
schizophrenic ahistoric frame of reference which 
implies a deficiency in the ability to “pool” past 
experience with immediate stimulus experience in 
judgment situations. It also serves to explain the 
significantly more rapid rate of decay of residual 
anchor effects for schizophrenics than for normals 
under experimental Condition I. A comparison 
of preanchor control runs of experimental Con- 
dition II with those of Condition I likewise 
revealed residual effects of the weight judging 
experience of experimental Condition I, “pooling” 
with the contemporary judgment task of 
Condition II, these effects being significantly 
more pronounced with normal subjects than with 
the schizophrenic group. 

Where a judged intraseries anchor stimulus is 
introduced, the subject’s experience of the stimu- 
lus series is changed only in relation to a single, 
temporally distant stimulus which must be in- 
corporated into his frame of reference. Under 
this anchoring condition, both normal and schizo- 
phrenic subjects reacted similarly to the effect 
of the anchor stimulus, assigning it to a new 
point on the subjective scale. The changes in the
normal subjects’ judgments at other points on
the subjective scale are evidence of the “pooling”
effects of past experience, with the anchor stimulus
affecting the subjects’ entire frame of reference.
The schizophrenics’ ahistoric frame of
reference served to nullify the effects of the
intermittently presented anchor stimulus and
thus they were able to perform with greater
judgment rigidity between anchor presentations.

Considering the subjective scale as a repre- 
sentation of the individual’s internal frame of 
reference, these intraseries anchor effects suggest 
an experimental analog of the impairment in 
conceptualization that frequently marks the per- 
formance of schizophrenic subjects. For normals, 
the shift of adaptation level and the subjective 
scale, due to an intraseries anchor stimulus, 
may be interpreted as being a measure of internal 
consistency, which maintains the intactness of 
their frame of reference. Schizophrenic subjects 
did not evidence this shifting of the subjective 
scale, which resulted in greater rigidity in their 
judgments, and which may also be interpreted as 
evidence of a lack of internal consistency in the
conceptual processes of these individuals, since
one portion or aspect of their frame of reference
was significantly altered without affecting the
[...]ness of the schizophrenics’ mental processes may
[...] judgments.
RACE AND BELIEF:
AN OPEN AND SHUT CASE ¹


DAVID D. STEIN, JANE ALLYN HARDYCK, AND M. BREWSTER SMITH
University of California, Berkeley 


In a test of Rokeach’s contention that prejudice is the result of perceived
dissimilarity of belief systems, 44 white 9th graders completed a “teen-age
social distance scale” for 4 “stimulus teen-agers,” presented as, respectively,
white or Negro, and like or unlike the respondent in values, in relation to Ss’
own responses 2 mo. earlier. Ss also indicated how friendly they would feel
toward each. When information was provided on values, similarity of belief
accounted for much more variance than the race effect, which was also
significant; other data indicate a strong race effect in the absence of such
information. Ss’ earlier responses to an otherwise undescribed “Negro teen-ager”
correlated substantially with their responses to an unlike but not to a
like Negro teen-ager.


One of the many ideas presented in The 
Open and Closed Mind (Rokeach, 1960) is 
that prejudice may be in large part the result 
of perceived dissimilarity of belief systems. 
That is, Rokeach, Smith, and Evans (1960) 
contend that the prejudiced person does not 
reject a person of another race, religion, or 
nationality because of his ethnic membership 
per se, but rather because he perceives that 
the other differs from him in important beliefs
and values. They report two studies in
which subjects were asked to rate pairs of
stimulus individuals on a 9-point scale, defined
at the ends by the statements, “I can’t see
myself being friends with such a person” and
“I can very easily see myself being friends
with such a person.” In one experiment, the
stimulus individuals were white or Negro; in
the other they were Jewish or gentile. Racial
and religious attitudes and general beliefs of
the stimulus individuals were also varied. In
this situation, it was found that the friendship
preferences expressed were determined primarily
on the basis of congruence in beliefs
rather than on racial or religious grounds.

Triandis (1961) took issue with this position,
stating that:

People do not exclude other people from their neighborhood,
for instance, because the other people have
different belief systems, but they do exclude them
because they are Negroes [p. 186].


¹ This paper may be identified as Publication A-26
of the Survey Research Center, University of California,
Berkeley.


He has reported results contrary to Rokeach’s 
contention regarding the primacy of belief 
congruence over race as a determinant of 
prejudice. Since he objected to Rokeach’s use 
of the single criterion of friendship as the 
measure of prejudice, he employed a social 
distance scale of 15 items. For his manipula- 
tion of belief congruence, he used “same phi- 
losophy” or “different philosophy” as deter- 
mined by the subjects’ most and least pre- 
ferred of Morris’ (1956) “13 ways to live.” 
Stimulus individuals in the study were varied 
in race, religion, and occupational status as 
well as in philosophy. He obtained a “race 
effect” that accounted for about four times as 
much variance, in terms of the percentage of 
the total sum of squares, as any of the other 
three effects singly, although all four main 
effects were highly significant. 

Rokeach (1961) replied with the objection 
that the long and involved passages of Morris’ 
“ways to live” could not be equated with be- 
lief systems as he defined them; the “ways to 
live” were too vague and were not salient to 
the subjects. He concluded that the results of 
Triandis’ study were therefore irrelevant to 
the point at issue. In a more recent study, 
Byrne and Wong (1962) essentially supported 
Rokeach’s position, employing personal feel- 
ings of friendliness and willingness to work to- 
gether in an experiment as dependent vari- 
ables. 

The present study was designed with the 
intent of reconciling these disparate findings. 
It seemed reasonable to assume that there
might be some truth in each position, and 
that the large differences between the results 
obtained by Rokeach et al. and by Byrne and 
Wong, on the one hand, and by Triandis, on 
the other, followed primarily from the meth- 
ods used. 

In the design of the present study, our first 
concern was that of making our “stimulus in- 
dividuals” appear real to our subjects. In 
Rokeach’s studies, pairs of individuals, de- 
scribed in very sketchy fashion, were pre- 
sented in such a way that it was rather ob- 
vious to the subject that a choice was to be 
made between race and belief. In Triandis’ 
study, there was less of a suggestion of choice, 
but the descriptions were equally sketchy and 
the measure of belief was, indeed, very vague. 
Our intent has been, following an improved 
procedure devised by Byrne (1961), to pre- 
sent to our subjects, as nearly as is possible 
on paper, realistic stimulus individuals. In this 
study, as in Byrne and Wong (1962), stimu- 
lus individuals were varied in race and in the 
similarity of their beliefs to those previously 
expressed by the subjects. This procedure 
makes it possible to elicit absolute rather than 
comparative judgments so as to minimize self- 
consciously ideological responses. As our de- 
pendent variables, we employed both a meas- 
ure of friendly feelings and a social distance 
scale, on which responses to each individual 
item could be separately analyzed. 


METHOD


The sample consisted of 23 male and 21 female 
white teen-agers in two ninth grade classes of a California
high school. The subjects, all of whom were
14 years of age, came mainly from working class
homes in a nonmetropolitan industrial community.
They participated in the study during their advisory
periods.²

At the beginning of the period, the experimenter 
introduced himself as “a research worker from the 
University of California” and handed out a mimeo- 
graphed booklet to each student, by name. The in- 
structions were printed on the front page of the 
booklet and read as follows: 


As you remember, a few months ago we asked
you to answer some questions concerning your interests
and attitudes about yourself, your friends,
and certain groups of teenagers. You may also recall
that there were some questions asking you to
give first impressions about people when you knew
only a few things about them, such as the person’s
religion or type of job. We are very much
interested in how people form these impressions.

In fact, we would like to know how you would
feel about some teenagers who took the same questionnaires
as you did, but in other parts of the
country. Therefore, we have taken some of their
answers and presented them on the following pages.

We want you to look at the descriptions of four
teenage boys [girls] and then answer some questions
about how you feel toward them. The four
teenagers will be called: TEENAGER I, II, III,
and IV. If you have any questions, please raise
your hand and the research worker will help you.
Be sure to read everything carefully.³


² We would like to thank Wayne Henderson, his
teaching staff, and the students at Pacifica High
School, Pittsburg, California, for their cooperation
in this research, and Herbert Weissman who served
as experimenter in one classroom.


As the instructions indicate, 2 months prior to this 
study the students had filled out the pretest version 
of a questionnaire being developed for a large-scale 
study of teen-age attitudes towards minority groups.⁴
A value scale on the pretest questionnaire had asked 
the students, “Do you think teenagers in general 
ought to . . .” about each of 25 items. Five response 
alternatives were provided, ranging from “Strongly 
feel they should” to “Strongly feel they shouldn’t.” 
The students’ own responses to these items on the 
pretest provided the basis for the manipulation of 
belief congruence in the present study (Figure 1). 

For each of the subjects, two “stimulus teen-agers” 
were constructed who were like him in values, and 
two were constructed who were unlike him, following 
a procedure similar to that used by Byrne (1961). 
One “like” stimulus teen-ager was made up whose 
responses were identical with those given by the sub- 
ject. In order to avoid raising the suspicions of the 
subjects, the other “like” teen-ager was made to 
differ slightly from the first by moving the responses 
to six items, chosen at random, one step on the 5-point
scale.⁵ Each “unlike” teen-ager was created by


3 Male subjects answered questions about boys; female
subjects answered questions about girls. Wording
throughout the questionnaire was adapted to the
sex of the subject.

4 This study of teen-age attitudes is a part of a 5-year
program of research on various aspects of anti-Semitism
being conducted at the Survey Research Center of the
University of California under the general direction of
Charles Y. Glock. The research is supported by a grant to
the Survey Research Center from the Anti-Defamation League
of B'nai B'rith. We gratefully acknowledge our indebtedness
to the Anti-Defamation League, but this organization is not
to be held responsible for our interpretations.

5 A check of responses to the question, “How much
like you is this teen-ager?” showed that responses to
an “exact-like” stimulus teen-ager differed somewhat
from those to a “modified-like” teen-ager (t = 1.76,
p < .05, one-tailed test). Since half of the white-like
stimulus teen-agers were “exact like” and half of each
were “modified like,” this difference cannot have
affected our results.
[Figure 1: facsimile of a booklet page showing the value-scale
answers attributed to TEENAGER I. Each item completing “I think
teenagers in general ought to . . .” is marked on one of five
response columns: Strongly Feel They Should, Feel They Should,
Don’t Care, Feel They Shouldn’t, Strongly Feel They Shouldn’t.]

FIG. 1. Example of information concerning values provided to
describe a “stimulus teen-ager.” (Underlined responses reflect the
subject’s own responses from the pretest questionnaire.)
TEENAGER I

1. Sex  M     Grade  9

2. What program are you taking in school? (If undecided, mark
   the program you think you will take.)
     0 ___ Vocational
     1 ___ Commercial
     2  X  College preparatory
     3 ___ General
     4 ___ Other (write in)

3. Last year, what kind of grades did you get?
     0 ___ about an A average          4 ___ about a C average
     1 ___ between an A & B average    5 ___ between a C & D average
     2  X  about a B average           6 ___ about a D average
     3 ___ between a B & C average     7 ___ below a D average

4. What is your race?
     0  X  white    1 ___ Negro    2 ___ Oriental
     3 ___ other (What? ___)

FIG. 2. Example of information, other than values, provided to
describe a white-like or white-unlike stimulus teen-ager.

choosing at random three of the items the subject
had answered “Strongly feel they should” and changing
them to “feel they shouldn’t.” Three more modest
alterations were made as well, depending on the
subject’s original pretest response pattern.⁶

Besides the information on how the stimulus teen- 
agers had “answered” the value items, the subjects 
were given the sex, grade and program in school, 
last year’s grades, and race of the teen-ager (Fig- 
ure 2). For half the subjects, this additional infor- 
mation preceded that on values throughout the book- 
let, and for the other half, the value scale information 
was presented first. The sex and grade in school were 
always the same as that of the subject, the program 
in school was college preparatory, and grades were 
“about a B average.” Only race was varied. Thus, 
by combining like and unlike responses on the value 
scale with “Negro” and “white,” four stimulus teen- 
agers were created. These will be referred to as 
white-like, white-unlike, Negro-like, and Negro-un- 
like. These four were presented in eight different 
orders, the only restriction on ordering being that 
like and unlike teen-agers were alternated. 
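
The construction of the stimulus teen-agers and of the presentation orders described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' actual procedure: the numeric coding of the five response alternatives, the direction of each one-step change, and all function names are hypothetical, and the three "more modest alterations" made to each unlike teen-ager are left out because the text does not specify them.

```python
import itertools
import random

# Assumed coding of the five response alternatives (not given in the text):
# 0 = "Strongly feel they should" ... 4 = "Strongly feel they shouldn't".
STRONGLY_SHOULD, SHOULD, DONT_CARE, SHOULDNT, STRONGLY_SHOULDNT = range(5)


def exact_like(own):
    """The 'exact like' teen-ager simply repeats the subject's own answers."""
    return list(own)


def modified_like(own, rng, n_items=6):
    """Move n_items randomly chosen answers one step on the 5-point scale.

    The direction of the step is an assumption: inward at the ends of the
    scale, random elsewhere.
    """
    profile = list(own)
    for i in rng.sample(range(len(profile)), n_items):
        if profile[i] == STRONGLY_SHOULD:
            profile[i] += 1
        elif profile[i] == STRONGLY_SHOULDNT:
            profile[i] -= 1
        else:
            profile[i] += rng.choice((-1, 1))
    return profile


def unlike(own, rng, n_items=3):
    """Change n_items 'Strongly feel they should' answers to 'Feel they shouldn't'.

    The additional modest alterations mentioned in the text are omitted.
    """
    profile = list(own)
    strong = [i for i, answer in enumerate(profile) if answer == STRONGLY_SHOULD]
    for i in rng.sample(strong, min(n_items, len(strong))):
        profile[i] = SHOULDNT
    return profile


def presentation_orders():
    """All orders of the four stimuli in which like and unlike alternate."""
    stimuli = ("white-like", "white-unlike", "Negro-like", "Negro-unlike")

    def alternates(order):
        flags = [name.endswith("unlike") for name in order]
        return all(a != b for a, b in zip(flags, flags[1:]))

    return [order for order in itertools.permutations(stimuli) if alternates(order)]
```

Enumerating the permutations this way yields exactly eight orders, which agrees with the eight presentation orders reported in the text.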

As the subject opened his booklet, he was confronted
with the description of one of the four stimulus
teen-agers, called Teenager I. The subject read
this first description, at his own speed, and then
turned to the next pair of pages and answered three
questions. One of these served as a check on the
manipulation of belief congruence, and the other
two were measures of friendliness and social distance
towards the stimulus teen-ager. The questions will
be discussed in more detail under Results. The subject
then went on to read the descriptions of Teenagers
II, III, and IV and in turn to answer the questions
about them. When he had finished, usually in
20-25 minutes, he turned over his booklet and waited
for the rest of the class to complete their booklets.


6 A pilot study revealed that more drastic changes
than the ones finally used made the stimulus teen-agers
appear unreal. Details of procedure for constructing
the stimulus teen-agers may be had by writing
to the first author listed.


RESULTS AND DISCUSSION 


Check on the Manipulation of Belief Congruence


One question answered by the subjects
about each of the four stimulus teen-agers was
the following:


How much like you would you say Teenager X is?

0 ___ as much like me as any teenager I can think of
1 ___ very much like me
2 ___ a little like me
3 ___ a little unlike me
4 ___ very much unlike me
5 ___ as much unlike me as any teenager I can think of


The subjects’ responses to this question served
as a check on the manipulation of similarity
between the subject and the stimulus teen-agers.
Mean responses to this question, for
each of the four stimulus teen-agers, may be
found in Table 1. It is clear that the white-like
(1.63) and Negro-like (1.91) teen-agers
are seen as more like the subjects than are
the white-unlike (2.76) and Negro-unlike
(3.27) teen-agers. The mean of responses to
both like teen-agers combined (3.56) differed
from the mean of responses to both unlike
teen-agers (6.05) at well beyond the .001
level (t = 6.99). All individual like-unlike
comparisons also yield t values significant at
beyond the .001 level (p values reported
henceforth are all two-tailed). From these
data we may conclude that the manipulation
of similarity or dissimilarity between the
subjects and the stimulus teen-agers has been
successful.


“Friendliness” Question 
The first question the subject answered
about each stimulus teen-ager was the following:

If you met this teenager for the first time, what
would your immediate reaction be?
I think I would feel:

0 quite friendly
1 a little friendly
2 nothing either way
3 a little unfriendly
4 quite unfriendly


This question was intended to be a nearly
pure measure of “affect”; that is, a measure
of the subject’s overall reaction to each stimulus
teen-ager. The mean responses with respect
to each of the teen-agers are given in
Row 1 of Table 1. Subjects would feel most
friendly towards the white-like teen-ager
(.59), followed by the Negro-like (.83),
white-unlike (1.69), and Negro-unlike (1.86)
teen-agers. An analysis of variance⁷ using
McNemar’s (1955, p. 330) Case XIV mixed
model reveals that belief congruence accounts
for a much larger part of the variance of
responses than does race, although the effects
for both race and belief are significant. (F for
the belief effect = 37.72, p < .001; F for the
race effect = 5.21, p < .05.) This result, of
course, is consistent with Rokeach’s theory.

This question was also asked, in a somewhat
different format, on the “pretest” questionnaire
mentioned earlier. At that time subjects
were asked to respond to a list of many
different individuals, of which one was “A
Negro teen-ager.” Of the subjects in the present
experiment, 35 answered this item on the
pretest. An interesting finding emerges when
we compare responses to “A Negro teen-ager,”
with no other information, with responses to
Negro-like and Negro-unlike in the present
study.

A rather obvious expectation is that the
mean of responses to “A Negro teen-ager”
should fall between the means for Negro-like
and Negro-unlike. This is the case. Means for
those subjects present on both occasions (N


7 All analyses of variance reported follow this
model.


TABLE 1
MEAN RESPONSE TO STIMULUS TEEN-AGERS

                                                            Stimulus teen-ager
Question                                          N    White-like  White-unlike  Negro-like  Negro-unlike
1. “How friendly”ᵃ                               42       .59        1.69          .83         1.86
2. Social distance scale total scoreᵇ            44      9.84        5.90         7.81         5.54
2A. Individual items on social distance scaleᶜ
    Invite home to dinner                        44       .82         .36          .39          .20
    Go to party to which this person was invited 44      1.00         .80          .93          .70
    Go to same school                            44      1.00         .91         1.00          .91
    Have as member of social group               44       .91         .32          .82          .48
    Have as speaking acquaintance                44       .91         .59          .91          .59
    Live in same apartment house with this
      person and his (her) family                44       .89         .45          .43          .27
    Eat lunch at school with                     44       .93         .57          .84          .57
    Sit next to in class                         44       .98         .70          .93          .73
    Close personal friend                        44       .80         .27          .59          .32
    Work on committee with                       44       .93         .68          .91          .73
    Date my sister (brother)                     44       .68         .25          .09          .05
3. “How much like you?”ᵃ                        [?]8     1.63        2.76         1.91         3.27

ᵃ For these questions, a low score signifies greater friendliness and perceived similarity, respectively.
ᵇ Scoring: 1 for “yes,” 0 for “no”; 11 points possible.
ᶜ Scores run from 0 to 1. A mean of 1.0 signifies endorsement of the item by everyone.
TABLE 2
ANALYSIS OF RESPONSES ON THE “FRIENDLINESS” SCALE TOWARDS VARIOUS
STIMULUS TEEN-AGERS (N = 35)

Stimulus teen-ager       Mᵃ      σ²
Negro-like               .91     .48
Negro teen-agerᵇ        1.34     .99
Negro-unlike            1.80    1.13

Comparison                               Correlation   t between Ms   t between σ²s
Negro-like versus Negro teen-ager            .15          2.08*        [illegible]
Negro teen-ager versus Negro-unlike          .62          2.88**          <1
Negro-like versus Negro-unlike               .29          4.68           3.01

ᵃ A low score indicates greater friendliness towards the stimulus teen-ager.
ᵇ From pretest questionnaire.
* p < .05.
** p < .01.
*** p < .001.


= 35) are given in Table 2. (They are .91, 
1.34, and 1.80 for Negro-like, Negro teen- 
ager, and Negro-unlike, respectively.) Sub- 
jects feel significantly more friendly towards 
the Negro-like teen-ager than towards the Ne- 
gro teen-ager (t = 2.08, p < .05) and signifi- 
cantly more friendly towards the Negro teen- 
ager than towards the Negro-unlike teen-ager 
(t = 2.88, p < .01). 

One should also expect that subjects’ re- 
sponses to the Negro teen-ager should corre- 
late moderately both with responses to Negro- 
like and Negro-unlike. This should be the case 
unless, for some reason, subjects have an ex- 
pectation that Negro teen-agers in general are 
either like them or unlike them. Again refer- 
ring to Table 2, we note that the correlation 
between Negro-like and Negro teen-ager is .15 
and the correlation between Negro-like and 
Negro-unlike is .29. Neither of these correla- 
tions is large enough to be considered signifi- 
cantly different from zero. (The CR for the 
correlation of .29 reaches the .09 level of sig- 
nificance.) The correlation between Negro 
teen-ager and Negro-unlike, however, is .62, 
significant at beyond the .001 level, and also 
significantly different from the other two cor- 
relations, at the .01 and .05 levels, respec- 
tively. 

These differences would seem to demonstrate
an important point: namely, when our
white subjects are given no information at all
about a Negro teen-ager, they apparently assume
that he is different from them in values
and react towards him accordingly. It should
be noted here, referring again to Table 2, that
the variance of responses to the Negro-like
teen-ager is significantly smaller than the
variance of the other two distributions. Some
caution must be exercised in the interpretation
of the differences between the correlations
for this reason. Our data from the question
“How much like you would you say
Teenager X is?” add further information,
however. On that question, the subjects perceived
the Negro stimulus teen-agers to be
significantly less like them than were the white
teen-agers, even when given the same information
about both. The mean of responses to
Negro-like and Negro-unlike teen-agers combined
was 5.33, while the mean for like and unlike
white teen-agers was 4.44 (t for this difference
is 3.29, p < .01). That is, with belief
similarity held constant, the subjects perceived
that the white stimulus teen-agers were
more like them, given identical information
about the whites and Negroes. These results
parallel findings reported by Byrne and Wong
(1962, p. 247, Table 1), in which white subjects
attributed greater similarity of attitudes
to unknown whites than to unknown Negroes.
Our data further indicate the expectation held
by the subjects that a Negro teen-ager, simply
by virtue of his being a Negro, will be different
from them. It seems likely that their propensity
to react negatively towards the Negro
is based on this expectation. Equally compatible
with the obtained relationship, however, is the
possibility that, for persons sharing the
anti-Negro prejudices endemic in American
society, the sheer fact that a person is Negro
marks him as significantly “different,” however
similar he may be in other respects.


Social Distance Scale 


Our major measure of reactions to the four
stimulus teen-agers was the following “teen-age
social distance scale”:

Everyone has his own preferences about the people
he wants to associate with. There are probably
some people with whom you would be willing to be
very good friends, and others whom you would just
as soon not ever be with. We would like you to tell
us how close a relationship you think you would be
willing to have with TEENAGER X. Check the
blank under “yes” for each statement you agree with,
and the blank under “no” for each statement you
disagree with for TEENAGER X. Guess if you aren’t
really sure.

I think I would be willing:

to invite this person home to dinner
to go to a party to which this person was invited
to go to the same school with this person
to have this person as a member of my social group or club
to have this person as one of my speaking acquaintances
to live in the same apartment house with this person and his family
to eat lunch with this person in school
to sit next to this person in class
to have this person as a close personal friend
to work on a committee at school with this person
to have this person date my sister [brother]

This social distance scale, which was devised
for the pretest questionnaire, was patterned
after that of Triandis. Items were changed,
omitted, and added to make the scale suitable
for teen-age subjects; for example, no negative
items were used, on the assumption that
they would not discriminate between the subjects.
Total scores on the scale were obtained
by simply summing responses to the 11 items,
each scored “1” for “yes” and “0” for “no.”
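
The scoring rule just described amounts to counting "yes" answers. A minimal sketch follows; the short item labels and the function name are hypothetical conveniences, and the dictionary-based answer format is an assumption about how a booklet might be transcribed.

```python
# Short labels for the 11 scale items (labels are illustrative shorthand).
ITEMS = (
    "invite home to dinner",
    "go to party to which this person was invited",
    "go to same school",
    "have as member of social group",
    "have as speaking acquaintance",
    "live in same apartment house",
    "eat lunch at school with",
    "sit next to in class",
    "close personal friend",
    "work on committee with",
    "date my sister (brother)",
)


def social_distance_total(answers):
    """Sum the 11 items, scoring 1 for "yes" and 0 for "no".

    `answers` maps each item label to "yes" or "no". A high total means
    the subject accepts a closer relationship with the stimulus teen-ager.
    """
    missing = [item for item in ITEMS if item not in answers]
    if missing:
        raise ValueError(f"unanswered items: {missing}")
    return sum(1 for item in ITEMS if answers[item] == "yes")
```

A subject who checks "yes" for every statement receives the maximum of 11 points, and one who checks "no" throughout receives 0.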


TABLE 3
ANALYSIS OF VARIANCE OF TOTAL SOCIAL
DISTANCE SCALE SCORES

Source                SS         df       F
Individuals (A)       528.11      43      -
Race (B)               62.64       1     7.20*
Belief (C)            423.46       1    48.51**
A × B                 374.11      43     <1
A × C                 375.29      43     <1
B × C                  30.28       1     <1
A × B × C           9,225.11      43      -
Total                            175

* p < .02.
** p < .001.
TABLE 4
SOCIAL DISTANCE SCALE ITEM COMPARISONS WITH
RESPECT TO BELIEF AND RACE (N = 44)

Items in social distance scale                 t for beliefᵃ   t for raceᵇ
 1. Invite home to dinner                          4.57**         5.00**
 2. Go to party to which this person
    was invited                                    4.30**         2.00
 3. Go to same school                              2.57*           <1
 4. Have as member of social group              [illegible]        <1
 5. Have as speaking acquaintance                  4.92**          <1
 6. Live in same apartment house                   5.36**         4.92**
 7. Eat lunch at school with                       4.92**          <1
 8. Sit next to in class                           4.00**          <1
 9. Close personal friend                          6.15**         1.23
10. Work on committee with                         3.75**          <1
11. Date my sister (brother)                       4.80**         6.67**

ᵃ Based on the difference in mean response to like and unlike
stimulus teen-agers, regardless of race: (white-like + Negro-like)
- (white-unlike + Negro-unlike).
ᵇ Based on the difference in mean response to white and
Negro stimulus teen-agers, regardless of whether like or unlike:
(white-like + white-unlike) - (Negro-like + Negro-unlike).
* p < .02.
** p < .001.


Responses to the social distance scale were
analyzed in two ways. First, an analysis of
variance of the total scores was computed;
results are presented in Table 3. As in the
analysis of the “friendliness” question, belief
accounts for by far the largest amount of the
variance, although effects for both race and
belief are highly significant. (F for race =
7.20, p < .02; F for belief = 48.51, p <
.001.) Then, t tests for both race and belief
effects were calculated for each of the 11
items. That is, each subject’s responses to the
two Negro stimulus teen-agers were combined,
and his responses to the two white stimulus
teen-agers were combined. The t between the
means of these scores evaluates the race effect.
The belief effect was tested similarly, by
combining responses to the two like teen-agers
and comparing the mean of these scores
with the mean of the summed responses to the
two unlike teen-agers.
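
The item-level comparison just described can be sketched as follows. Because every subject rated all four stimulus teen-agers, the sketch assumes a t for correlated (within-subject) means; the text does not say which form of t was used. The data layout (one dictionary of 0/1 answers per subject for a single item) and the function names are hypothetical.

```python
import math


def paired_t(first, second):
    """t statistic for the mean of the paired differences first - second."""
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    mean = sum(diffs) / n
    variance = sum((d - mean) ** 2 for d in diffs) / (n - 1)
    if variance == 0:
        raise ValueError("all differences are identical; t is undefined")
    return mean / math.sqrt(variance / n)


def race_effect_t(responses):
    """Combine each subject's two white and two Negro answers, then compare."""
    white = [r["white-like"] + r["white-unlike"] for r in responses]
    negro = [r["Negro-like"] + r["Negro-unlike"] for r in responses]
    return paired_t(white, negro)


def belief_effect_t(responses):
    """Combine each subject's two like and two unlike answers, then compare."""
    like = [r["white-like"] + r["Negro-like"] for r in responses]
    unlike = [r["white-unlike"] + r["Negro-unlike"] for r in responses]
    return paired_t(like, unlike)
```

With 44 subjects the resulting t would be referred to a t distribution with 43 degrees of freedom. A positive value means more "yes" answers for the white (or like) teen-agers.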

The t-test analysis, presented in Table 4,
adds more specific information concerning the
areas in which race and belief effects are
strongest. It is clear that belief has a very
strong effect on all 11 items. All but one of
the differences between responses to like and
unlike teen-agers are significant at beyond the
.001 level; the difference on the item concerning
“Go to the same school” is significant
at the .02 level. The race effect, however,
appears to be specific to three items: “Invite
him home to dinner,” “Live in the same
apartment house,” and “Have him date my
sister [brother].” These 3 items on which the
race effect is significant at beyond the .001
level seem to be “sensitive areas,” ones in
which there is widespread resistance, in American
society, to Negro-white contacts. Rokeach
(Rokeach et al., 1960) has stated that his
theory applies “insofar as psychological processes
are involved . . . [p. 135].” As an example
of institutionalized racial prejudice outside
the framework of his theory he later
states “the southern white bigot would not
want his daughter to marry the ‘good’ Negro
any more than the ‘bad’ one [p. 165].” In
Rokeach’s sense, the present “sensitive” items
would seem to fall in the latter category.
Clearly, an empirical definition of institutional
prejudice in terms of an obtained “race
effect” would be circular and meaningless. For
purposes of future research, we would suggest
two criteria for situations that may be expected
to produce a “race effect”: intimacy of
contact and presence of others, in this case
parents, who are the enforcers of social
norms. At present, all we can state from our
empirical finding is that a belief effect is
strong on all the items, whereas a race effect
occurs on items that appear to involve publicly
visible relationships that are “sensitive”
or controversial by prevailing cultural standards.


D. D. STEIN, J. A. HARDYCK, AND M. B. SMITH


TABLE 5
ANALYSIS OF VARIANCE OF TOTAL SOCIAL
DISTANCE SCALE SCORES, PRETEST DATA

Source                SS         df       F
Individuals (A)       408.77      36      -
Race (B)              283.95       1    45.50*
Status (C)            134.33       1    28.52*
A × B                 224.80      36     <1
A × C                 169.42      36     <1
B × C                  10.28       1     <1
A × B × C           8.695.45      36      -

* p < .001.

One further set of data is available, from
the pretest, which provides an important comparison
with the results of the social distance
scale in the present study. On the pretest,
subjects were asked to respond, on the same
“teen-ager social distance scale,” to stimulus
teen-agers who resembled quite closely the
stimulus individuals used by Triandis. “Same
or different philosophy” and “same or different
religion,” which he used as variables, were
omitted. Our stimulus teen-agers were
stated to be Christians, and varied only in
race, white versus Negro, and in status. For
the status variable, program in school and
grades were varied. The teen-agers were thus
described as either “in the college preparatory
program getting Bs” or “in the vocational
program getting failing grades.” Again, there
were four stimulus teen-agers: white, low
status; white, high status; Negro, low status;
and Negro, high status.

The results of the analysis of variance of
total scores on the social distance scale, in
response to these four stimulus teen-agers, are
given in Table 5. In this case, there is a very
large race effect (F = 45.50, p < .001), about
twice as large as the smaller, but still highly
significant, status effect (F = 28.52, p < .001)
in terms of percentage of variance explained.
These results, obtained on 37 of the 44 subjects
used in the later study, resemble quite
closely those obtained by Triandis. When belief
is not a variable, as in these data, or when
the belief effect is weakened by the ambiguity
of the information provided, as in Triandis’
data, both race and status account for
appreciable portions of the variance, with race
being by far the more important variable.
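
As a rough arithmetic check on the analyses of variance, the F ratios for the main effects can be recomputed from the sums of squares and degrees of freedom printed in Tables 3 and 5. The sketch below assumes that each main effect is tested against its own interaction with individuals (race against A × B, belief or status against A × C). That choice of error term is an inference from the printed values, not something the text states, and the function name is hypothetical.

```python
def f_ratio(ss_effect, df_effect, ss_error, df_error):
    """F = mean square for the effect divided by mean square for its error term."""
    return (ss_effect / df_effect) / (ss_error / df_error)


# Table 3 (present study): error terms are A x B for race, A x C for belief.
f_race = f_ratio(62.64, 1, 374.11, 43)        # about 7.20
f_belief = f_ratio(423.46, 1, 375.29, 43)     # about 48.52

# Table 5 (pretest data): error terms are A x B for race, A x C for status.
f_race_pretest = f_ratio(283.95, 1, 224.80, 36)
f_status_pretest = f_ratio(134.33, 1, 169.42, 36)

# Ratio of the two pretest sums of squares, the sense in which the race
# effect is "about twice as large" as the status effect.
ss_ratio_pretest = 283.95 / 134.33            # about 2.11
```

Under this assumption the recomputed values agree with the published F ratios to within a few hundredths, the small differences presumably reflecting rounding in the printed sums of squares.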

The explanation for all of these data, it
would seem, is to be found in a very simple
fact. Individuals make judgments about others
on the basis of all of the relevant information
they possess. If little information is provided
and a judgment is demanded, it is made on
the basis of inferences from past experience
or information obtained from others. That
first impressions are seldom accurate is due
to the fact that very little information is available,
and the person must be judged on the
basis of some known group membership. The
correlations presented earlier, between responses
to Negro-like, Negro teen-ager, and
Negro-unlike, seem to indicate that the inference
made by most subjects about a Negro
teen-ager, in the absence of other information,
is that he is unlike them.

If the foregoing interpretation is correct,
the very large race effects obtained by Triandis
and also demonstrated in the pretest
data are easily accounted for. The subjects are
forced to guess at the belief systems of the
stimulus individuals, and their guess is that
the Negro is unlike them. Our subjects in this
situation respond with a very large “race effect.”
When essentially the same subjects
were provided, in the later study, with actual
information about the belief systems of the
stimulus individuals, they no longer had to
guess, and they responded primarily, though
not exclusively, in terms of the information
about belief congruence with which they had
been provided.


CONCLUSIONS


The data presented strongly support Rok- 
each’s theory that the variable of belief con- 
gruence accounts for a major portion of the 
variance in prejudice, if it does not tell the 
“whole truth” about it. The teen-age subjects 
in this study, when given extensive informa- 
tion concerning the belief systems of stimulus 
teen-agers, react primarily in terms of simi- 
larity of beliefs and only very secondarily in 
terms of race. This was the case in an analy- 
sis of total scores on a social distance scale, 
and in an analysis of “friendliness” responses. 
Strong “race effects” were obtained on “sensi- 
tive” items on the social distance scale, per- 
haps reflecting institutionalized areas of preju- 
dice, and on total social distance scores when 
information concerning belief systems was not 
provided.⁸

Not only do our results support Rokeach’s
contention regarding the primacy of belief
congruence, but they also account for the
discrepancy between the findings reported by
Rokeach et al. (1960) and by Byrne and
Wong (1962), on the one hand, and those
reported by Triandis (1961), on the other.
When subjects are forced to evaluate stimulus
individuals in terms of their beliefs, then
belief congruence is more important than race.
But when the belief component is not provided,
spelled out in considerable detail, subjects
will react in racial terms on the basis of
assumptions concerning the belief systems of
others, and of emotional or institutionalized
factors. The practical implications of these
results are obvious. If people of different races
encounter one another under conditions favoring
the perception of belief congruence (as,
for example, in equal-status contacts), then
racial prejudice should be substantially
reduced.


8 Current research by one of the authors (DDS)
replicates these findings for white ninth grade
subjects in an Eastern school system in which there
were substantial numbers of Negro students.
3 groups of Ss, selected on the basis of perceptual defense patterns (avoiders,
sensitizers, and nonspecific defenders, or NSD) were exposed twice to a
tension-arousing film (Wages of Fear) at 1-wk. intervals. Continuous recording
of 4 psychophysiological variables was done throughout both film viewings.
On initial showing, the avoider and sensitizer groups showed markedly less
skin-conductance response than the NSD group. No differences were found on
the other psychophysiological variables on initial showing. On the 2nd showing
avoiders and sensitizers reacted more intensely in skin conductance than they
did initially while the NSD group showed a marked attenuation in
skin-conductance response on the 2nd viewing. The data for heart rate did not
parallel the skin-conductance data as the avoider group showed less heart-rate
response on 2nd showing while the NSD group showed greater heart-rate
reactivity. When the groups with clear defensive patterns (avoiders and
sensitizers) were separated into high- and low-anxiety groups, interactions
between coping style and anxiety level were noted on finger pulse volume but
not for skin conductance. The question of patterning of response to threat
and its relationship to optimum psychological functioning is discussed.


Most modern theories of personality attempt
at some level to deal with individual
differences in response to intense emotional
arousal. In most instances it has been found
necessary, in accounting for the wide differences
in arousal noted, to postulate processes
within the individual which mediate between
the stimulus and the ultimate verbal and
physiological response observed. Recent theory
(Hartman, 1939) has emphasized the role of
control or regulatory processes as primary
factors mediating between stimulus and response,
while earlier theory, drawn from classical
psychoanalysis, emphasized the role of
defensive styles in controlling affective arousal.
Whether couched in positive terms (ego-strength,
ego-control processes) or in negative
terms (repression, denial, projection, etc.),
the intent has been similar, namely, to place
the coping style of the individual in a central
position in accounting for the magnitude of
reaction to a highly arousing negative stimulus
and the rate of adaptation to that stimulus
following repeated exposures to it. The present
paper reports an exploratory investigation of
the relationships among experimental measures
of coping style and initial reactivity to
a highly arousing motion picture and adaptation
to a second viewing of that film.


1 This study was supported by Grants 04720 and
08744 to Goldstein from the National Institute of
Mental Health, United States Public Health Service,
and by a grant from the Upjohn Corporation to
the Mt. Sinai group. The authors would like to
express their appreciation to Julia Wark for her
kind assistance throughout the experiment and to
the UCLA Health Sciences Computing Facility
for computer time and valuable consultation.

Despite the general agreement on the importance
of coping style in mediating between
stimulus and response, there are few guidelines
in contemporary theory for predicting a relationship
between a particular coping style and
either reactivity or adaptation to the arousing
stimulus. However, implicit in the discussions
of the various coping styles are hints which
might provide the basis for differential prediction.
Coping response patterns can be
divided into two general classes, those based
primarily upon avoidance of the threatening
stimulus and those based upon vigilance for
threat. Within the framework of psychoanalytic
theory, defenses such as intellectualization,
reaction formation, and projection
might be classified as coping styles which
would result in greater vigilance for threat,
while repression and denial represent attempts
to avoid awareness of threat. If the
avoidance class of coping mechanisms is designed
to avoid recognition of threat, it could
be predicted that individuals who characteristically
use this class of defenses would be
less reactive to threat and, because of their
tendency to avoid verbalization of the stressor
stimulus following exposure, would show less
adaptation to the stressor stimulus. The expectation
for individuals who characteristically
use the vigilance class of coping mechanisms
would be reversed; because of their heightened
sensitivity to threat, they would be highly
reactive to a stressor stimulus and likely to
show greater adaptation to the stimulus. The
latter prediction is based on the expectation
that the vigilant individual would be more
likely to ruminate about the stimulus, to
verbalize its impact, and thereby to extinguish
the feelings originally stimulated. Following
these lines of reasoning, the following
specific hypotheses were tested:

Hypothesis 1. Individuals using coping 
mechanisms of the avoidance type will be less 
reactive psychophysiologically to the initial 
showing of a tension-arousing film than in- 
dividuals who use vigilance defenses. 

Hypothesis 2. Individuals who use coping 
mechanisms of the avoidance type will show 
less adaptation psychophysiologically to a 
second viewing of the film than individuals 
using vigilance-type defenses. 


METHOD 
Subjects 


Since the authors did not have available a stable 
resource population from which to draw subjects, 
they were dependent upon volunteer subjects. In 
order to insure a wide range of coping styles, sub- 
jects were solicited from two general populations, 
a volunteer group not currently seeking psychiatric 
help and a population of psychiatric outpatients. 
The nonpsychiatric volunteers were drawn from two 
sources: acquaintances of the experimenters, naive
with regard to the experiment, and students from
the introductory course at UCLA who volunteered
for course credit. The patient sample was drawn
from two sources: a private practice sample whose
therapists informed them of the possibility of volun-
teering for the experiment and applicants for help at
a local community clinic who were similarly in-
formed. All subjects volunteered to participate in
the experiment and no compensation, monetary or 
otherwise, was provided. Since the experiment 
required 6 experimental hours distributed on 3 
separate days, it was easier to induce females to 
volunteer than males; thus, the final sample of 32 
subjects consisted of 24 females and 8 males.


Method of Studying Coping Styles 


The coping style of each subject was assessed 
using the perceptual defense battery developed by 
one of the authors (Goldstein, Jones, & Kinder, 
1964). Using methods of tachistoscopic presenta- 
tion, each subject responded to threatening and 
neutral stimuli under three different conditions of 
measurement. The method for selection of stimuli, 
the different conditions of measurement, and the 
general rationale for estimating coping style in 
this fashion are described fully in Goldstein et al. 
(1964). However, in the present study, only the 
data derived from what has been called the 
stimulus present procedure were used for classifying
subjects. This procedure involves the presentation of
single stimulus words (either threat or neutral) in 
a random sequence at a single exposure setting 
designed to yield above chance accuracy. The sub- 
ject’s differential accuracy for threat and nonthreat 
words was the basis for classification. If the ac- 
curacy for threat words was greater than the ac- 
curacy for neutral words, the subject was classified 
as a sensitizer, while the reverse indicated an 
avoidance classification. To insure that this classifi- 
cation was based upon reliable difference scores 
and did not represent random variation around the 
mean, reference was made to normative data col- 
lected previously on 200 subjects which had been 
converted to standard score form. Thus, for a sub- 
ject to be classified as an avoider, his score had to 
be one standard deviation from the mean of the 
reference group in the avoidance direction. The
analogous criterion was used for selecting sensitizers.
In addition to these two extreme groups another 
comparison group was selected which has received 
little attention in the literature on experimental 
studies of coping style, namely, a group which did 
not show a definite trend in their perceptual defense 
scores. To qualify for this group, a subject had to
show no differential accuracy for threat as com- 
pared to neutral words using the statistical criterion 
of falling within one point of the mean score in 
the standardization sample. This group was termed 
the nonspecific defender group (NSD) and provided 
a comparison with the other two groups which 
showed stable defense patterns. Using the statistical 
criteria mentioned above, it was possible to classify 
32 out of 36 subjects into one of the three groups. 
The 4 subjects not classified were dropped from 
further data analysis. Fourteen subjects were classi- 
fied as avoiders, 9 as sensitizers, and 9 as NSD sub- 
jects. The perceptual defense battery was admin- 
istered to each subject on the first experimental day. 
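
As an illustration only, the classification rule described above can be sketched in a few lines of Python. The function and parameter names are hypothetical. The sketch assumes that "one standard deviation from the mean" means at least one standard deviation, that a positive difference score means greater accuracy for threat words, and that the one-point band for the NSD group is expressed in the same units as the difference score; the original battery may have differed on any of these points.

```python
def classify_coping_style(diff_score, norm_mean, norm_sd, nsd_band=1.0):
    """Classify one subject from a threat-minus-neutral accuracy difference.

    Hypothetical helper: all names are illustrative, and diff_score,
    norm_mean, norm_sd, and nsd_band are assumed to share the same units.
    """
    deviation = diff_score - norm_mean
    if deviation >= norm_sd:
        return "sensitizer"    # markedly more accurate on threat words
    if deviation <= -norm_sd:
        return "avoider"       # markedly less accurate on threat words
    if abs(deviation) <= nsd_band:
        return "NSD"           # no differential accuracy
    return "unclassified"      # dropped from further analysis


# Made-up normative values (mean 2, SD 5), for illustration only.
for score in (12, -4, 2.5, 5):
    print(score, classify_coping_style(score, norm_mean=2, norm_sd=5))
```

Subjects whose scores fall between the one-point band and the one-standard-deviation cutoff are left unclassified, which corresponds to the 4 subjects dropped from the analysis.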

Since subjects were not drawn from a homo- 
geneous population, it was necessary to determine 
whether the three groups were reasonably comparable
on other parameters which might affect reaction 
and adaptation to the stressor stimulus. The group 
was extremely homogeneous with regard to educa- 
tional level, being either college graduates or college 
students. However, to provide a more objective
estimate of intellectual level each subject was 
given the Shipley-Hartford test following the per- 
ceptual defense procedure. The average verbal score 
for the three groups was as follows: avoiders, 28.32;
sensitizers, 29.67; NSD, 32.62; these did not differ
significantly. The sex distribution in each group was 
as follows: avoiders, 9 females and 5 males; sensi- 
tizers, 7 females and 2 males; NSD, 7 females and 
2 males. These proportions did not differ significantly 
from each other.

The variable of degree of psychopathology in the 
three groups will be discussed in a later portion 
of this paper in conjunction with the experimental 
findings. 


Experimental Stimulus 


On the second experimental day, typically the day 
following the perceptual defense procedure, the sub-
ject was shown the film Wages of Fear, during which
a complex of psychophysiological variables were 
continuously recorded. The film is a 100-minute, 
dubbed version of the original. It has been generally 
rated as highly tension arousing although over the 
100-minute period there are occasional tranquil
scenes. Wherever possible two subjects were run
simultaneously. Each subject was instructed that
he was to watch a movie, was given a brief intro- 
duction to the psychophysiological recording pro- 
cedures, and was made as comfortable as possible. 
Approximately 5 minutes after being hooked up to 
the equipment, the movie was started. The psy- 
chophysiological measures continuously recorded 
were: skin conductance, heart rate, finger pulse 
volume, and respiration rate.2 The apparatus used to 
record each response system is described elsewhere 
(Wenger, Engel, & Clemens, 1957). Following the 
completion of the movie, each subject was asked 
to return at exactly the same time 1 week later to 
“view a movie.” Subjects were not told that they 
would be seeing the same movie and any questions 
on that point were parried by the experimenter. On
the third experimental session, each subject saw 
Wages of Fear once again; however, he was not 
aware of that fact until the film began. 

Since a large number of subjects were female, it 
was important to run these subjects at similar 
phases in their menstrual cycle to avoid the changes 
in autonomic responsivity in different phases of the
cycle. Therefore, female subjects were run such
that both movie sessions were at least 5 days before
or after the subject’s actual or expected menstrual
period.

2 All psychophysiological leads were placed on a
single arm and hand (except for the second EKG
lead) and this hand was held in place by a weighted
strap to reduce movement artifact.


Method of Reading Psychophysiological 
Records 


A method of time sampling was used in reading 
the 100 minutes of recording. The time sampling 
technique varied slightly for each psychophysiolog- 
ical channel as follows: 

Skin resistance. The lowest resistance in the first
5-minute period of film viewing was read and
provided a base-line estimate which was used to
correct later reactivity scores. The use of a base-
line measure obtained during a very tranquil period
of the film rather than a prefilm base line was done 
to correct for the large orienting response shown to 
the onset of the film. In many instances subjects did 
not return to their preorienting level and any cor- 
rection of later reactivity scores by the latter 
measure would have confounded the orienting re-
sponse with later emotional responses to the highly
arousing aspects of the film. Essentially, we were 
using the response of an alerted organism to a 
neutral stimulus as a base-line variable to correct 
later reactivity to emotional cues. In addition the 
minimum and maximum resistance during the film 
were read. 

Heart rate. The fastest 30-second period was 
located in the first 5 minutes of film viewing. Then, 
the average heart rate during that period was com- 
puted. For each successive 5-minute point, the 
fastest 30-second period around the point was located 
and the average heart rate was computed. A similar 
procedure was used to locate the slowest heart rate
per 30 seconds at each 5-minute period. Once again 
the measure taken during the first 5-minute period 
was used as a base line to correct later reactivity 
scores. 

Finger pulse volume. An analogous procedure to 
heart rate was used in selecting the periods of 
maximum constriction and dilation in the initial 
5-minute period and for each successive 5-minute
period. 

Respiration rate. During the first 5-minute period,
the fastest 1 minute of respiration rate was located
and used as a basal measure. At each 5-minute
point, the fastest 1 minute and slowest 1 minute
were located and read. The mean of each 1-minute
period was then computed. Once again, the reading
in the first 5-minute period was used as a base line
to correct later reactivity measures. 
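
The window-selection step used for heart rate and respiration rate can be sketched as follows. This is a minimal illustration, not the procedure actually applied to the polygraph records. It assumes a hypothetical digitized record with one rate value per second, and the function name is invented for the example.

```python
def extreme_window_means(samples, window):
    """Return (fastest, slowest) mean rate over any `window` consecutive samples.

    Hypothetical helper: `samples` is assumed to hold one rate value per
    second for the segment of record around a 5-minute point.
    """
    if window <= 0 or len(samples) < window:
        raise ValueError("segment is shorter than the window")
    total = sum(samples[:window])
    highest = lowest = total
    # Slide the window one sample at a time, updating the running sum.
    for i in range(window, len(samples)):
        total += samples[i] - samples[i - window]
        highest = max(highest, total)
        lowest = min(lowest, total)
    return highest / window, lowest / window


# Made-up segment: 30 s at 60 bpm, 30 s at 90 bpm, 30 s at 50 bpm.
segment = [60] * 30 + [90] * 30 + [50] * 30
print(extreme_window_means(segment, window=30))  # (90.0, 50.0)
```

A 30-sample window corresponds to the 30-second periods used for heart rate; a 60-sample window would correspond to the 1-minute periods used for respiration rate.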


RESULTS 


Even with the time sampling technique 
described above, a great deal of data is avail- 
able for each subject and some method of 
data reduction is required. In view of the high 
degree of redundancy in various indices of 
psychophysiological functioning described by
Lazarus, Speisman, and Mordkoff (1963), it
was determined that a simple measure of
maximum reactivity during the film and a
simple measure of the maximum degree of
relaxation during the film would provide a
direct and an indirect measure of each in-
dividual’s psychophysiological reaction to the
film. Therefore, for each autonomic measure,
a change score was computed for each sub-
ject in which his score for the maximum
reaction period was subtracted from his initial
level during the first 5 minutes of film view-
ing. For skin resistance (transformed to log
conductance units), this meant that the dif-
ference score for reactivity was between his
initial conductance and maximum conductance
during film viewing. In the case of relaxation
scores, an analogous difference score was ob-
tained between initial level and the score in-
dicative of minimal arousal. Thus, using con-
ductance as an example, the difference score
here was between initial level and minimal
conductance. Thus, for each psychophysiolog-
ical measure, one score for maximal reactivity
and one score for maximal relaxation was
computed. In order for a measure to be scored
as relaxed, it must have increased (or de-
creased depending upon the measure) above
the initial resting level in the relaxed direction.


TABLE 1
REACTIVITY MEASURES FOR AVOIDERS, SENSITIZERS, AND NSD GROUPS TO FIRST VIEWING
OF THE FILM

                                              Orthogonal comparisons (F)
Group          M (adjusted)    SD       F      A and S vs. NSD    A vs. S

Skin conductance (maximum log conductance - initial level log conductance)
Avoiders           .070       .024    7.144*       17.18**          0.23
Sensitizers        .046       .027
NSD                .221       .029

Finger pulse volume (initial level - maximum constriction) (microliters)
Avoiders           0.85       .120    0.595        0.218            0.33
Sensitizers        1.02       .150
NSD                1.05       .150

Heart rate (fastest heart rate - initial level) (beats per minute)
Avoiders           5.72       1.24    0.408        0.29             0.53
Sensitizers        4.27       1.56
NSD                4.17       1.55

Respiration rate (fastest rate - initial level)
Avoiders           2.21       0.50    0.472        1.25             0.09
Sensitizers        2.17       0.69
NSD                1.29       0.81

* p < .01, df = 2/27.
** p < .001, df = 1/27.
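
The two skin-conductance change scores can be illustrated with a short sketch. It follows the sign convention of the Table 1 heading (maximum log conductance minus initial level), so that a larger positive score means a larger reaction. The base of the logarithm is not stated in the text; base 10 is assumed here, and the function names and resistance values are hypothetical.

```python
import math


def log_conductance(resistance_ohms):
    """Convert a skin resistance reading to log conductance (base 10 assumed)."""
    return math.log10(1.0 / resistance_ohms)


def conductance_change_scores(initial_r, lowest_r, highest_r):
    """Return (reactivity, relaxation) difference scores in log conductance units.

    Lowest resistance corresponds to maximum conductance (peak reaction);
    highest resistance corresponds to minimum conductance (peak relaxation).
    """
    initial = log_conductance(initial_r)
    reactivity = log_conductance(lowest_r) - initial
    relaxation = initial - log_conductance(highest_r)
    return reactivity, relaxation


# Made-up readings in ohms: base line 100,000; lowest 50,000; highest 200,000.
reactivity, relaxation = conductance_change_scores(100_000, 50_000, 200_000)
print(round(reactivity, 3), round(relaxation, 3))  # 0.301 0.301
```

Because the scores are differences of logarithms, each is the log of a ratio of resistances, so a halving of resistance yields the same reactivity score regardless of the subject's base-line level.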


Reactivity to the First Film Viewing 


The reactivity scores for the three subject 
groups across the four psychophysiological chan-
nels are presented in Table 1. The
mean score in each instance is represented in 
its appropriate unit of measurement. The 
means presented in Table 1 represent the
adjusted means derived from an analysis of 
covariance applied to these data. In this 
analysis, the initial level score was used as 
the covariate to adjust the difference score 
(initial level — reactivity). The F ratios 
presented are derived from the analysis of
covariance and the orthogonal comparisons 
presented in the far right of the table are 
based on the specific comparisons between 
the adjusted means utilizing the adjusted 
error mean square from the covariance 
analysis as the error term. Table 1 will 
be discussed separately for each autonomic 
variable. 
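
For readers unfamiliar with the adjustment, the textbook one-covariate formula for an adjusted group mean is the group mean of the difference score minus the pooled within-group slope times the deviation of the group's mean initial level from the grand mean initial level. The sketch below applies that standard formula to made-up numbers; it is not a reconstruction of the computations behind Table 1, and the function name is hypothetical.

```python
def adjusted_means(groups):
    """Covariance-adjusted group means with a single covariate.

    `groups` maps a label to (initial_levels, change_scores). The pooled
    within-group regression slope of change score on initial level is used
    to adjust every group mean to the grand mean initial level.
    """
    def mean(values):
        return sum(values) / len(values)

    sxx = 0.0
    sxy = 0.0
    all_x = []
    for x, y in groups.values():
        mx, my = mean(x), mean(y)
        sxx += sum((xi - mx) ** 2 for xi in x)
        sxy += sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
        all_x.extend(x)
    slope = sxy / sxx
    grand_x = mean(all_x)
    return {
        label: mean(y) - slope * (mean(x) - grand_x)
        for label, (x, y) in groups.items()
    }


# Made-up data: group "b" starts higher, which inflates its raw mean.
example = {
    "a": ([1, 2, 3], [2, 4, 6]),
    "b": ([3, 4, 5], [7, 9, 11]),
}
print(adjusted_means(example))  # {'a': 6.0, 'b': 7.0}
```

In the example the raw means differ by 5 units, but after adjustment for initial level the groups differ by only 1 unit, which is the kind of correction the covariance analysis is intended to supply.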

Skin conductance. The difference scores 
presented in Table 1 indicate marked differ- 
ences in reactivity between the three groups. 
The sensitizer group shows the least reaction 
to the film, the avoiders were somewhat more
reactive, while the NSD subjects were highly 
reactive to the film. The F from the analysis 
of covariance for the between-group com- 
parison was significant (p < .01, df = 2/27). 
The orthogonal comparisons indicate that the 
avoider and sensitizer groups do differ signifi- 
cantly from the NSD group but not from each 
other. Thus, the clear-cut distinction in 
reactivity lies between the two groups that 
show a clear-cut defensive pattern tachisto- 
scopically and the group that does not indi-
cate such a pattern. Thus, our hypothesis
concerning differential degrees of reactivity 
in the avoider and sensitizer groups is not 
confirmed and an unexpected pattern of dif- 
ferences from the NSD comparison group 
appears.

Finger pulse volume. The reactivity data
for finger pulse volume fail to indicate a
statistically significant difference between the
three groups. Once again the avoider
group mean tends to be lower than the NSD
group mean, but this difference is not
statistically reliable (F < 1).

Heart rate. It can be seen in Table 1 that
the three groups do not vary significantly 
on the heart-rate increase variable. The trend 
here is for greater increase in heart rate 
for the avoider group, but once again none
of the individual group comparisons is
significant. 

Respiration rate. The groups do not differ 
significantly on the maximum increase in 
respiration rate. The trend once again is for 
greater reactivity in the avoider group but it is
not statistically greater than the mean of the
other two groups. 

In summary, measures of reactivity taken 
during the first film viewing are not uni- 
formly revealing of, or sensitive to, group 
differences. Marked differences appear for
skin conductance, suggestive of slight in- 
creases in conductance in the avoider and 
sensitizer groups but marked increase in the 
NSD group. But none of the other auto-
nomic variables suggests marked differences in
reactivity. 


“Relaxation” Scores during First Film 
Viewing 

As mentioned above a second set of scores 
was available for each subject, reflecting the 
opposite trend in the measures taken. From 
one point of view, these measures do cor- 
relate with states of relaxation (decreased 
skin conductance, lowered heart rate, slow 
respiration rate, etc.) and thus represent the
inverse of a stress or arousal response. From
these measures the response to a tension-
arousing stimulus can thus be estimated
inversely by the degree of relaxation or de-
arousal shown during tranquil periods of the
film.3 The data for the “relaxation” measures
are presented in Table 2. Once again they
will be discussed separately for each psycho-
physiological channel.


TABLE 2
“RELAXATION” MEASURES FOR AVOIDERS, SENSITIZERS, AND NSD GROUPS FOR FIRST VIEWING
OF THE FILM

                                              Orthogonal comparisons (F)
Group          M (adjusted)    SD       F      A and S vs. NSD    A vs. S

Skin conductance (initial level log conductance - minimum log conductance)
Avoiders           .126     [illeg.]  [illeg.]      4.81*           0.49
Sensitizers        .154       .031
NSD                .057       .031

Finger pulse volume (maximum dilation - initial level)
Avoiders           1.1        .28     0.226        0.29             0.21
Sensitizers        0.9        .36
NSD              [illeg.]   [illeg.]

Heart rate (initial level - slowest heart rate)
Avoiders           5.43       0.89    [illeg.]     [illeg.]        [illeg.]
Sensitizers        7.87       1.11
NSD                7.12       1.11

Respiration rate (initial level - slowest respiration rate)
Avoiders           2.69     [illeg.]  [illeg.]     [illeg.]        [illeg.]
Sensitizers        2.84       0.80
NSD                2.50       0.85

* p < .05, df = 1/27.


Skin conductance. Once again, the avoider 
and sensitizer subjects appear drawn from 
a different population than the NSD group.
Both avoider and sensitizer groups showed 
marked decreases in skin conductance while 
the NSD group showed a slight drop at some 
points during the film. While the between- 
group F in Table 2 is not significant, the 
orthogonal comparison between the averaged 
mean of the avoider and sensitizer groups and 
the NSD mean is statistically significant 
(p < .05, df = 1/27). Thus, the groups are
mirror images of each other in reactivity and 
relaxation. The avoiders and sensitizers indi- 
cate slight skin-conductance reactivity and 
marked increases in relaxation, while the 
NSD group showed a marked reaction with
slight relaxation. To determine if these
trends held for individual subjects as well 
as group, a net change score was computed 
for each subject which represented the dif- 
ference between the two difference scores 
(relaxation — reactivity) such that a positive 
score indicated more reactivity than relaxa- 
tion while a negative score, the opposite trend. 
The mean net scores, adjusted for initial 
level, were: avoider, -5.12; sensitizer,
-10.47; and NSD, +10.85, indicating that
the avoider and sensitizer groups showed 
more relaxation than reactivity during the 
film while the NSD group showed more
reactivity than relaxation. 

Finger pulse volume. The differences be-
tween groups here are nonsignificant. The
trend is for the avoider and sensitizer groups
to show more dilation (relaxation) than the
NSD group.

3 It is recognized that the scores which we classify
as indicators of “relaxation” may not reflect that
psychological state so much as different types of
reactions to stress. For example, Lacey (1959) has
demonstrated that heart-rate deceleration accompanies
certain types of arousing stimuli while acceleration
accompanies others. However, despite the ambiguity
surrounding psychophysiological measures when
scored in the “relaxation” direction, it was felt that
they represented a dimension of response to the film
which should be analyzed independently of the
traditional measures of arousal.

Heart rate. The between-groups F is not 
statistically reliable for this measure. The 
trend is for the avoiders to show less heart- 
rate deceleration than the sensitizer and NSD 
groups. The specific comparison between the 
avoider and sensitizer group falls short of 
statistical significance. 

Respiration rate. The data here do not vary 
significantly among the three subject groups. 

In summary, skin conductance appears to 
be the one measure on which differences be- 
tween the groups are found in “relaxation,” 


Adaptation to the Second Film Viewing 


The second basic hypothesis of the present 
study was that avoiders and sensitizers would 
differ in their rate of adaptation to the 
tension-arousing film. Specifically, it was pre- 
dicted that avoiders would show less adapta- 
tion than sensitizers in psychophysiological 
response to the film. No specific predictions 
were made regarding the NSD group. It 
should be apparent from the data on initial 
film viewing that the hyporeactivity of the 
avoider and sensitizer groups limits the degree 
of adaptation that could occur. One can 
hardly adapt to a stimulus to which one 
never reacted. The NSD group, in contrast, 
reacted intensely (in skin conductance), and 
could show a marked adaptation. Thus, a 
ceiling effect operated on these data which 
limited the amount of change for the avoider 
and sensitizer groups.

The analysis of the change from first to 
second viewing scores presented one methodo- 
logical problem. Since each reactivity score 
(initial level-reaction) had to be corrected 
for initial level, the reactivity scores had to 
be corrected for the subject’s initial level for 
each viewing session. Thus, each subject’s 
reactivity score in each session (first or 
second) was corrected by the procedure sug- 
gested by Lacey (1956) in which the regres- 
sion of reactivity on initial level was used 
to obtain a predicted reactivity score. The 
difference between the predicted and obtained 
reactivity score was termed a subject’s 
corrected reactivity score. The corrected re-
activity score for the first session was then
subtracted from the second session corrected
score to yield a distribution of difference
scores. The nature of the subtraction was
such that a plus score indicated greater
reactivity on the second showing, while a
minus score indicated adaptation or lessened
reactivity. The mean difference scores by
groups for each psychophysiological variable
are presented in Table 3. Analyses of
variance were applied to each of the four
distributions of scores. The data in Table 3
will be discussed separately for each variable.


TABLE 3
MEAN DIFFERENCES IN REACTIVITY SCORES (ADJUSTED) FOR SESSION 2 - SESSION 1

Group          M diff.     n      F      Orthogonal comparisons

Skin conductance
Avoiders        +3.30     12     2.06    A versus NSD, F = 4.05
Sensitizers     +0.34      9             A versus S, F < 1
NSD             -3.73      9

Finger pulse volume
Avoiders        -0.03     14     0.19    none > 1
Sensitizers     -0.77      8
NSD             +1.10      9

Heart rate
Avoiders        -2.39     14     3.08    A versus NSD, F = 5.48*
Sensitizers     +0.73      9             A versus S, F = 2.80
NSD             +1.97      9

Respiration rate
Avoiders        +0.15     13     0.47    none > 1
Sensitizers     -0.17      7
NSD             -0.78      6

Note. [illeg.] in recording, etc., resulted in variable ns for the
different psychophysiological variables.
* p < .05.

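
The correction for initial level and the session difference score can be sketched as a simple regression residual. This is an illustration under stated assumptions rather than the exact procedure of Lacey (1956), which is usually expressed in standardized form. The sign convention (obtained minus predicted) and the function names are assumptions made for the example.

```python
def corrected_reactivity(initial_levels, reactivity_scores):
    """Residual of each reactivity score from its regression on initial level.

    The regression is fitted by ordinary least squares across subjects
    within one session. A positive residual means the subject reacted more
    than his initial level predicted (obtained minus predicted is assumed).
    """
    n = len(initial_levels)
    mx = sum(initial_levels) / n
    my = sum(reactivity_scores) / n
    sxx = sum((x - mx) ** 2 for x in initial_levels)
    sxy = sum((x - mx) * (y - my)
              for x, y in zip(initial_levels, reactivity_scores))
    slope = sxy / sxx
    intercept = my - slope * mx
    return [y - (intercept + slope * x)
            for x, y in zip(initial_levels, reactivity_scores)]


def session_difference(first_corrected, second_corrected):
    """Second minus first session: plus = more reactive, minus = adaptation."""
    return [s - f for f, s in zip(first_corrected, second_corrected)]


# Made-up scores for four subjects in one session.
residuals = corrected_reactivity([1, 2, 3, 4], [2, 4, 6, 9])
print([round(r, 2) for r in residuals])  # [0.2, -0.1, -0.4, 0.3]
```

Because the residuals sum to zero within a session, each corrected score expresses a subject's reactivity relative to others with comparable initial levels rather than in absolute units.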
Skin conductance. The mean change scores
in Table 3 indicate that the avoider group
tended to react more to the second viewing
of the film, while the NSD group showed
less reactivity. The sensitizer group showed
no particular group trend. The specific com-
parison between the avoider and NSD
groups just falls short of statistical signifi-
cance (p < .07). The number of subjects
showing a particular trend is similar: 9 out
of 12 avoiders showed greater reactivity on
the second showing, 6 out of 9 sensitizers
showed greater reactivity on the second show-
ing, while 3 out of 9 NSD subjects showed
greater reactivity on the second showing.
Once again, we fail to find a statistically
reliable difference between the avoider and
sensitizer groups as predicted. It is the
avoider-NSD contrast which is most evident;
the avoiders not only fail to adapt but show
increased reactivity in skin conductance
while the NSD group shows adaptation. If
we put the data on the skin conductance for
the two showings together, the avoider group
shows low reactivity during the first showing
with a tendency towards increased reactivity
on the second showing; the NSD group
shows intense reactivity on the first showing
and less reactivity to the second showing;
finally, the sensitizer group shows low re-
activity on the first showing and no pre-
dictable pattern on the second showing.

Heart rate. The data on heart rate do not
parallel those for skin conductance as the
avoider group indicates less heart-rate ac-
celeration during the second showing while
the NSD group showed greater heart-rate ac-
celeration during that showing. This specific
comparison is statistically significant. The
sensitizer group once again shows no con-
sistent trend. If the reader will return to
Table 1, he will note that the avoider group
tended to show more cardiac acceleration
during the first showing than the other two
groups. Thus, we see that high reactivity
in the first showing coupled with lessened
reactivity in the second
showing shows up in one group, while low
reactivity in the first showing is coupled with
increased reactivity in the second showing
in another group. The embarrassing aspect of
this apparent symmetry is that the group
trends for heart rate are opposite to those
found for skin conductance!

Finger pulse volume. There are no trends 
of note on this variable, as can be seen in 
Table 3. It is interesting to note that the 
order of the means is similar to heart rate. 

Respiration rate. There are no significant 
differences between groups on this variable; 
however, it will be noted that the group 
means are similar in direction to skin 
conductance. 


Change in “Relaxation” Indices from First 
to Second Viewing 


The change scores for all variables except 
heart rate did not reveal statistically signifi-
cant change scores between the groups. For
heart rate, the inverse of the reactivity data 
was found as the avoider group showed an 
increase in relaxation (heart-rate decelera- 
tion) M = +2.18, the sensitizer group showed 
less “relaxation,” M = —1.38, as did the 
NSD group, M = —2.53. The overall F be- 
tween groups was not significant (F = 2.62); 
however, the specific avoider-NSD compari- 
son was (F = 4.50, p < .05, df = 1/29). This 
mean presents in reverse the earlier finding 
that the avoider group showed the greatest 
adaptation in heart rate. 


Relationship to Psychopathology 


It was mentioned earlier in this paper that 
the degree of psychopathology was not con- 
sidered in assigning subjects to groups. Some 
of our subjects quite clearly were using 
coping styles which were ineffective in bind-
ing anxiety as they were seeking psychiatric
treatment while others were not actively self- 
defined as patients. It cannot, however, be 
assumed that this latter group was free of 
marked psychopathology as previous investi- 
gations (Pollin, 1962) have shown that a 
high degree of psychopathology can be 
detected in nonpatient volunteer subjects. 
Therefore, we needed some indication of the 
relative success of an individual’s coping style 
in binding manifest anxiety and discomfort 
which did not require the intensive clinical
investigation necessary to evaluate psycho-
pathology in nonpatient subjects. Fortu-
nately, a paper-and-pencil inventory was
administered to all subjects prior to the 
experiment proper, the Bendig (1956) short- 
ened version of the Taylor Manifest Anxiety 
(MA) scale which purports to measure mani- 
fest anxiety. Despite the limitation of this 
type of self-report instrument, it did provide 
a wide range of scores within our subject 
population and provided a basis for esti- 
mating a subject’s success in avoiding high 
levels of chronic tension. Using a cutoff 
score of 10 or greater on the MA scale 
(shortened form) for the criterion of high 
anxiety and 9 or below as low anxiety, certain
group trends became obvious.4 First, only
2 of the 14 high-anxious subjects in the 
sample fell in the NSD group. The modal 
pattern in that group were scores ranging 
from 0 to 5, with 2 subjects scoring 11 and 
19. Thus, whatever the NSD person is, he 
is not on the average a person who reports 
(and possibly experiences) a great deal of 
chronic anxiety. Whether this implies that 
the NSD subject does not need a rigidly defined
coping pattern because of a low level of con- 
flict or that he has no defense pattern and 
must deny, massively, any hint of anxiety 
is difficult to determine at this juncture. The 
balance of the high-anxious subjects were dis- 
tributed as follows in the other two groups: 
avoiders, 7 high anxious, 7 low anxious; 
sensitizers, 5 high anxious, 4 low anxious. 
In an effort to determine the possible rela- 
tionship between success of a coping pattern 
and psychophysiological response to stress, 
mean scores on each psychophysiological vari- 
able for high- and low-anxious subjects in 
the avoider and sensitizer groups were com- 
puted. These data for first film viewing and 
difference score between first and second 
viewings are presented in Table 4. In this 
table, all data are corrected Lacey scores 
or differences between corrected Lacey scores. 
A two-way analysis of variance was done 
on these data in the hope of locating inter- 
actions between coping style and anxiety level 
as well as a primary effect for anxiety level,
independent of coping style. The F ratios for
each category (anxiety level, coping style,
and the interaction of the two variables) are
presented directly adjacent to the means for
each subgroup. The means for the low-
anxious subjects in the NSD group are pre-
sented for comparison purposes, but were not
used in the statistical comparisons cited.

4 It should be noted that the high anxious-low
anxious dichotomy turned out to be a patient-non-
patient division as all patients scored 10 or greater
on the MA scale and all volunteers scored below 10.


TABLE 4
REACTIVITY AND CHANGE IN REACTIVITY SCORES FOR AVOIDERS AND SENSITIZERS BY ANXIETY LEVEL

                                   First viewing                  Change (2 - 1)
Group                      n       M       F                      M         F

Skin conductance
Avoider-high anxious       7     -2.17   AL = 0.16              +6.39    AL = [illeg.]
Avoider-low anxious        7     -3.38   AS = [illeg.]          +1.09    AS = 0.[illeg.]
Sensitizer-high anxious    5     -3.98   AL X AS = 0.15         +1.14    AL X AS = 0.73
Sensitizer-low anxious     4     -6.19                          [illeg.]
NSD-low anxious            7     +8.31                          -3.[illeg.]

Heart rate
Avoider-high anxious       7     +0.58   AL = 1.09              [illeg.]  [illeg.]
Avoider-low anxious        7     +2.34   AS = 0.65              -2.[illeg.]
Sensitizer-high anxious    5     -1.34   AL X AS = 0.01         +2.10
Sensitizer-low anxious     4     +1.18                          -0.99
NSD-low anxious            7     -2.27                          +4.09

Finger pulse volume
Avoider-high anxious       7     +1.27   AL = 2.4[illeg.]       -4.15    AL = [illeg.]
Avoider-low anxious        7     -3.01   AS = 0.55              +4.14    AS = [illeg.]
Sensitizer-high anxious    5     +0.08   AL X AS = 3.24         -0.11    AL X AS = 4.[illeg.]
Sensitizer-low anxious     3     +2.71                          -1.86
NSD-low anxious            7     +0.81                          -0.52    [illeg.] avoiders = 9.62**

Respiration rate
Avoider-high anxious       6     +0.26   AL = 0.01              +0.52    AL = [illeg.]
Avoider-low anxious        7     +0.03   AS = 0.01              -0.16    AS = [illeg.]
Sensitizer-high anxious    5     -0.19   AL X AS = 0.72         +0.45    AL X AS = 1.1[illeg.]
Sensitizer-low anxious  [illeg.] +0.88                          -1.69
NSD-low anxious            4     -0.88                          -0.95

* p [illeg.]

It can be seen in Table 4 that the break-
down by anxiety level failed to reduce the
variance between groups on most psycho-
physiological variables. The data for skin con-
ductance remain essentially identical to our
earlier findings for initial reaction and change
between first and second viewing. The avoider
and sensitizer groups, regardless of anxiety
level, are less reactive than the NSD group
initially and show greater reactivity on the
second showing, while the NSD group shows
less reactivity on the second showing. Only
with finger pulse volume do we
see a significant effect when the avoider and
sensitizer groups are divided by anxiety level,
which is present at a near-significant level
for first viewing and at a significant level
for the change between first and second score.
Two statistically significant effects can be 
noted on this change score, one due to anxiety 
level and the other to the interaction between 
anxiety level and coping style. Generally the 
high-anxious subjects showed adaptation in 
finger pulse volume while the low-anxious 
subjects showed either greater responsiveness 
or no marked change. The interaction noted 
suggests that high-anxious avoiders show
adaptation while low-anxious avoiders show 
increased responsiveness. High-anxious sensi- 
tizers show slight adaptation while low- 
anxious sensitizers show marked adaptation. 
There are indications that the significance 
due to anxiety level is an artifact of the 
interaction noted, as the cancellation of high- 
and low-anxious avoider scores to essentially 
zero produces a marked effect between anx- 
iety groups. Therefore, these data appear to 
be meaningfully understood as an interaction 
between anxiety level and coping style.

Patterning of Results


One striking feature of Table 4 is the
marked inverse relationship between first
viewing scores and the change score from
first to second viewing. If a group of subjects
was underreactive on the first showing, they
were in almost every instance more reactive
on that same variable on the second showing,
while the reverse held true for the groups
which were above average in reactivity on
the first showing. Possibly, this indicates some
general regression towards the mean in re- 
activity scores; however, while it might be 
expected that a high level of physiological 
activity might be followed by lesser activity, 
it is not compatible with a general regression 
hypothesis to assume that minimal activity 
is necessarily followed by intense reactivity 
to the same general stimulus. One other 
hypothesis worthy of consideration is that 
adaptation in a particular physiological 
response system when exposed to an intensely 
arousing stimulus requires an initial period 
of reactivity to that stimulus. When this 
initial period of response is inhibited, not 
only is adaptation not possible but the indi- 
vidual is more vulnerable to high levels of 
arousal in subsequent exposures to that 
stimulus. The paradox which remains to be 
understood is how both of these processes 
(reactivity and inhibition of reactivity) can 
occur in different psychophysiological systems 
(i.e., skin conductance and heart rate) within
the same individual during the same general 
exposure period. 

There is another aspect of patterning
present in these data, namely, the patterns
within each coping group when broken down
by anxiety level. The number of subjects in
each group is small, yet it may be of interest
to the reader to see the five groups presented
in a fashion which permits ready comparison
of their initial reactivity trends when examined
across four psychophysiological variables.
In Table 5 each group is presented in a
fashion which permits this sort of comparison.
Instead of using numbers, the trend in each
group is represented by a +, 0, or − sign
indicating whether the group was significantly
above (+), at the mean (0), or below the
mean (−) on their corrected Lacey scores
on that variable. In this table, the NSD
group is once again presented for comparison 
purposes. If the reader will read across the 
NSD row, he will note an intense skin- 
conductance response but little or no reaction 
in the other psychophysiological channels. As 
mentioned earlier, the other four groups with 
clear coping styles are unreactive in skin 
conductance, but vary in the reaction in the 
other variables. The high-anxious avoiders, 
for example, fail to respond in skin con- 
ductance but show a positive response in 
each of the other response systems. The 
low-anxious avoiders also respond minimally 
in skin conductance, show a positive response 
in heart rate but fail to respond in finger 
pulse volume and respiration. The high- 
anxious sensitizers are reactive in every chan- 
nel except skin conductance. These patterns 
are no doubt of questionable reliability and 
yet they raise the question of whether dif- 
ferentiation between individuals using dif- 
ferent coping styles, and between individuals 
who vary in the effectiveness of their coping 
styles might not require a comparison of 
patterns rather than a variable comparison. 
It is extremely difficult, when examining the 
data in Table 5, to conceive of a unitary 
pattern of autonomic discharge when exposed 
to a stressor stimulus. 


DISCUSSION 


The present experiment hypothesized 
marked differences between avoiders and 
sensitizers on reactivity and adaptation to a 
stressor film. By and large we have not found 
these differences. Instead, the comparison be- 
tween subjects who show a stable defense 


TABLE 5


PATTERNS OF PSYCHOPHYSIOLOGICAL RESPONSE TO
FIRST FILM VIEWING IN AVOIDER AND
SENSITIZER GROUPS, BY ANXIETY LEVEL


Group | Skin conductance | Heart rate | Finger pulse volume | Respiration rate
Avoiders-high anxious = + + Ee 
Avoiders-low anxious = + = 0 
Sensitizers-high = = 0 i 
anxious 
Sensitizers-low = a a ui 
anxious | 
NSD-low anxious + =- 0 oe

pattern and subjects who do not indicate such
a pattern proved most interesting. Clearly, 
the NSD group warrants closer scrutiny to 
determine if the members exhibiting this per- 
ceptual defense pattern are examples of psy- 
chological health or extreme vulnerability. If 
the former should prove to be the case, then 
it suggests that the reasonably conflict-free 
individual may respond initially quite in- 
tensely to a stressor stimulus and may adapt 
following this initial response. This would 
further imply that the markedly defended 
individual may, through the use of his de- 
fensive manners, short circuit affective arousal 
in the short run only to experience the fear 
associated with the stimulus at a later date 
when reexposed to the stimulus. There is 
some support for this notion in the work of
Janis (1958) on surgical patients in which 
he found that a reasonable level of anxiety 
before surgery was ordinarily followed by 
adaptation to postsurgical pain and discom- 
fort, while two other groups, one which denied 
the threat and one which experienced in- 
appropriate neurotic anxiety before surgery, 
failed to achieve relief following surgery. 
Theorizing here must be tempered by the 
fact that these findings hold true mainly for 
skin conductance. When we turn our attention
to the other variables, particularly the
circulatory ones, we become aware of how
cautious we must be in theorizing about
adaptation to stress, as adaptation appears
to be specific to certain psychophysiological
variables and not to others measured at the 
same time. 

The results of the present study naturally
warrant comparison with another program of
research using motion-picture stress, the work
of Lazarus and his colleagues (Lazarus,
Speisman, Mordkoff, & Davison, 1962;
Speisman, Lazarus, Mordkoff, & Davison,
1964), as ego-defense theory has played an
important role in much of their research. In 
this research, there are certain parallels to 
the present findings on initial reactivity to 
the film. First, skin conductance was found 
to be the most sensitive indicator of the 
stress response and of individual differences 
in stress response. The data on heart rate 
were frequently much less clear in the 
Lazarus studies, although not as confusing as
in the present study. Second, when experimental
sound tracks representing the defensive
styles of intellectualization and denial-reaction
formation were presented with their
stressor film, a marked attenuation in skin
conductance response was noted when compared
to a group (trauma group) which
received a sound track which emphasized the
painful aspects of the film to the participants.
In our study, subjects were classified on the
basis of perceptual defense patterns found
previously to be associated with these same
defensive styles (avoidance with denial-reaction
formation and sensitization with
intellectualization), and the data are further
congruent in that both groups differed more
from a group without a defensive track available
than they did from each other. In fact,
in the Speisman et al. (1964) study little or no
difference was found between the denial and
intellectualization tracks in attenuating the
skin-conductance response. This suggests a
certain degree of convergence between studies
which attempt to build in certain aspects of
the coping response and studies which attempt
to categorize people on the basis of
their characteristic coping response to stress.
Unfortunately, no data have been published 
to date by the Lazarus group on adaptation 
to repeated exposure to the stressor film. It 
would be even more supportive of the present 
results if the groups receiving the defensive 
and trauma sound tracks shifted their 
respective positions on the second viewing. 
The suggestion of pattern differences among 
our different defensive groups on the meas- 
ures comparing first to second viewing scores 
can be interpreted from a number of points 
of view. One interpretation might be that 
through chance our defensive and NSD 
groups contain individuals varying markedly 
in their particular system of autonomic response
specificity (Lacey, 1959). The fact
that groups that did not respond on the first
showing of a film in a particular psychophysiological
system did respond markedly
in that same system on the second viewing
does not support this interpretation. If there
is meaning to the concept of autonomic re- 
sponse specificity then the specificity should 
be reasonably constant across stimulus conditions.
Another explanation, suggested by
Roessler, Alexander, and Greenfield (1963),
is that “there may be an optimum order of 
response magnitudes among various physiological
variables for a particular stimulus and
that deviations from such a hypothetical 
optimum may be maladaptive in nature 
[p. 152].” Thus, a marked skin-conductance 
response accompanied by a heart-rate re- 
sponse of a lesser magnitude to Wages of 
Fear may represent a more optimum organi- 
zation of responses than little or no skin- 
conductance response coupled with marked 
responses in the circulatory variables. While 
this explanation has a certain appeal and 
relates to the nature of the classification of 
subjects in the present experiment, it raises 
the question of how to evaluate what is 
optimum in psychophysiological responses. 
We require some independent criteria of ef- 
fective emotional responding and if these 
criteria correlate with different organizations 
of response in the various psychophysiological 
variables, then this hypothesis will have 
greater conviction. 
A third possibility, more parsimonious than 
the other two in some sense, is that the 
method used in scoring the circulatory vari- 
ables serves to mask rather than reveal law- 
fulness. Malmstrom (1964) has recently 
argued that traditional methods of analyzing 
heart-rate data fail to take into account the 
numerous automatic homeostatic mechanisms
designed to bring deviations back to the
optimum range. It may indeed be possible
that when the kind of time series analyses 
recommended by Malmstrom are carried out 
on circulatory variables that greater uniform- 
ity between heart rate and skin conductance 
would be apparent and that the relationship 
between individual difference variables and 
the various psychophysiological systems will 
be more congruent than is apparent in the 
present data. This type of argument certainly 
must be pursued before seriously considering 
hypotheses which postulate more complex 
principles of psychophysiological organization.

If we focus on the skin-conductance data
for the first film viewing, there is a need for
better understanding of the processes underlying
the minimal response in the two groups
with clearly defined defense patterns. One
possibility suggested by the Lazarus group
is that this response attenuation is based
upon a cognitive reappraisal of the stressful 
aspects of the situation such that the impact 
of what is viewed is minimized. This is a 
reasonable interpretation and one which we 
can neither support nor contradict with the 
present data. However, since the film used 
in the present research is quite different in 
length, complexity, and character develop- 
ment than that used by the Lazarus group 
there is some question of how readily the 
various arousing incidents in Wages of Fear 
can be minimized by cognitive reappraisal. 
There is evidence in our data for a possible 
alternate interpretation of the limited respon- 
siveness of the avoider and sensitizer groups, 
namely, one emphasizing the general involve- 
ment or lack of involvement of the subjects 
in the total film-viewing experience. It should 
not be overlooked in film studies of stress that 
experiments are essentially asking subjects to 
become involved in a pictorial representation 
of an arousing event or series of events. If 
the reader will recall the data on skin con- 
ductance presented earlier, it will be recalled 
that the avoider and sensitizer groups showed 
a marked drop in skin conductance relative 
to the small rise shown at any point during 
the film, suggesting a progressive state of 
relaxation rather than tension or activation. 
It is therefore possible that these defensive 
subjects never became seriously involved with 
the characters or their plight at any point 
in the film and that this lack of involvement 
(absence of narrowing of attention, failure 
to identify with the characters) accounts for 
the limited skin-conductance responses of the 
avoider and sensitizer groups. The fact that 
the film used in the present study was made 
primarily for entertainment while the film 
used by the Lazarus group was a realistic 
film of an actual event suggests greater rele- 
vance for the involvement hypothesis for 
the entertainment film. We are not trying 
to deny the value of the cognitive reappraisal 
hypothesis of response attenuation; rather we 
are suggesting that the defensive response of
a defensive subject may occur prior to the
onset of a projector in the form of an inability
or unwillingness to become involved
in the whole procedure of film viewing and
that this type of generalized task set should
be given as much consideration in stress
research with films as the specific implicit
verbal operations believed to occur during
the various scenes of the film.


FURTHER DATA ON CHANGES IN WORD-ASSOCIATION NORMS*


JAMES J. JENKINS AND DAVID S. PALERMO


University of Minnesota 


Previous study of adult word-association norms collected in 1910, 1925, 1927, 
1933, and 1952 has shown that systematic changes have taken place. 2 new 
norm groups (1942 and 1960) are here added to the previously available 
material. Major findings of the earlier study are supported: responses change 
systematically as a function of the time between normative collections; high 
frequency responses are most resistant to change; superordinate responding has 
steadily decreased. The frequency of popular responses is irregularly increasing. 
There is a suggestion that a plateau has been reached with respect to the 
change in frequency of popular responses and the decline of superordinates. 


In a recent article Jenkins and Russell 
(1960) called attention to the fact that word- 
association norms change over time, even 
when the subjects are defined as the same 
social group (i.e., University of Minnesota 
students). They went on to examine the 
major normative collections available for 
the Kent-Rosanoff Free Association Test 
(K-R) with the following findings: 

1. The frequency of popular responses to 
the K-R stimuli seemed to be increasing 
rapidly over the time period 1910-52. No
smooth function was found, however, and 
the writers hypothesized that other factors, 
such as whether the testing had been in- 
dividual-oral or group-written, exercised some 
effect in addition to the general temporal 
change. 

2. Words used as responses to stimuli 


with the highest ranking responses having the 
highest stability. 

3. Abstract responses, specifically super- 
ordinates, tended to decrease steadily in 
popularity across the time period of the
study, 1910-52.

The writers suggested that a change in 
test-taking attitudes might account both for 
the increase in popular responses and the 
decrease in abstract responses. It was further 
suggested that the change in individual response
terms (aside from the frequency and
abstraction changes) might be attributed to
changes in the meaning of particular stimuli 
over time. 

The purpose of the present article is to add 
new data to the previous study, to determine 
whether the trends may still be described in 
the same fashion, and to assess whether any- 
thing may be added to our understanding 
of them. Two new sources of data have been 
employed, first, a new set of norms collected 
at the University of Minnesota, and second, a 
set of norms collected by Wilson (1942) at 
the Compton Junior College, California (a 
period for which we previously had no data). 
The first set of data makes possible both 
the extension of the general normative study 
by 8 years and the specific comparison of the 
“same” social group over two intervals of 
time. The second set of data provides an 
additional test of the trends asserted to occur 
during the period already studied. The specific 
comparison of Minnesota norms will be dealt 
with first and the more general case treated
last.


COMPARISON OF THE 1927, 1952, AND 1960
MINNESOTA NORMS


Detailed data concerning the date of col- 
lection, characteristics of samples, and con- 
ditions of testing are given below. All three 
studies used the same stimulus list and the 
same order of presentation of items, though 
minor differences in the written format were 
present (1952 used two columns of stimuli to 
a page, 1960 used one column; 1927 is 
believed to have used two columns).


Norm 1927


Norms collected by Schellenberg (1930) in fall 
1927. 

Testing. Group test, printed form, written an- 
swers. Instructions suggest time pressure. Other 
stimulus words follow the K-R list.

Sample. Students numbering 925 entering the 
University of Minnesota. Seventy-nine percent were 
freshmen; 21% advanced students. Fifty-seven per- 
cent male; 43% female. 


Norm 1952 


Norms collected by Russell and Jenkins (1954) 
in fall 1952. 

Testing. Group test, printed form, written answers.
Instructions suggest a time pressure. (Same
instructions with regard to time as Norm 1927.)

Sample. Students numbering 1,008 in introductory
psychology at the University of Minnesota. Largely
sophomore population. About 60% male, 40%
female.


Norm 1960 


Norms collected by Palermo and Jenkins (1964) 
in fall 1960. 

Testing. Group test, printed form, written answers.
Instructions suggest a time pressure. (Instructions
with regard to time very similar to those
of Norm 1927 and Norm 1952.) Other stimulus
words follow the K-R list.

Sample. Students numbering 1,000 in introductory
psychology at the University of Minnesota. Largely
sophomore population. Fifty percent were male;
50% female.

The 1927, 1952, and 1960 norms were compared
with respect to the following characteristics: number
of changes in primary, secondary, and tertiary
responses; the relation of the strength of the
primary response to each stimulus word in each
norm set to its strength in each other norm set;
and the characteristics of the stimulus-response pairs
which change from one set to another.


TABLE 1


COMPARISON OF THE RESPONSE WORDS OCCURRING
IN THE FIRST THREE RANKS IN THE 1927
AND 1952 MINNESOTA NORMS


1952 norms | 1927 norms rank
rank | 1 | 2 | 3 | Lower | Total words
1 | 71 | 17 | 5 | 7 | 100
2 | 14 | 36 | 23 | 27 | 100
3 | 8 | 14 | 21 | 57 | 100
Lower | 7 | 33 | 51 | | 91
Total words | 100 | 100 | 100 | 91 |


TABLE 2


COMPARISON OF THE RESPONSE WORDS OCCURRING
IN THE FIRST THREE RANKS IN THE 1927
AND 1960 MINNESOTA NORMS


1960 norms | 1927 norms rank
rank | 1 | 2 | 3 | Lower | Total words
1 | 71 | 17 | 4 | 8 | 100
2 | 14 | 32 | 22 | 32 | 100
3 | 7 | 14 | 27 | 52 | 100
Lower | 8 | 37 | 47 | | 92
Total words | 100 | 100 | 100 | 92 |


RESULTS 


Changes in Primary, Secondary, and Tertiary 
Responses 


Tables 1, 2, and 3 give the results of the
comparison of the norms as to the identity of
the three most popular response words.
Table 1 reports the findings for the words
ranking first, second, and third in response to
a given stimulus in 1927 compared to the
words ranking first, second, and third in
1952. The first, second, and third responses
of 1927 were then compared to those for
1960 with the results shown in Table 2.
Finally, the 1952 and 1960 norms were compared
as shown in Table 3.

The tables may be read as follows: In
Table 1 the first column shows the distribution
of the 100 primary responses found in
1927. In terms of the 1952 norms, 71 were
still primary responses, 14 had become
secondary responses, 8 had become tertiary
responses, and 7 were no longer among the
first three most popular responses. One may
read the first row of the table in similar
fashion. Of the 100 words which constituted
the most popular responses in 1952,
71 were primary responses to those same
words in 1927, 17 were second responses
in 1927, 5 were tertiary responses, and 7
were not among the first three most popular
responses in 1927.
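
The reading rule just described can be checked mechanically. The following is a minimal sketch, not part of the original analysis: it enters the Table 1 cross-tabulation by hand (rows are ranks in the 1952 norms, columns are ranks in the 1927 norms, with the first row and first column taken from the figures quoted in the text) and confirms that every rank accounts for all 100 stimuli. The names `table1`, `row_total`, and `column_total` are illustrative assumptions.

```python
# Table 1 as a cross-tabulation of response ranks.
# Rows: rank of a word in the 1952 norms; columns: its rank in the 1927 norms.
# "lower" means the word was not among the first three responses.
# All variable and function names here are hypothetical, chosen for illustration.
RANKS = ["1", "2", "3", "lower"]

table1 = {
    "1":     {"1": 71, "2": 17, "3": 5,  "lower": 7},
    "2":     {"1": 14, "2": 36, "3": 23, "lower": 27},
    "3":     {"1": 8,  "2": 14, "3": 21, "lower": 57},
    "lower": {"1": 7,  "2": 33, "3": 51},  # lower/lower cell is not defined
}


def row_total(rank_1952):
    """Number of words tabulated at a given 1952 rank."""
    return sum(table1[rank_1952].values())


def column_total(rank_1927):
    """Number of words tabulated at a given 1927 rank."""
    return sum(row.get(rank_1927, 0) for row in table1.values())


row_totals = {r: row_total(r) for r in RANKS}
col_totals = {r: column_total(r) for r in RANKS}

# Diagonal cells: words holding the same rank in both norms.
stable = {r: table1[r][r] for r in ["1", "2", "3"]}

print("row totals:", row_totals)
print("column totals:", col_totals)
print("same rank in both norms:", stable)
```

The diagonal falls from 71 to 36 to 21, which is the pattern the text goes on to describe: primary responses are more stable than secondary ones, and these in turn more stable than tertiary ones.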

A study of the three tables suggests that
the change in responses is systematic and
“reasonable.” In each comparison primary
responses are more stable than secondary
responses and these in turn are more stable
than tertiary responses. New primary responses
are recruited in the main from words
which were secondary responses in the preceding
norm and primary responses which
change tend to become secondary responses
in the succeeding norms. The overall picture
is consistent with the idea of slow and regular
change with responses rising in strength to
displace other responses and these in turn
gradually declining in popularity. Examination
of the normative frequencies of the
words involved indicates further that, within
any given response rank, the highest strength
responses are most resistant and the lowest
strength most susceptible to change.
Comparison of the magnitude of change 
indicated by the tables suggests that the 
longer the time interval between samples, the 
lower the agreement (e.g., both 1952 and 
1960 norms are more unlike 1927 than they 
are unlike each other). This, however, must 
be tempered by the finding that the 1952 
and 1960 norms are almost exactly equal in
their degree of similarity to the 1927 norms,
implying that the difference between a 25-
and a 33-year interval is of little importance. 
It seems reasonable to suppose that a rather 
sizable number of changes in responses may 
be observed from one year to the next as 
relatively low-frequency responses shift ranks. 
Such changes are undoubtedly due to a multi- 
plicity of causes such as unreliability, pres- 
ence of close competing responses, saliency in 
the daily news, and appearance of a word in a 
popular song title. Over a major period of 


TABLE 3


COMPARISON OF THE RESPONSE WORDS OCCURRING
IN THE FIRST THREE RANKS IN THE 1952
AND 1960 MINNESOTA NORMS


1960 norms | 1952 norms rank
rank | 1 | 2 | 3 | Lower | Total words
1 | 85 | 10 | 4 | 1 | 100
2 | 8 | 61 | 18 | 13 | 100
3 | 5 | 15 | 44 | 36 | 100
Lower | 2 | 14 | 34 | | 50
Total words | 100 | 100 | 100 | 50 |


TABLE 4


CORRELATIONS OF THE RELATIVE STRENGTHS OF
PRIMARY RESPONSES IN THE THREE
MINNESOTA NORMS


Norms | 1952 | 1960
All responses
1927 | .70 (N = 100) | .73 (N = 100)
1952 | — | .93 (N = 100)
Identical responses only
1927 | .67 (N = 71) | .66 (N = 71)
1952 | — | .92 (N = 85)
Different responses only
1927 | .48 (N = 29) | .49 (N = 29)
1952 | — | .82 (N = 15)




time one might expect to observe a slow, 
cumulative drift of meaning, usage, and 
emphasis. Thus we would expect the 1952 
and 1960 norms to be different from the 
1927 norms to about the same extent, but 
still appreciably different from each other, 
at least insofar as the lower strength responses 
are concerned. 


Strength of Primary Responses to Particular 
Stimuli 


It was observed in the earlier article that
the frequencies of the primary response to a
particular stimulus in the 1927 and 1952 
norms were related, whether the responses 
had changed or not and in spite of the fact 
that all response strengths had increased 
very markedly. Table 4 presents the correla- 
tions between the three norms for all stimulus 
words, for those in which the response re- 
mained the same, and for those in which 
the response changed. It is clear that these 
data are compatible with the data on the 
changes in responses themselves. The 1952 
and 1960 norms correlate with the 1927 
norms to the same degree and intercorrelate 
more highly with each other. The reader 
should be cautioned concerning interpretation 
of the low correlations for those stimuli to 
which the responses have changed. The cor- 
relations are based on distributions of 
strengths which are low in mean value and 
much more homogeneous than the total dis- 
tributions or the distributions which show no 
change. The low correlation should not be
taken as evidence that the strengths of these
primary responses are changing in any radical 
fashion. 
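
The caution about low, homogeneous distributions is the familiar restriction-of-range effect, and a small simulation may make it concrete. The sketch below uses synthetic numbers only; none of them come from the norms. It assumes a hypothetical response strength that is uniform between 0 and 100 in one collection and reappears in a second collection with normally distributed error (standard deviation 15), and compares the correlation over all items with the correlation over the weakest quarter. The names `pearson_r`, `strength_a`, and `strength_b` are illustrative.

```python
import math
import random


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Synthetic data (an assumption for illustration, not the article's data).
rng = random.Random(0)
strength_a = [rng.uniform(0, 100) for _ in range(2000)]
strength_b = [x + rng.gauss(0, 15) for x in strength_a]

# Correlation over the full range of strengths.
r_full = pearson_r(strength_a, strength_b)

# Correlation over a low, homogeneous subset: the weakest quarter of the scale.
weak = [(x, y) for x, y in zip(strength_a, strength_b) if x < 25]
r_weak = pearson_r([x for x, _ in weak], [y for _, y in weak])

print(f"all items:       r = {r_full:.2f}")
print(f"weak items only: r = {r_weak:.2f}")
```

The relation between the two collections is identical for every item in this simulation, yet the coefficient computed on the weak items alone is much lower, simply because those items vary less among themselves.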

The data confirm the earlier conclusion 
that the relative frequencies of the primary 
responses to particular stimuli tend to remain 
constant over time. 


Nature of the Changed Stimulus-Response 
Pairs 


The stimulus words which evoked differing 
primary responses in 1927, 1952, and 1960 
are given in Table 5. The changes are grouped 
in terms of changes in the first interval, 
changes in both intervals, changes in the 
first interval which changed back in the
second interval, and changes in the second
interval alone. 

Of the 29 responses which changed from 
1927 to 1952, 17 remained changed in the 
later norms, 9 changed again to still other 
responses, while 3 changed back to old re- 
sponses. Comparing 1927 and 1960 the same 
overall number of changes is seen because 
the three responses which changed back were 
exactly counterbalanced by three new changes 
subsequent to 1952. 
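
The bookkeeping in this paragraph can be restated as a short check. The sketch below is illustrative only (the variable names are assumptions); it uses the counts given above and the 71% agreement of primary responses reported for both the 1927-1952 and 1927-1960 comparisons.

```python
# Counts stated in the text for the 100 K-R stimuli (names are illustrative).
changed_1927_to_1952 = 29
stayed_changed = 17   # kept the 1952 response in 1960
changed_again = 9     # moved to a third response by 1960
changed_back = 3      # returned to the 1927 response by 1960
new_after_1952 = 3    # first changed between 1952 and 1960

# The three fates of the 29 early changes must account for all of them.
accounted_for = stayed_changed + changed_again + changed_back

# Items whose 1960 primary response differs from the 1927 primary response:
# the early changes, minus those that reverted, plus the new late changes.
differ_1927_vs_1960 = changed_1927_to_1952 - changed_back + new_after_1952

# Identical primary responses implied for the 1927-1960 comparison.
identical_1927_vs_1960 = 100 - differ_1927_vs_1960

print(accounted_for, differ_1927_vs_1960, identical_1927_vs_1960)
```

The result, 71 identical primary responses in each comparison with 1927, agrees with the diagonal entries of Tables 1 and 2.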

From 1952 to 1960 only 15 changes are 
seen; 9 of these were responses which had 
changed during the first period also. Of the 
71 items which did not change from 1927 to 
1952, only 3 changed in the subsequent 


TABLE 5 


STIMULI AND THEIR PRIMARY RESPONSES FOR PAIRS WHICH WERE DIFFERENT
IN THE THREE MINNESOTA NORMS


Response 
Stimulus 
1927 % 1952 % 1960 % 
Changed from 1927 to 1952 
BATH WATER 23 CLEAN 31 CLEAN 35 
BEAUTIFUL PRETTY 19 UGLY 21 UGLY 17 
BIBLE BOOK 29 GOD 23 GOD 32
BLUE COLOR 24 SKY 17 SKY 25 
BUTTERFLY INSECT 20 MOTH 14 MOTH 13 
DOCTOR SICK 15 NURSE 24 NURSE 17 
JOY HAPPINESS 16 HAPPY 21 HAPPY 26
LION ANIMAL 23 TIGER 26 TIGER 22 
MUTTON SHEEP 34 LAMB 36 LAMB 32 
RED COLOR 23 WHITE 22 WHITE 16 
SHORT LONG 33 TALL 39 TALL 41 
SICKNESS ILLNESS 19 HEALTH 37 HEALTH 37 
SLEEP REST 25 BED 24 BED 21 
STOVE HEAT 25 HOT 23 HOT 23 
THIEF ROBBER 22 STEAL 28 STEAL 26 
TROUBLE SORROW 8 BAD 9 BAD 11 
RKING LABOR 11 15 
Changed from 1927 to 1952 to 1960
BABY CHILD 18 BOY 12 
CABBAGE VEGETABLE 40 HEAD ie ence 14 
CHEESE FOOD 12 CRACKERS 11 MOUSE 12 
COMFORT EASE 13 CHAIR 12 SOFT 10 
EARTH GROUND 17 ROUND 13 DIRT 14 
JUSTICE JUDGE 14 PEACE 25 LAW 24 
SHEEP ANIMAL 18 WOOL 20 LAMB 21 
aa NOISE 17 STOP 13 TRAIN 11 
DESIRE 
Changed from 1927 to 1952 and back in 1960
CITIZEN MAN 21 U.S. 11 MAN 21
WINDOW GLASS 26 DOOR 19 GLASS 22
YELLOW COLOR 29 BLUE 15 COLOR 9
Changed from 1952 to 1960 
RELIGION CHURCH 30 CHURCH 3 24 
SOLDIER ARMY 14 ARMY n ore 17 
STREET AVENUE 16 AVENUE 19 car(s) 16 


TABLE 6


AVERAGE FREQUENCY OF THE THREE MOST POPULAR
RESPONSES TO 100 STIMULI IN THE
SEVEN NORMATIVE STUDIES


Response rank | 1910 | 1925 | 1927 | 1933 | 1942 | 1952 | 1960
First | 260 | 338 | 289 | 357 | 382 | 375 | 348
Second | 122 | 134 | 121 | 125 | 128 | 136 | 130
Third | 77 | 80 | 79 | 78 | 78 | 80 | 76
Total | 459 | 552 | 489 | 560 | 588 | 591 | 554


Note.—All frequencies expressed as frequencies per 1,000.


period, while of the 29 items which had
changed, 12 were observed to change again.
The most general description of the change
data is one of instability in a small set of
relatively weak responses with some evidence
for the trend described by Jenkins and
Russell (1960) away from superordinate and
synonym response classes.


General Norm Comparisons 


To the norms which were studied in the
earlier article—Norm 1910, Kent and Rosanoff
(1910); Norm 1925, O’Connor (1928);
Norm 1927, Schellenberg (1930); Norm 1933,
Keene (1951); and Norm 1952, Russell and
Jenkins (1954)—were added the Palermo and
Jenkins (1964) norms (Norm 1960) and
those of Wilson (1942). The Wilson norms
were based on a smaller sample than the
others (150 male and 150 female native white
students at the Compton Junior College,
tration. While these two differences from the 
later college norms may make the comparisons 
hazardous, it was felt that it would be valu- 
able to include some data from the 1940 
decade in the interest of obtaining the most 
complete picture possible. 

Frequency of popular responses. Table 6
lists the frequencies of the most popular
responses (first, second, and third rank) for
the seven norm groups. The average frequency
of the third response may be seen to
be remarkably stable, varying by a frequency
of only 4 per 1,000 over the entire period.
The first two ranks vary considerably, however,
and the total over the three ranks shows
the irregular increase noted in the earlier
study. The 1942 norm fits the series relatively
well. The 1960 data suggest a slight decrease 
from the high point reached in 1952. This 
decrease receives further weight from the 
fact that the 1960 sample contained a higher 
proportion of females than the 1952 sample 
(50% as opposed to 40%). Females typically
give higher proportions of popular responses 
than males (see Palermo & Jenkins, in press; 
Schellenberg, 1930). If there had been no 
change in the general level of popular re- 
sponding, one would expect the 1960 data to 
show an increase on the basis of the greater 
representation of females. The decrease, 
therefore, is relatively impressive. The irregular
nature of the figures and the absence
of a clear rationale for accounting for the
variation in frequency make these findings
primarily of descriptive interest. It remains
to be seen whether a real reversal or stabiliza- 
tion of the generally increasing trend of 
popular responding is taking place or not. 
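
As a quick check on the figures discussed in this section, the sketch below re-enters the rows of Table 6 by hand and recomputes the totals, the spread of the third-rank frequencies, and the year at which the three-rank total peaks. It is an illustration only; the variable names are assumptions.

```python
# Table 6: average frequency per 1,000 of the three most popular responses.
# Values are copied from the table; variable names are illustrative.
years = [1910, 1925, 1927, 1933, 1942, 1952, 1960]
first = [260, 338, 289, 357, 382, 375, 348]
second = [122, 134, 121, 125, 128, 136, 130]
third = [77, 80, 79, 78, 78, 80, 76]
printed_totals = [459, 552, 489, 560, 588, 591, 554]

# Recompute the bottom row from the three rank rows.
computed_totals = [a + b + c for a, b, c in zip(first, second, third)]

# Spread of the third-rank frequency over the whole period.
third_range = max(third) - min(third)

# Year with the highest total frequency of popular responses.
peak_year = years[computed_totals.index(max(computed_totals))]

# Change in the total from the 1952 norms to the 1960 norms.
change_1952_to_1960 = computed_totals[-1] - computed_totals[-2]

print(computed_totals == printed_totals, third_range, peak_year,
      change_1952_to_1960)
```

The recomputed totals match the printed row, the third rank varies by 4 per 1,000, and the total peaks in 1952 before falling by 37 per 1,000 in 1960.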
Identity of words used as responses. Table 
7 gives the percentage of identical responses 
at each rank level for each pairing of the 
normative groups. At all ranks it is clear that 


TABLE 7

PERCENTAGE OF IDENTICAL RESPONSES IN THE
SEVEN NORMATIVE STUDIES

Response rank                         Norm
and norm        1910 | 1925 | 1927 | 1933 | 1942 | 1952 | 1960
Primary
  1910            —  |  81  |  79  |  74  |  66  |  58  |  61
  1925           81  |   —  |  83  |  79  |  74  |  65  |  66
  1927           79  |  83  |   —  |  80  |  73  |  71  |  71
  1933           74  |  79  |  80  |   —  |  83  |  73  |  76
  1942           66  |  74  |  73  |  83  |   —  |  73  |  73
  1952           58  |  65  |  71  |  73  |  73  |   —  |  85
  1960           61  |  66  |  71  |  76  |  73  |  85  |   —
Secondary
  1910            —  |  53  |  44  |  26  |  22  |  26  |  26
  1925           53  |   —  |  48  |  47  |  34  |  34  |  30
  1927           44  |  48  |   —  |  43  |  34  |  36  |  32
  1933           26  |  47  |  43  |   —  |  51  |  46  |  43
  1942           22  |  34  |  34  |  51  |   —  |  44  |  41
  1952           26  |  34  |  36  |  46  |  44  |   —  |  61
  1960           26  |  30  |  32  |  43  |  41  |  61  |   —
Tertiary
  1910            —  |  36  |  20  |  19  |  15  |  10  |  17
  1925           36  |   —  |  32  |  29  |  19  |  19  |  22
  1927           20  |  32  |   —  |  27  |  21  |  21  |  27
  1933           19  |  29  |  27  |   —  |  36  |  29  |  30
  1942           15  |  19  |  21  |  36  |   —  |  28  |  27
  1952           10  |  19  |  21  |  29  |  28  |   —  |  44
  1960           17  |  22  |  27  |  30  |  27  |  44  |   —

TABLE 8

NUMBER OF WORDS DIFFERENT IN THE
FIRST THREE RESPONSES

Norm   | 1910 | 1925 | 1927 | 1933 | 1942 | 1952 | 1960
1910   |   —  |   69 |   79 |   94 |  111 |  123 |  115
1925   |   69 |   —  |   66 |   71 |   88 |   95 |   97
1927   |   79 |   66 |   —  |   88 |  100 |   91 |   92
1933   |   94 |   71 |   88 |   —  |   66 |   77 |   77
1942   |  111 |   88 |  100 |   66 |   —  |   73 |   77
1952   |  123 |   95 |   91 |   77 |   73 |   —  |   50
1960   |  115 |   97 |   92 |   77 |   77 |   50 |   —


Note.—Basis: 300 words.


the hypothesis of a steady change over time 
is acceptable. There are minor variations in 
the pattern but, in general, each norm group 
shows the most overlap with the norm group 
which is closest chronologically; values in 
the table decrease regularly as one departs 
from the diagonal. Again, in spite of the 
smaller number of cases, the 1942 norms fit 
relatively well in the sequence. The 1960 
norms likewise show good agreement with 
the trend. 
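
The claim that each norm group overlaps most with its chronological neighbor can be checked directly against the primary-response rows of Table 7. The following is only a minimal sketch; the names `PRIMARY`, `best_match`, and `adjacent_years` are illustrative and not part of the original analysis.

```python
YEARS = [1910, 1925, 1927, 1933, 1942, 1952, 1960]

# Primary-response rows of Table 7 (percentage of identical responses).
# None marks the diagonal, where a norm would be compared with itself.
PRIMARY = [
    [None, 81, 79, 74, 66, 58, 61],
    [81, None, 83, 79, 74, 65, 66],
    [79, 83, None, 80, 73, 71, 71],
    [74, 79, 80, None, 83, 73, 76],
    [66, 74, 73, 83, None, 73, 73],
    [58, 65, 71, 73, 73, None, 85],
    [61, 66, 71, 76, 73, 85, None],
]


def best_match(matrix, years):
    """Map each norm year to the other norm year with the highest overlap."""
    result = {}
    for i, year in enumerate(years):
        candidates = [
            (matrix[i][j], years[j]) for j in range(len(years)) if j != i
        ]
        result[year] = max(candidates)[1]
    return result


def adjacent_years(years):
    """Map each norm year to the set of its chronological neighbors."""
    result = {}
    for i, year in enumerate(years):
        neighbors = set()
        if i > 0:
            neighbors.add(years[i - 1])
        if i < len(years) - 1:
            neighbors.add(years[i + 1])
        result[year] = neighbors
    return result


matches = best_match(PRIMARY, YEARS)
neighbors = adjacent_years(YEARS)
all_adjacent = all(matches[y] in neighbors[y] for y in YEARS)
```

For the primary responses, every norm's closest match is one of its immediate chronological neighbors, which is the pattern the text describes.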

Table 8 presents the other side of the con- 
sistency picture, showing how many response 
words are found to be different in the com- 
parison of the first three responses to each 
of the 100 stimuli. With three relatively minor 
reversals, the data are congruent with the 
hypothesis of progressive change. If the lack 
of independence involved in multiple-pair 
comparisons is ignored, and number of differ- 
ent words is plotted as a function of the 
difference in years between the norms, a 
surprisingly regular increasing function may 
be obtained. 
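
The relation between chronological distance and number of different words can be sketched from the entries of Table 8. As the text notes, the 21 pairs are not independent, so the correlation computed here is descriptive only; the function name `pearson_r` and the dictionary layout are assumptions made for this illustration.

```python
# Upper triangle of Table 8: number of different words among the first
# three responses (basis: 300 words), keyed by the pair of norm years.
DIFFERENT_WORDS = {
    (1910, 1925): 69, (1910, 1927): 79, (1910, 1933): 94,
    (1910, 1942): 111, (1910, 1952): 123, (1910, 1960): 115,
    (1925, 1927): 66, (1925, 1933): 71, (1925, 1942): 88,
    (1925, 1952): 95, (1925, 1960): 97,
    (1927, 1933): 88, (1927, 1942): 100, (1927, 1952): 91,
    (1927, 1960): 92,
    (1933, 1942): 66, (1933, 1952): 77, (1933, 1960): 77,
    (1942, 1952): 73, (1942, 1960): 77,
    (1952, 1960): 50,
}


def pearson_r(xs, ys):
    """Plain product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


# One (gap in years, number of different words) point per pair of norms.
points = sorted(
    (later - earlier, count)
    for (earlier, later), count in DIFFERENT_WORDS.items()
)
gaps = [gap for gap, _ in points]
counts = [count for _, count in points]
r = pearson_r(gaps, counts)
```

Plotting `counts` against `gaps` gives the increasing function referred to above; the three reversals show up as local dips in an otherwise rising scatter.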

Considering all the findings with respect to
the identity of response words, it seems safe
to conclude that the earlier assertion con-
tinues to be supported: the major determi-
nant of response overlap in word-association
norms is the time interval between collections
of data.

Use of superordinates. The same criterion
was used for the identification of a super- 
ordinate response as that used in the earlier 
article, namely, that the word was given by 
a majority of college subjects when complet-
ing the sentence, “[K-R stimulus word] is
a member of the class of ______.” Thirty-nine
of the 100 stimulus words were found to have 
a superordinate by this criterion. A list of 
the 39 superordinates so identified is given in 
Jenkins and Russell (1960) and repeated in 
Palermo and Jenkins (1962). Occurrences of 
these 39 superordinates as responses were
tabulated for each of the complete norms.
For the 1927, 1933, and 1942 norms, where
complete tables of responses were not avail- 
able to us, the tabulation was necessarily 
restricted to the superordinates which oc- 
curred in those norms as one of the first 
three ranking responses. Accordingly, separate 
tabulations were made of the 26 superordi- 
nates occurring in the 1927 norms, the 18 
superordinates occurring in the 1933 norms,
and the 14 superordinates occurring in the 
1942 norms. (The decreasing frequency of 
appearance of the superordinate responses 
among the first three responses can be im- 
mediately noted.) Table 9 reports the re- 
sults of these tabulations. It is clear that 
there is a steady drop in the frequency of
usage of superordinate responses from 1910 
to 1952 no matter what subgroup of super- 
ordinates is considered. From 1952 to 1960, 
however, little change is seen, and what 
change there is, is counter to the long-term 


TABLE 9

USAGE OF SUPERORDINATES AS RESPONSES, EXPRESSED
AS THE AVERAGE PERCENTAGE OF SUBJECTS USING
GIVEN SUPERORDINATE RESPONSES

                                            Percentage of use
Norm                              1910 | 1925 | 1927 | 1933 | 1942 | 1952 | 1960
39 superordinates                 17.5 | 15.
26 superordinates (1927 norms)    24.7 | 22.
18 superordinates (1933 norms)    31.1 | 28.
14 superordinates (1942 norms)    32.1 | 31.9 | 30.8 | 25.6 | 26.8 | 23.3 | 23.2



trend. As with other trends apparent in these 
data, the 1960 norms suggest that a plateau
may have been reached, or that there is
actually a reversal in the trend. 


Discussion 


The addition of two independent word-
association norms to the five previously
studied (Jenkins & Russell, 1960) furnishes
support for the generalizations drawn from
that study. What was uncertain (the ir-
regularly increasing frequency of popular
responses) is found again to be uncertain and
the suggestion of technique or methodology
effects remains viable. The 1942 norm, which
was individual-oral, has the highest fre-
quency of primary response yet observed and
caps the generally high level achieved by
other norms (1910, 1925, and 1933) collected
in the same manner. The 1960 norms, ad-
ministered in the same fashion as the 1952
norms and to the same nominal population,
show a slight decrease compared to those
norms. The hypothesis of increasing fre-
quency of popular responses over time must
be regarded with some suspicion.

The most salient findings of the earlier 
comparison, the change in the response words 
as a function of the chronological distance 
between normative collections and the supe- 
rior stability of strong associations, are given 
increasing support and must be considered 
to be well-confirmed at this point.

The decrease in the frequency of super- 
ordinate responses has again been shown with 
the 1942 data falling nicely in place, but 
there is a suggestion that the trend may have 
run its course in that no further decreases
have been observed in the period from 1952 
to 1960. 

If one is to use the association test for
individual measurement, it appears necessary
to restandardize tests from time to time or to
restrict scoring to high-frequency responses
which have been shown here to change most
slowly. From the point of view of social and
linguistic change, it appears important to
obtain further normative data periodically 
to continue the evaluation of the trends which 
we have thus far observed and to attempt 
to tease out their determinants. 


REFERENCES 


Jenkins, J. J., & Russell, W. A. Systematic changes
in word association norms: 1910-1952. Journal
of Abnormal and Social Psychology, 1960, 60,
293-304.

Keene, C. M. Commonality of response on a word
association test: A study of standardization pro-
cedures and an attempt to forecast moderate
emotional maladjustment. Unpublished doctoral
dissertation, Stanford University, 1951.

Kent, Grace H., & Rosanoff, A. J. A study of
association in insanity. American Journal of In-
sanity, 1910, 67, 37-96, 317-390.

O'Connor, J. Born that way. Baltimore: Williams
& Wilkins, 1928.

Palermo, D. S., & Jenkins, J. J. Superordinates,
“maturity” and logical analyses of language.
Psychological Reports, 1962, 10, 437-438.

Palermo, D. S., & Jenkins, J. J. Word association
norms: Grade school through college. Minneapolis:
Univer. Minnesota Press, 1964.

Palermo, D. S., & Jenkins, J. J. Sex differences in
word association. Journal of General Psychology,
in press.

Russell, W. A., & Jenkins, J. J. The complete
Minnesota norms for responses to 100 words
from the Kent-Rosanoff Word Association Test.
Technical Report No. 11, 1954, University of
Minnesota, Contract N8 onr 66216, Office of
Naval Research.

Schellenberg, P. E. A group free-association test
for college students. Unpublished doctoral dis-
sertation, University of Minnesota, 1930.

Wilson, D. P. An extension and evaluation of
association word lists. Unpublished doctoral dis-
sertation, University of Southern California, 1942.


(Received August 9, 1963) 


Journal of Personality and Social Psychology 
1965, Vol. 1, No. 4, 310-318 


COMMUNICATION OF BODY COMPLAINTS AND PARANOID
SYMPTOM CHANGE UNDER CONDITIONS OF
PHENOTHIAZINE TREATMENT ¹


NORBERT FREEDMAN, DAVID ENGELHARDT, DAVID MANN,
REUBEN MARGOLIS, AND SHARON LONDON


Downstate Medical Center, State University of New York 


160 schizophrenic outpatients randomly assigned to chlorpromazine, promazine, 
or placebo treatments were studied. A method of coding the frequency of 
treatment emergent somatic complaints (TEF) from psychiatric chart notes 
was devised. The study sought to determine: (a) the association between such 
complaints and the 3 treatments, and (b) the association between frequency 
of TEF and reduction in paranoid symptoms in a high paranoid subsample 
receiving drug treatment only (N=39). Both active drugs could be differ- 
entiated from placebo in terms of frequency of TEF (p <.01), and the com- 
munication of such complaints was positively associated with reduction in 
paranoid thought (p <.05). Drugs appear to facilitate a form of communica-
tion which in turn is a response leading to symptom change. The communica-
tion of somatic complaints is viewed as a cognitive process interfering with
paranoid ideation.


The alteration and reduction of paranoid 
ideation with drug, especially phenothiazine 
treatment, has been repeatedly reported (for 
example, Casey, Lasky, Klett, & Hollister, 
1960). Such clinical observations offer a chal- 
lenge to a psychological conception of the 
paranoid symptom. Waelder (1951) in his
discussion of the structure of paranoid ideas 
has emphasized the stubborn and persistent 
characteristic of these thoughts and their in- 
accessibility to influence. How can drug treat- 
ment in a relatively short period of time be 
capable of altering such a complex response 
pattern? 

The present study offers data which are
consistent with a psychological interpretation
of the reduction of paranoid symptomatol-
ogy during psychopharmacologic treatment.
The view, described elsewhere (Freedman,
1963), holds that drug-associated modifi-
cation of paranoid symptoms in schizo-
phrenic patients may be described in terms
of two relatively independent processes:
the patient’s initial adaptation, his response
set to treatment; and the communication
about body experiences in the course of
treatment. The emphasis in this paper will
be on the second of these processes.

¹ This study stems from a Project of the Psycho-
pharmacological Research Unit, Department of Psy-
chiatry, Downstate Medical Center, and is supported
in part from Grant MY 1983 from the National
Institute of Mental Health, United States Public
Health Service. The authors wish to express their
gratitude to Michael Brecher for his significant con-
tribution in the development and analysis of the
scoring system for subjective somatic complaints.

Altered body feelings have frequently been
implicated as an important phenomenon in
schizophrenia, particularly in the development
of secondary symptomatology (Szasz, 1957).
Moreover, alterations in body feelings, either
externally induced (Freeman, Cameron, &
McGhee, 1958) or resulting from physical
illness (Mayer-Gross, Slater, & Roth, 1954),
have been reported to be correlated with
reduction in the manifest psychotic state.
Such observations render plausible the general
proposition that altered body feelings may
represent a class of events which affect the
psychotic state. There are a number of
formulations based upon psychoanalytic
theory leading to the prediction that an in-
crease of focusing and attention on body
feelings and the communication of such feel-
ings would reduce paranoid ideation. Federn
(1952) has described the schizophrenic pa-
tient in terms of dysfunctions in ego bound-
aries manifesting themselves in the form of
a breakdown in body-boundary cathexis.
When the patient’s attention is directed onto
his own body we may witness a reorganiza-
tion and redefinition of his body-ego bounda-
ries, with the result that the communication
with other objects, for example, a therapist,
becomes less autistic and more reality ori-
ented. These observations have not made the
explicit point that it may be the communica-
tion of the body experiences to another person
which is instrumental in symptom change.

A psychological interpretation of drug ef- 
fects requires that one identify a class of 
drug-associated responses which also may act 
as a stimulus altering the psychotic state. 
Psychiatric treatment involving the adminis- 
tration of pharmacological agents facilitates 
the emergence of body-oriented communica- 
tions. To the authors’ knowledge there is no 
“psychoactive” agent free from such subjec- 
tive reactions. These somatic complaints are 
usually described as irritants and are listed 
in clinical reports as undesirable side effects 
of the drug. In this study, however, these 
subjective somatic complaints are conceptu- 
alized as a form of communication which 
critically affects the clinical interaction and 
thus may be instrumental in the reduction 
of paranoid ideation. More specifically, this 
point of view implies two major propositions: 
that ataractic medication is associated with 
a greater amount of verbalization about 
bodily complaints in comparison with placebo 
treatment; and that this type of verbalization 
is associated with reduction of paranoid 
ideation independent of type of treatment 
administered (for example, drug). 

The demonstration that drug treatment
increases the probability of eliciting subjec-
tive somatic complaints presents a number
of methodological issues. In particular, if these
complaints are to be attributed to the treat-
ment it becomes important to differentiate
treatment emergent somatic complaints from
the kinds of discomfort verbalized prior to
the onset of treatment. Second, it also be-
comes important to determine the frequency
of complaints as they emerge visit by visit.
For example, the failure of one investigation
to demonstrate drug-associated subjective
complaints (Michaux, 1961) may be at-
tributable to such methodological considera-
tions. These methodological aspects of assess-
ing subjective somatic complaints become
critical in the direct test of the association
between complaints and paranoid symptom
change. Subjective somatic complaints, the
independent variable as assessed here, is a
treatment process variable; that is, it involves
the communication between the patient and
the psychiatrist. It is the repeated communi-
cation of discomfort visit by visit which
may be implicated in the reduction of autistic
communication.

The purpose of this report, then, is (a) to
delineate the association of repeated treat-
ment emergent somatic experiences and
ataractic therapy among schizophrenic out-
patients, and (b) to delineate the role of these
repeated body complaints, alone and in inter-
action with the initial treatment response, in
altering paranoid thought.


PROCEDURE 


Subjects and Treatment Regimen 


Subjects utilized in the study of drug-associated 
subjective somatic complaints were the first 160 
admissions to the State University of New York 
Psychopharmacology Clinic who had completed 3 
months of treatment. All subjects had a primary 
diagnosis of chronic schizophrenia, with subdiag- 
noses of “chronic undifferentiated” or “paranoid” 
schizophrenia. The age range was from 19 to 44
years. Socioeconomic status was low, represented by 
Categories 4 and 5 of the Hollingshead classification 
(Redlich & Hollingshead, 1958). The 160 patients 
were randomly assigned to three treatment groups: 
54 of the patients received chlorpromazine, 56 
promazine, and 50 placebo. 

A subsample of 54 patients drawn from the larger
sample was utilized in the study of the association
between communication of somatic experiences and
change in paranoid thought; 17, 22, and 15 of these
patients received chlorpromazine, promazine, and
placebo treatment, respectively. These were patients
who had completed 3 months of treatment and were
identified as high in paranoid symptomatology at
intake.

The treatment regimen for all patients consisted of 
the double-blind administration of chlorpromazine,
promazine, or placebo. Treatment sessions comprised 
a 20-30 minute visit with a psychiatrist, where the 
emphasis was on a discussion of the reality problems 
in the patient’s life. The double-blind drug adminis- 
tration was maintained with provisions for flexible 
dosage administration; mean daily dosage level for 
patients on active drug was 179.6 milligrams with a 
range from 100 to 400 milligrams. Several checks 
were employed to ascertain that patients were in fact 
taking their medication as prescribed (reports by 
the patient, members of the family, return of 
remaining capsules, and periodic urine examinations). 
Patients were seen weekly for the first month of 
treatment and biweekly thereafter. 

TABLE 1

LIST OF CATEGORIES OF SOMATIC COMPLAINTS AND CODING EXAMPLES

Complaint category                             Coding examples

Complaints about:
 1. Dysfunction of energy                      Sleepiness, drowsiness, tiredness, grogginess
 2. Body weakness                              Body weakness, listless, lethargy
 3. Headaches                                  Headaches
 4. Dizziness and fainting                     Dizziness, fainting, falls to ground
 5. Insomnia                                   Insomnia
 6. Gastrointestinal disturbances              Nausea, vomiting, indigestion, heartburn
 7. Anorexia                                   Anorexia
 8. Specific organ discomfort: A               Heart palpitations, chest pain, difficulty in breathing, back pain
 9. Specific organ discomfort: B               Stomach, lower digestive tract and anus
10. Specific organ discomfort: C               Limbs, eye, ear, nose, tooth, face flushed, neck
11. Excess perspiration                        Excess perspiration
12. Dermatological discomfort                  Skin rash, itching and soreness, bruises, boils
13. Oral discomfort                            Dry mouth, dry throat, related mouth and throat complaints
14. Muscular incoordination                    Tremors, shakes, twitches, tics
15. Dysfunction of urination and defecation    Constipation, diarrhea, increased urination, neurogenic bladder,
                                               amenorrhea


Assessment Methods


Three major assessment procedures are relevant
to the present investigation: the assessment of
somatic complaints, the initial treatment response,
and paranoid symptom level.

Assessment of somatic complaints. The source for
the assessment of somatic complaints was the psy-
chiatrists’ chart dictations. The psychiatrists dictated,
after each session, their clinical impressions of the
patients’ status as well as a record of the patients’
complaints, particularly those involving bodily func-
tion. Thus, the present measure of somatic complaints
represents those complaints which the patient com-
municated and the psychiatrist recorded.

The psychiatric charts of the patients were then 
coded by experimenters who did not have knowl- 
edge of the medications the 


a separate sample,
verbatim from the psychiatric charts and were clas-
sified into the 15 categories presented in Table 1.
Next the charts of 


Pretreatment somatic complaints—that is, the dif- 
ferent kinds of complaints (represented by the 15 
categories in Table 1) during the first two visits 
(prior to drug administration), 


Treatment observed somatic complaints—that is, 
the different 


1. Treatment recurrent somatic complaints refer to 
the different kinds of complaints reported after the 
administration of drug which were also present 
during the pretreatment period. 

2. Treatment emergent complaints refer to the dif- 
ferent kinds of complaints not present during the 
pretreatment visits, and reported only after the 
administration of drug.

These indices of somatic complaints refer to dif- 
ferent kinds of complaints over the span of treat- 
ment. They do not reflect the frequency with which 
any of these complaints appeared during the treat- 
ment period. Frequency scores were obtained which 
reflected the number of times the different kinds 
of complaints were communicated during the drug- 
treatment period. These frequency scores could of 
course express the frequency of treatment observed, 
treatment recurrent, and treatment emergent com- 
plaints. The treatment emergent frequency (TEF)
score is the critical measure under consideration
here, reflecting the number of different emergent
complaints and their frequency of communication
during the span of treatment. The scoring reliability
of the TEF score was determined by the independent
coding by two judges of 20 cases of the 160. The
rank-order correlation coefficient was .91.
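
The scoring rules just described can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: each visit is represented as a set of Table 1 category numbers, the TEF score is taken to be the total number of mentions of emergent categories summed over treatment visits, and the function names are hypothetical. The rank-order coefficient is computed as a product-moment correlation on average ranks.

```python
def classify_complaints(pretreatment, treatment_visits):
    """Split the kinds of complaints seen during treatment.

    pretreatment: set of category numbers (1-15) coded in the first two visits.
    treatment_visits: list of sets, one per visit after drug administration.
    Returns (observed, recurrent, emergent) sets of category numbers.
    """
    observed = set()
    for visit in treatment_visits:
        observed |= visit
    recurrent = observed & pretreatment
    emergent = observed - pretreatment
    return observed, recurrent, emergent


def tef_score(pretreatment, treatment_visits):
    """Assumed TEF: mentions of emergent categories, counted visit by visit."""
    return sum(len(visit - pretreatment) for visit in treatment_visits)


def average_ranks(values):
    """Rank values from 1 upward, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def rank_order_correlation(x, y):
    """Spearman coefficient: Pearson correlation computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical patient: headaches (3) and insomnia (5) before treatment;
# drowsiness (1) and nausea (6) appear only after the drug is introduced.
pre = {3, 5}
visits = [{3, 1}, {1, 6}, {1}]
observed, recurrent, emergent = classify_complaints(pre, visits)
score = tef_score(pre, visits)
```

For the reliability check, the TEF scores assigned by the two judges to the same 20 cases would be passed to `rank_order_correlation` as two lists.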

Assessment of initial treatment response. The
initial response refers to the modification of pre-
senting symptomatology during the first three
treatment sessions. Since the interest is on modifica-
tion of paranoid symptomatology, the evaluation of
initial response was limited to the patients in the
high paranoid subsample. It was noted that 54% of
the patients in the subsample showed some reduction
or blunting in the intensity of their paranoid symp-
toms, either upon the second visit to the clinic
(before drug was introduced), or within the first
2 weeks of drug administration. This response took
place at a time when it was not as yet clinically
meaningful to speak of a drug effect. It was found
that initial response was not associated with ataractic
drug administration. The observation that initial
response was associated with a high rating of co-
operativeness by the psychiatrists suggests that it
reflects a favorable treatment set. Furthermore, the
response was found to be related to a number of
patient characteristics assessed at intake: the pa-
tients’ style of social interaction reported by rela-
tives, the patients’ cognitive organization and mode
of self-representation in childhood memories (Freed-
man, 1963). Thus, the initial adaptation to treatment
of a paranoid patient appears to reflect a signifi-
cant aspect of the patients’ cognitive and social
functioning.

Assessment of paranoid symptom level and its 
alteration. For the assessment of paranoid thought 
the mean of a cluster of items derived from a larger 
psychiatric rating scale was utilized. The four symp- 
toms composing the paranoid cluster, ideas of refer- 
ence, influence, persecution, and general suspicious- 
ness, were rated on a 4-point scale of symptom 
intensity at intake, and after 3 months of treatment. 
Patients in the high paranoid subsample received 
ratings of at least moderate intensity on two or 
more of the above symptoms at the intake evalua-
tion. Change in paranoid thought was measured by
the difference between the mean cluster score at 
intake and the 3-month mean cluster score, a nega- 
tive score signifying a reduction in paranoid 
symptom level. 
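
The change measure can be written out as a short sketch. The numeric coding of the 4-point scale is not given in the text; the example below assumes ratings coded 1 to 4 with 3 standing for "moderate," and both that coding and the function names are hypothetical.

```python
SYMPTOMS = ("reference", "influence", "persecution", "suspiciousness")


def cluster_mean(ratings):
    """Mean rating over the four symptoms of the paranoid cluster."""
    return sum(ratings[s] for s in SYMPTOMS) / len(SYMPTOMS)


def change_score(intake, month3):
    """3-month cluster mean minus intake cluster mean.

    A negative value signifies a reduction in paranoid symptom level.
    """
    return cluster_mean(month3) - cluster_mean(intake)


def is_high_paranoid(intake, moderate=3):
    """At least moderate intensity on two or more symptoms at intake.

    The cutoff value for 'moderate' is an assumption of this sketch.
    """
    return sum(1 for s in SYMPTOMS if intake[s] >= moderate) >= 2


# Hypothetical ratings for one patient.
intake = {"reference": 3, "influence": 3, "persecution": 2, "suspiciousness": 4}
month3 = {"reference": 2, "influence": 2, "persecution": 2, "suspiciousness": 3}
change = change_score(intake, month3)
```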

An attempt was made to determine whether such
changes in rated symptom severity would be reflected
by observed changes outside the patient-psychiatrist
interaction, specifically by the report of the patient’s
social behavior derived from interviews with relatives
and psychological test performance. Intercorrelation
of change scores for a sample of 165 patients showed
significant positive relationships between the criterion
measure (reduction in paranoid thinking) and the
relative’s report of reduction in socially dysfunctional
behavior, as well as reduction in “perseveration” as
reflected by performance on the Cattell Perseveration
Test (Cattell, 1935); reduction in paranoid thought
was also correlated with changes in Rorschach
performance (reduction in number of responses) and
reduction in tapping speed. The meaning and struc-
ture of changes in paranoid thought, as defined by
these independent observations, represents an ongoing
research task.


RESULTS 


Drug Administration and Subjective Somatic 
Complaints 


In the study of the association between drug 
administration and subjective complaints the 
somatic complaints were analyzed both in 
terms of different kinds of complaints and 
in terms of their frequency of occurrence 
during treatment regardless of kind. When
only kinds of complaints were studied, an 
average of two to three different treatment 
observed complaints was found both for 
placebo and active drug treatments. The 
analysis of kinds of complaints as to whether 
they were treatment emergent or treatment 
recurrent did not yield any significant dif- 
ferences between the treatment groups. These 
data suggest that the response to active drug 
cannot be distinguished from the response 
to placebo when only different kinds of 
complaints during treatment are evaluated. 

When the notion of frequency of com- 
plaints (that is, the repetition of complaints 
session by session over the 3 months of 
therapy) is taken into consideration the 
findings become more striking. Table 2 pre- 
sents the mean frequency of somatic com- 
plaints regardless of kind during the 3 months 
of treatment exposure for the three treatment 
conditions. Again mean incidence of treatment 
observed and treatment recurrent complaints 
for the three treatments was similar. How- 
ever, when the frequency of treatment 
emergent complaints was analyzed, a clear 


TABLE 2

MEAN FREQUENCY OF SOMATIC COMPLAINTS DURING TREATMENT EXPOSURE
FOR THREE TREATMENT CONDITIONS

                                   Treatment condition
Class of complaints      Placebo | Promazine | Chlorpromazine
Treatment observed        4.62   |           |
Treatment recurrent       2.54   |           |
Treatment emergent        2.08   |   3.54    |     4.26

t test between placebo and chlorpromazine = 4.27, p < .001.
t test between placebo and promazine = 2.96, p < .01.
t test between chlorpromazine and promazine = 1.37, p < .20.

differentiation was found between the drugs
and placebo treatments (F = 9.09, p < .01).
Means for the three treatment groups were
2.08 for placebo, 3.54 for promazine, and
4.26 for chlorpromazine; the range was from
0 to 12 treatment emergent complaints per
patient. Further t tests revealed significantly
more treatment emergent somatic complaints
for each of the drug groups when compared
to placebo (t = 4.27, p < .001 for the chlor-
promazine-placebo comparison and t = 2.96,
p < .01 for the promazine-placebo com-
parison).

A separate analysis was also carried out
for the subsample of 54 “high-paranoid”
patients. Mean treatment emergent com-
plaints for high-paranoid patients on placebo
was 2.00, for promazine 3.86, and chlor-
promazine 4.71 (F = 4.76, p < .05). The two
drug groups were differentiated from placebo
(p < .01 for chlorpromazine-placebo, and
p < .05 for promazine-placebo comparison).
The two drug groups, however, were homo-
geneous with each other. Thus, the data on
the paranoid subsample parallel closely the
findings for the larger group of 160 patients.

The data indicate that phenothiazine treat-
ment facilitates the communication of somatic
complaints relative to placebo treatment. This
relationship is only in evidence when sub-
jective complaints are evaluated in terms of
their frequency of occurrence over the span
of treatment and only when for each patient
that class of complaints is considered which
emerges in the course of treatment.


TABLE 3

MEAN CHANGES IN PARANOID SYMPTOMATOLOGY AS A
FUNCTION OF INITIAL RESPONSE AND FREQUENCY
OF EMERGENT SOMATIC COMPLAINTS (TEF) FOR
DRUG-TREATED PATIENTS ONLY (N = 39)

                      Frequency of emergent complaints
Initial response      High (4 or more)     Low (3 or less)
                      Mean     N           Mean     N
Initial responders    −.66     8           −.32     11
Nonresponders         −.58     13          +.07     7
Total                 −.61     21          −.17     18

Analysis of variance
Source                  MS       df    F
Response group (A)      .0553    1     1.15
TEF complaints (B)      .2451    1     5.33*
A X B                   .0239    1     .52
Error                   .046     35

* p < .05.


Relationship between Subjective Somatic
Complaints and Alteration in Paranoid Symp-
tomatology


Having determined that the frequency of
treatment emergent subjective complaints is
a concomitant of phenothiazine treatment we
next investigated the significance of this
response to alteration in symptomatology. If
treatment emergent complaints are involved
as significant conditions mediating clinical
change (rather than an epiphenomenon of
drug treatment), one would expect that with
drug treatment the presence of somatic com-
plaints (high TEF) should be associated with
symptom reduction and the absence of such
complaints (low TEF) should not be associated
with symptom reduction. Also, symptom reduc-
tion was deemed to vary as a function of the
initial response (that is, response set); hence
the effect of somatic complaints and initial
response (alone and jointly) on symptom
change was tested. The subsample of 39 pa-
tients characterized by a high level of para-
noid symptomatology at intake and receiving
only ataractic treatment (chlorpromazine and
promazine) over 3 months was selected for
this study. Placebo-treated patients could not
be included in this analysis since there was
not a single placebo-treated patient who
revealed high treatment emergent somatic
complaints; that is, with placebo a patient
communicated complaints infrequently, and
not in the more consistent session-by-session
manner noted with drug treatment. The
sample was stratified both according to the
patient’s TEF score (utilizing the median of
4 as a cutoff), and in terms of initial response.

² There is also a rational basis for excluding
placebo-treated patients. Having shown that drug
is one condition facilitating the induction of body
complaints, the present design is capable of demon-
strating the effect of these complaints on symptom
level only under these drug conditions. The more
general extension of the hypothesis to apply to body
communication in the absence of the drug condition
would require a separate study.

Table 3 presents mean change scores in
paranoid symptomatology at the end of 3
months of treatment as a function of initial
response and TEF scores.³ An analysis of
variance for unequal number of cases in the
subclasses (Walker & Lev, 1953, p. 381)
revealed a significant somatic complaint
(TEF) effect (F = 5.33, p < .05) but no
significant initial response or interaction ef-
fect. That is, those patients on ataractic
medication who communicated a high fre-
quency of treatment emergent complaints also
evidenced significantly greater reduction in
paranoid symptomatology, compared with
patients who did not articulate such com-
plaints. For patients who articulated a high
frequency of treatment emergent complaints
the mean decrease in paranoid symptomatol-
ogy was −.58 whereas for patients who had
a low incidence of emergent complaints the
mean change was −.17. These data suggest
that while high frequency of treatment emer-
gent complaints was associated with reduction
in paranoid symptomatology, the initial re-
sponse did not appear to play a significant
role. It is noteworthy, however, that patients
in the cell for which the least amount of
symptom reduction was expected (low TEF
score and absence of initial response) tended
to show an increase in symptomatology
(+.07); on the other hand the group of pa-
tients for whom a favorable treatment out-
come was expected (high TEF score and
presence of an initial response) revealed the
greatest reduction in paranoid symptomatol-
ogy (−.66). As would be expected the other
two cells assume an intermediate position.
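
The Total row of Table 3 follows from the four cell means once each is weighted by its number of cases, and the F ratios for the TEF and interaction effects follow from dividing each mean square by the error mean square. The sketch below only re-derives those printed figures from the table; the dictionary layout and function names are illustrative.

```python
# Cell entries of Table 3: (mean change in paranoid cluster score, N).
CELLS = {
    ("responder", "high"): (-0.66, 8),
    ("responder", "low"): (-0.32, 11),
    ("nonresponder", "high"): (-0.58, 13),
    ("nonresponder", "low"): (0.07, 7),
}


def weighted_mean_by_tef(cells, tef_level):
    """Mean change for one TEF level, weighting each cell by its N."""
    total = 0.0
    cases = 0
    for (_, level), (mean, n) in cells.items():
        if level == tef_level:
            total += mean * n
            cases += n
    return total / cases, cases


def f_ratio(ms_effect, ms_error):
    """F statistic as the ratio of effect to error mean squares."""
    return ms_effect / ms_error


high_mean, high_n = weighted_mean_by_tef(CELLS, "high")
low_mean, low_n = weighted_mean_by_tef(CELLS, "low")

# Mean squares as printed in the analysis-of-variance part of Table 3.
f_tef = f_ratio(0.2451, 0.046)
f_interaction = f_ratio(0.0239, 0.046)
```

Rounded to two decimals, the weighted means agree with the Total row of the table, and the two ratios agree with the printed F values for the TEF and interaction terms.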


Additional Analyses 


Some additional analyses were conducted 
to rule out artifacts which may possibly have 
contributed to the preceding findings. The
intake paranoid symptom ratings for the 
four patient groups just described were 
analyzed and were found to be homogeneous. 

An analysis of the dosage levels for the 
four groups was conducted to test the pos- 
sibility that frequency of somatic complaints 
was associated with dosage levels. It must 
be stressed, however, that the study used a 
flexible dosage design seeking to preserve as 


° Chlorpromazine- and promazine-treated patients 
are equally represented in the four cells of Table 3. 

TABLE 4 

MEAN DOSAGE LEVELS DURING TREATMENT BY FREQUENCY OF 
EMERGENT COMPLAINTS (TEF) AND INITIAL RESPONSE FOR 
DRUG-TREATED PATIENTS ONLY (N = 39) 

                         Frequency of emergent complaints 
                       High (4 or more)     Low (3 or less) 
Initial response        Mean       N         Mean       N 
Initial responders      141.5      8         203.0     11 
Nonresponders           168.9     13         206.5      7 

Analysis of variance 
Source                   MS      df      F 
Response group (A)       5.00     1      .48 
TEF complaints (B)      50.30     1     4.81* 
A × B                    2.00     1      .19 
Error                   10.45    35 

Note. Dosage levels given in milligrams. 
* p < .05. 


much freedom for the treating physician as 
possible. In a flexible design (unlike a strati- 
fied design where dosage level is predeter- 
mined by the experimenter), dosage level is 
not only an input but a response variable. 
Patients always started at a level of either 
100 or 150 milligrams per day and only when 
the physician felt it clinically advantageous 
would he raise the dosage on subsequent 
visits. A mean dosage level was determined 
for each patient reflecting the doses pre- 
scribed over the course of 3 months of treat- 
ment. These individual dosage levels ranged 
from 100 to 400 milligrams per day; the 
mean was 179.6 milligrams and all but one 
patient received less than 300 milligrams 
per day. 

Analysis of variance of dosage levels by 
initial response and somatic complaints pre- 
sented in Table 4 shows a significant TEF 
effect (F = 4.81, p < .05). Examination of 
the means indicates that patients having high 
treatment emergent somatic complaints (high 
TEF) received lower dosage levels than did 
patients with a low frequency of complaints 
(low TEF). In the interpretation of these 
findings, two aspects may be noted: first, 
the observation of a higher dosage level for 
low TEF patients rejects the notion that 
complaints are directly dependent upon 
dosage level, and these findings point to indi- 
vidual differences as a determinant of com- 
plaints. Second, in a flexible dosage design 
the individual patient who is not initially 
responsive is likely to receive increased doses 
by the physician. Thus, the higher dosage 
level for the low TEF patient may be viewed 
as a reflection of the physician’s added effort 
when he fails to obtain a response, be it a 
somatic complaint or symptom alteration. 
The significance of such dosage manipulation 
as a variable affecting treatment outcome 
is a problem that warrants further study. 
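
The figures in Table 4 can be cross-checked with a little arithmetic. The sketch below (Python; the variable and function names are chosen here for illustration and are not part of the study) recomputes each F ratio as the effect mean square divided by the error mean square, and forms the N-weighted mean of the four cell means, which should agree with the overall mean dosage reported in the text.

```python
# Cell means (mg/day) and cell sizes as given in Table 4.
cells = {
    ("initial responder", "high TEF"): (141.5, 8),
    ("initial responder", "low TEF"): (203.0, 11),
    ("nonresponder", "high TEF"): (168.9, 13),
    ("nonresponder", "low TEF"): (206.5, 7),
}

# Mean squares from the analysis of variance in Table 4.
mean_squares = {
    "Response group (A)": 5.00,
    "TEF complaints (B)": 50.30,
    "A x B": 2.00,
}
ms_error = 10.45


def f_ratio(ms_effect, ms_err):
    """F is the effect mean square divided by the error mean square."""
    return ms_effect / ms_err


def weighted_mean(pairs):
    """Mean of (cell mean, cell size) pairs, weighted by cell size."""
    pairs = list(pairs)
    total_n = sum(n for _, n in pairs)
    return sum(mean * n for mean, n in pairs) / total_n


f_values = {name: round(f_ratio(ms, ms_error), 2) for name, ms in mean_squares.items()}
grand_mean = weighted_mean(cells.values())
high_tef = weighted_mean(v for k, v in cells.items() if k[1] == "high TEF")
low_tef = weighted_mean(v for k, v in cells.items() if k[1] == "low TEF")

print(f_values)
print(round(grand_mean, 1), round(high_tef, 1), round(low_tef, 1))
```

The weighted grand mean comes out at 179.6 milligrams, matching the mean reported above, and the marginal mean for high TEF patients falls below that for low TEF patients, which is the direction of the significant TEF effect.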

So far we have described the role of sub- 
jective somatic complaints without reference 
to the kind of complaint most prominently 
implicated. Table 5 presents an item analysis 
of the treatment emergent complaints cover- 
ing the 15 categories of complaints listed in 
Table 1. Table 5 lists both the percentage of 
patients (within the high paranoid sub- 
sample) showing the presence of complaints 
in each category and the frequency distribu- 
tion of 162 treatment emergent complaints 
made by the 39 patients in the high paranoid 
subsample. Clearly the most prevalent cate- 
gory of complaints is “dysfunction of energy” 
(sluggishness, drowsiness, tiredness) both 
when number of patients showing the pres- 
ence of a complaint is considered and when 
the proportion of all complaints made is 
studied. 
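
The two percentage columns of Table 5 use different bases: 162 complaints for the frequency distribution and 39 cases for the patient counts. A minimal sketch of that arithmetic follows (Python; names are illustrative, and patient counts are entered only for the rows where they can be read from the table).

```python
TOTAL_COMPLAINTS = 162
TOTAL_PATIENTS = 39

# Number of complaints per category, as given in Table 5.
complaint_counts = {
    "Dysfunction of energy": 86,
    "Gastrointestinal": 14,
    "Dizziness and fainting": 13,
    "Body weakness": 8,
    "Insomnia": 7,
    "Headache": 11,
    "Miscellaneous": 23,
}

# Number of patients per category, only for rows legible in the table.
patient_counts = {
    "Dizziness and fainting": 8,
    "Body weakness": 7,
    "Insomnia": 7,
    "Headache": 6,
    "Miscellaneous": 13,
}


def percent(part, whole):
    """Percentage rounded to the nearest whole number."""
    return round(100 * part / whole)


complaint_pct = {k: percent(v, TOTAL_COMPLAINTS) for k, v in complaint_counts.items()}
patient_pct = {k: percent(v, TOTAL_PATIENTS) for k, v in patient_counts.items()}

print(complaint_pct)
print(patient_pct)
```

Because one patient can report complaints in several categories, the patient percentages need not sum to 100, whereas the complaint percentages do.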

The association between communication of 
somatic complaints and symptom change was 
further evaluated by case observations. In a 
previous publication it had been reported that 
psychotic ideation could be reduced and a 
low level sustained with phenothiazine treat- 
ment for periods up to 24 months (Engel- 
hardt, Freedman, Hankoff, Mann, & Margolis, 
1963). Five of the patients in that study 
were also in the high paranoid subsample of 
the present study. Persistent somatic com- 
plaints, especially “dysfunctions of energy” 
could be traced for at least 12 months for 
these five patients. Such observations suggest 
the possibility that sustained communication 
of somatic complaints is a necessary con- 
comitant of sustained symptom reduction. 


Discussion 


The data presented indicate that pheno- 
thiazine treatment is associated with subjec- 
tive somatic complaints, particularly when 
these were evaluated in terms of the frequency 
of treatment emergent responses. The kind of 
complaint implicated most prominently ap- 
pears to be in the area of “energy dysfunc- 
tion.” High frequency of treatment emergent 
complaints is positively associated with rated 
reduction of paranoid thought over 3 months 
of treatment. The findings are consistent with 
the hypothesis that communication of body 
complaints is a precursor to symptom 
reduction. 

In the introduction mention has been made 
of clinical observations which point to an 
association between body experiences and 
alteration in the psychotic state (Freeman et 


TABLE 5 

ANALYSIS OF TREATMENT EMERGENT COMPLAINTS BY CATEGORY OF COMPLAINT: 
FREQUENCY DISTRIBUTION OF ALL COMPLAINTS MADE AND PERCENTAGE OF 
PATIENTS SHOWING PRESENCE OF EACH COMPLAINT CATEGORY 

                          Frequency distribution        Number of patients showing 
                          of complaints                 presence of complaint category 
Complaint category        Number    % (based on         N             % (based on 
                                    162 complaints)                   39 cases) 
Dysfunction of energy       86        53                [illegible]   [illegible] 
Gastrointestinal            14         9                [illegible]   [illegible] 
Dizziness and fainting      13         8                 8            21 
Body weakness                8         5                 7            18 
Insomnia                     7         4                 7            18 
Headache                    11         7                 6            15 
Miscellaneous*              23        14                13            33 
Total                      162 


al., 1958; Mayer-Gross et al., 1954). Similar 
observations are also suggested by studies 
involving the use of pharmacological agents 
(Denber, Bente, & Rajotte, 1961; Hankoff, 
Rudorfer, & Paley, 1962). In these studies 
the definition of “body experiences” is often 
ambiguous. Body experiences have been 
defined in terms of external stimulation 
(tactile), in terms of internal physiological 
states (fever, drugs), or in terms of response 
processes (complaints). The data of the 
present study may be used to emphasize the 
importance of response processes. A psycho- 
logical interpretation of drug effects thus may 
be stated: drugs facilitate the emergence of 
a certain class of responses (somatic com- 
plaints) which in turn act as stimuli affecting 
the psychotic state. 

This general hypothesis will be discussed 
by considering the meaning of the communi- 
cation of subjective somatic complaints as 
well as the meaning of the association between 
subjective somatic complaints and symptom 
change. 


Subjective Somatic Complaints as a Variable 
Cognitive Process 

Subjective somatic complaints refer to the 
patient’s communication about his body state 
to the psychiatrist. A substrate of altered 
physiological functioning is undoubtedly a 
factor underlying the communication of the 


) would support such assumptions. 
Drugs clearly seem a necessary precondition 
facilitating somatic complaints. A significant 
cognitive component in these complaints is 
suggested by the data on dosage levels. We 
noted that patients who communicated treat- 
ment emergent complaints also tended to be 
given lower doses by the physician, whereas 
those patients complaining less were given 
higher dosage levels. While this differential 
dosage administration is viewed as reflecting a 
treatment process event (the psychiatrist’s 
frustration at not obtaining a response from 
the patient), the data also argue against 
any hypothesis which would consider higher 
somatic complaints a direct function of higher 
dosage input. The patient’s readiness to turn 
his attention upon bodily processes and to 
articulate these may be more important in 
eliciting somatic complaints than the amount 
of medication ingested. Another observation 
emphasizing the cognitive component in 
somatic complaints is indicated by the quality 
of communication among patients not voicing 
somatic discomfort. From an example such 
as “Your pill does not touch me,” one gains 
the impression, not that these patients lack 
the sensation, but that a sensation is being 
denied. 

Subjective somatic complaints may be con- 
ceptualized to represent a cognitive process 
stemming from the patient’s general propen- 
sity to communicate complaints, but varying 
with a number of internal and external 
stimulus conditions. These may include altered 
physiological activity (for example, drug) and 
situational conditions encouraging or inhibit- 
ing the complaining behavior. The degree to 
which the psychiatrist’s behavior may encour- 
age or inhibit complaints is one of the impor- 
tant, as yet undefined parameters of this 
study. 


Some Potential Mechanisms of Change in 
Paranoid Thought 


Two aspects of body-oriented communica- 
tion may be deemed to have specific thera- 
peutic properties. The first may be termed 
the interpersonal aspect. Following Szasz 
(1957) the body is used as a self-object 
through which the patient seeks the accept- 
ance of the psychiatrist. In this sense, body 
complaints may be regarded as a transference 
manifestation. Body-oriented communication, 
from this point of view, represents the estab- 
lishment of a safe relationship which facili- 
tates a reduction in the preoccupation with 
external threat objects. 

The second aspect emphasizes the patient’s 
focusing behavior and attention upon his 
bodily processes. In making body complaints 
the patient attempts to identify body cues. 
This type of self-focusing may be said to 
sharpen the awareness and distinctness of the 
body schema. From the psychoanalytic point 
of view delineated by Federn (1952), the 
more accurate identification of body proc- 
esses can be understood as a reestablishment 
of body-boundary cathexis. A dynamic shift 
may be taking place involving the definition 
of body boundaries which allows for the 
reduction in preoccupation with threat 
objects.* 

These two aspects of somatic complaining, 
the interpersonal aspect and the body- 
focusing aspect, are not mutually exclusive 
and are probably mutually reinforcing. What 
is of theoretical importance is that this com- 
munication may represent the mediating link 
necessary for the reduction of threat ideation. 
A critical test for such a psychological inter- 
pretation would involve the systematic induc- 
tion of body communication under conditions 
when drug is not used (for example, by means 
of hypnosis, suggestion, or reinforcement of 
body verbalizations during treatment) and 
tracing the effects of such communication on 
symptom level. 

A different point of view has been pre- 
sented by Ostow (1962). Ostow has defined 
the role of tranquilizers in reducing psychotic 
states in terms of depletion of “psychic 
energy.” In one respect the data of the 
present study appear to dovetail descriptively 
with Ostow’s formulation; the data presented 
have shown that one particular category of 
complaints appears to be implicated most 
strongly, namely, “dysfunctions of energy.” 
Is communication about energy depletion of 
particular relevance to the alteration of the 
paranoid state? Patients in remission from 
paranoid states often describe themselves as 
weak, washed out, and in need of a rest. What 
is at issue, however, is the meaning of energy 
depletion as an explanatory concept. The 
energy concept cannot at this point be defined 
independently of what is communicated. We 
did not observe in our study “psychic energy 
depletion,” but the perception or communica- 
tion of a state of depleted energy. Moreover, 


* These differing 
portance of defining 

we have noted that patients who lack the 
appropriate communication even though they 
received the same drug (presumably compa- 
rable central nervous system stimulation) also 
fail to show symptom reduction. It would 
appear possible that the altered paranoid 
state can be parsimoniously described by a 
behavioral model involving altered communi- 
cation which does not take recourse to an 
independent psychic energy construct. 

SOME EFFECTS OF VARIATIONS IN OTHER 
STRATEGY UPON GAME BEHAVIOR¹ 


CHARLES G. McCLINTOCK, PHILIP GALLO,² AND ALBERT A. HARRISON³ 


University of California, Santa Barbara 


Internationalist and isolationist Ss played a 2-part modification of the 
Prisoner’s-Dilemma-type game. Though they did not know it, they were actually 
playing against E, rather than each other. In the 1st part of the experiment, 
Ss had little control over the payoff matrix. In the 2nd part, Ss had virtually 
total control over the payoffs. During the 1st part of the experiment, E played 
a “cooperative” strategy against ½ of the Ss and a “competitive” strategy 
against the other ½. The results indicated that neither the strategy played 
against Ss nor the personality of Ss influenced the number of cooperative 
choices. However, a significant 3-way interaction indicated that internationalists 
are more sensitive to changes in their opponents’ strategies. 


Several experiments have been conducted 
recently to investigate the relationship be- 
tween internationalism-isolationism and game- 
playing behavior. Lutzker (1960) developed 
an internationalism scale and found that 
internationalists made more cooperative re- 
sponses in a Prisoner’s Dilemma game than 
isolationists. In a subsequent study, McClin- 
tock, Harrison, Strand, and Gallo (1963) 
varied the strategy that the internationalist 
and isolationist players encountered in a 
modified Prisoner’s Dilemma game. Unbe- 
knownst to the subjects in this experiment, a 
preprogramed sequence of “other-player” 
responses was provided by the experimenter. 
The subjects received either 85% cooperative, 
50% cooperative, or 15% cooperative re- 
sponses. These experimenters found that the 
internationalist group made more cooperative 
choices than the isolationistic group, but also 
observed that the three strategies followed by 
the simulated other player had no differential 
effect upon the number of cooperative re- 
sponses within either group. Bixenstine, 
Potash, and Wilson (1963) also report that 
no fixed proportion of cooperative behavior 
on the part of a presumed partner affected 
differential cooperative behavior on the part 
of the subject. 

The fact that the simulated other player’s 


1 This research was supported by the United States 
Air Force through the Air Force Office of Scientific 
Research of the Air Research and Development 
Command, under Contract Number AF49(638)-794. 

responses did not have an effect upon the 
subject’s choices poses a difficult problem 
of interpretation. One cannot determine 
whether the subjects failed to respond to 
variations in the other player’s strategy, or 
whether differences in the other player’s 
strategy evoked different subject goals. 

Depending upon the strategy of the other, 
different goals in different subjects, such as 
maximizing one’s own gains regardless of the 
other player’s gains or losses, or maximizing 
the difference between one’s own and the 
other player’s gains, could produce the same 
proportion of cooperative responses across 
subjects. 

The task of determining what predisposi- 
tions are operative is difficult given the 
structure of the Prisoner’s Dilemma game. 
The players are in a relationship which is 
more or less symmetrical and interdependent, 
and in which the subject may spend a con- 
siderable portion of his time attempting to 
modify the response pattern of the other 
player. In fact, this latter factor introduces 
another source of variance in interpreting the 
subject’s responses. That is, one cannot deter- 
mine whether a given response or response 
sequence by the subject represents: an 
attempt to modify the other player’s re- 
sponse(s); a characteristic response to the 
game independent of the other player's 
response(s); and/or an attempt, given the 
subject’s perception of the other player’s 
strategy, to achieve a given goal such as 
maximizing his own gains, or the difference 
between his own and the other player’s score. 


MATRIX FOR SESSION 1 

             A                     B 
X    You win 5¢            You win 10¢ 
     Opponent wins 5¢      Opponent wins 5¢ 
Y    You lose 1¢           You lose 1¢ 
     Opponent wins 1¢      Opponent wins 1¢ 


The present study was designed to obtain 
additional information regarding some of the 
factors which contribute to the response pat- 
terns of individuals in the Prisoner’s Dilemma 
game. The usual symmetry and interdepend- 
ence of the Prisoner’s Dilemma game were 
modified so that subjects could be observed 
under two conditions: a situation in which 
they could not markedly affect the outcome 
of the game, and subsequently, a situation 
in which they had almost complete con- 
trol over their own and the other player’s 
outcomes. 

Thus, during the first situation (Session 1), 
it was possible for the experimenter, in the 
guise of the “other player,” to vary sys- 
tematically the experiences of subjects who 
had little control over their own and no con- 
trol over the other player’s outcome. And 
during the second situation (Session 2), in 
which the distribution of power was reversed, 
it was possible to observe whether the 
subjects’ strategy varied as a function of this 
initial experience. 

More specifically, the matrix employed in 


                          PLAYER I 
                     A                     B 
PLAYER II   X    You win 5¢            You win 7¢ 
                 Opponent wins 5¢      Opponent loses 4¢ 
            Y    You lose 4¢           You lose 1¢ 
                 Opponent wins 7¢      Opponent loses 1¢ 

Fig. 2. An example of a conventional Prisoner’s 
Dilemma payoff matrix. 


MATRIX FOR SESSION 2 

             X                     Y 
A    You win 5¢            You win 1¢ 
     Opponent wins 5¢      Opponent loses 1¢ 
B    You win 5¢            [illegible] 
     Opponent wins 10¢ 

Fig. 1. Session 1 and Session 2 payoff matrices. (Subject chooses A or B in Session 1, X or Y in Session 2.) 


Session 1 (Figure 1) is structured to allow 
the “simulated” other player to select one of 
two strategies: I, the “X” response, which 
maximizes his own and the subject’s payoffs, 
but may result in the subject winning more 
than himself, or II, the “Y” response, which 
enables the simulated other player to win 
more than the subject (in fact forces the 
subject to lose on each trial), but which 
results in a much lower absolute level of 
winning for the simulated other. 

The subject also has a choice between two 
alternatives in Session 1. If the simulated 
other chooses the Y response, the subject’s 
choice has no effect upon the payoffs. How- 
ever, if the simulated other chooses the X 
response, the subject’s choice will influence 
his own payoff but not the payoff to his 
opponent. The subject may select either the 
“A” response, which gives each player an 
equal payoff, or the “B” response, which 
doubles the subject’s payoff but leaves his 
opponent’s unchanged. The choices and the 
power relationships are reversed in Session 2, 
with the subject controlling the X and Y 
choices. 

The response alternatives available to the 
subjects in a conventional Prisoner’s Dilemma 
game (Figure 2) are usually referred to in 
the literature as cooperative and competitive. 
An analysis of the structure of the conven- 
tional game indicates that a cooperative 
choice, if reciprocated, has two effects: (a) it 
gives an equal and satisfactory payoff, and 
(b) it is the point of maximum joint payoff. 
The competitive response, if unreciprocated, 
has three effects: (c) it maximizes the sub- 
ject’s payoff, (d) it minimizes his opponent’s 
payoff, and (e) it maximizes the difference 
between the two payoffs in favor of the 
competitor. 

The consequences of the choices in the 
present game only partly parallel those of 
the conventional Prisoner’s Dilemma game. 
The A response fulfills Criterion a above, 
whereas the B response fulfills Criteria b, c, 
and e. The X response fulfills a and can 
fulfill b, and the Y response fulfills d and e. 
However, since the B and Y responses each 
fulfill two of the three criteria for competi- 
tive responses and the A and X each 
fulfill at least one criterion for cooperative 
responses, it would appear that the choice of 
B or Y is a more competitive response than 
the choice of A or X. For these reasons, the 
A and X responses will be referred to as the 
“cooperative” responses and the B and Y 
responses will be referred to as the “com- 
petitive” responses. It is, however, important 
to maintain the distinction between our 
definition of these terms and the usual 
definition encountered in the Prisoner’s 
Dilemma literature. 
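
The assignment of Criteria a through e to the four responses can be checked mechanically against the Session 1 matrix. The following is a sketch only (Python; the function name is illustrative, and reading “satisfactory” in Criterion a as “a positive payoff” is an assumption made here, not a definition given in the text). Each cell is scored from the side of the player making the choice.

```python
# Session 1 payoffs in cents: (other's choice, subject's choice) -> (subject, other).
SESSION_1 = {
    ("X", "A"): (5, 5),
    ("X", "B"): (10, 5),
    ("Y", "A"): (-1, 1),
    ("Y", "B"): (-1, 1),
}


def criteria(own, opp, all_cells):
    """Return the criteria (a-e) met by one cell, seen from the chooser's side.

    all_cells lists every (own, opp) pair of the matrix from the same side.
    """
    met = set()
    if own == opp and own > 0:  # (a) equal and satisfactory (assumed: positive)
        met.add("a")
    if own + opp == max(o + p for o, p in all_cells):  # (b) maximum joint payoff
        met.add("b")
    if own == max(o for o, _ in all_cells):  # (c) maximizes chooser's payoff
        met.add("c")
    if opp == min(p for _, p in all_cells):  # (d) minimizes opponent's payoff
        met.add("d")
    if own - opp == max(o - p for o, p in all_cells):  # (e) maximizes difference
        met.add("e")
    return met


subject_view = list(SESSION_1.values())
other_view = [(other, subj) for subj, other in SESSION_1.values()]

# The subject's A and B responses are evaluated when the other plays X,
# the only case in which the subject's choice changes any payoff.
subject_side = {
    choice: criteria(*SESSION_1[("X", choice)], subject_view) for choice in ("A", "B")
}
# The other's X and Y responses are evaluated for each subject choice.
other_side = {
    key: criteria(other, subj, other_view) for key, (subj, other) in SESSION_1.items()
}

print(subject_side)
print(other_side)
```

Under these assumptions A meets only a, B meets b, c, and e, and Y meets d and e, as stated above. X meets a when paired with A and b when paired with B; read mechanically it also meets c, which agrees with the earlier description of X as the response that maximizes the simulated other player’s own payoff.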

Thus, in the present study the structure 
of the Prisoner’s Dilemma game is modified 
so that the subject will be aware that he 
cannot directly affect the other player’s re- 
sponses and that he cannot be affected by 
them. Furthermore, the payoff matrices con- 
tain more extreme values than those normally 
employed in Prisoner’s Dilemma games. 
These particular values were selected to deter- 
mine whether subjects make responses which 
maximize their own outcomes even though 
such responses enable the other player to 
win as much or more than the subject, or 
responses which markedly reduce their own 
absolute winnings, but assure the subjects 
that they are winning more than the other 
player. Under these conditions, an attempt 
is made to determine whether either or both 
the more internationalistic and/or the more 
isolationistic subjects modify their behavior 
in Session 2 as a function of differences in the 
strategy of a simulated other in Session 1. 


METHOD 
Subjects 


Eighteen male and 18 female students in a summer 
session psychology course at the University of Wash- 
ington served as subjects. Within each sex, the sub- 
jects were randomly assigned to each of two experi- 
mental conditions, that is, a “cooperative other” 
or a “competitive other.” Thus, each experimental 
group was composed of 9 females and 9 males. 
All subjects were run in like-sexed pairs. Whenever 
two subjects of the same sex were not available, 
a paid participant of the same sex was used to 
maintain the illusion that the subjects were playing 
against each other. The Internationalism-Isolationism 
Scale (Lutzker, 1960) was administered at the end 
of Session 2. For purposes of analysis, each experi- 
mental group was split at the median of its respec- 
tive distribution of scores. There was no median 
overlap between the two groups, that is, all subjects 
called “internationalists,” regardless of the experi- 
mental group, scored higher than all subjects called 
“isolationists.” Thus, four orthogonal groups were 
employed in the analysis: internationalist-cooperative 
other, internationalist-competitive other, isolationist- 
cooperative other, and isolationist-competitive other. 
Although the distribution of sexes was unequal in 
the four groups, a separate analysis of variance, 
using sex as one independent variable, demonstrated 
that the results of the experiment could not be 
attributed to sex differences in style of play. 


Experimental Procedure 


Two game matrices were employed (Figure 1). In 
the Session 1 matrix the subjects controlled the 
A and B choices; in the Session 2 matrix, the X 
and Y. Since the other player was simulated, each 
subject had the same matrix at any given time, 
although each was led to believe that the other 
player controlled the X and Y choices in the 
Session 1 matrix, and A and B in the Session 2 
matrix. 

Each subject had a game board in front of him 
with the appropriate matrix displayed on it. Each 
of the game boards was connected to a master 
control panel operated by the experimenter, so 
that the experimenter had the option of sending 
the subject either a “cooperative” or a “competitive” 
response on any given trial. In the cooperative-other 
condition, in Session 1, the experimenter sent the 
subject 26 cooperative responses, X, and 4 com- 
petitive responses, Y. In the competitive-other 
condition, in Session 1, the experimenter sent the 
subject 4 cooperative responses and 26 competitive 
responses. In Session 2, the subjects in both condi- 
tions were sent 27 A and 3 B responses. In this 
instance an A response by the simulated other indi- 
cated to the subject that the other was playing 
a strategy which would enable both players to win 
the same amount and that he was not attempting to 
win more than the subject. 
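
To give a sense of what the two Session 1 schedules meant in money terms, the sketch below totals the payoffs over 30 trials for a hypothetical subject who makes the same choice on every trial. The fixed-choice subject is an illustrative assumption (actual subjects mixed their choices), and the names are not from the study; the payoffs and response counts are those given in Figure 1 and in the procedure above.

```python
# Session 1 payoffs in cents: (other's choice, subject's choice) -> (subject, other).
SESSION_1 = {
    ("X", "A"): (5, 5),
    ("X", "B"): (10, 5),
    ("Y", "A"): (-1, 1),
    ("Y", "B"): (-1, 1),
}

# Number of X (cooperative) and Y (competitive) responses sent in 30 trials.
SCHEDULES = {
    "cooperative other": {"X": 26, "Y": 4},
    "competitive other": {"X": 4, "Y": 26},
}


def session_totals(schedule, subject_choice):
    """Total (subject, other) winnings in cents for a constant subject choice."""
    subject_total = 0
    other_total = 0
    for other_choice, n_trials in schedule.items():
        subj, other = SESSION_1[(other_choice, subject_choice)]
        subject_total += n_trials * subj
        other_total += n_trials * other
    return subject_total, other_total


for condition, schedule in SCHEDULES.items():
    for choice in ("A", "B"):
        print(condition, choice, session_totals(schedule, choice))
```

Against the cooperative other, a constant A player would end Session 1 with 126 cents and a constant B player with 256 cents; against the competitive other the corresponding totals are a loss of 6 cents and a gain of 14 cents. In both conditions the simulated other’s total is unaffected by the subject’s choice.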

When the subjects entered the experimental room, 
a coin was flipped by the experimenter to determine 
the booth assignment. The subjects were informed 
that one subject had an initial advantage over the 
other, and the purpose of the coin was to determine 

TABLE 1 

MEAN NUMBER OF COOPERATIVE CHOICES FOR 
INTERNATIONALISTS AND ISOLATIONISTS BY 
CONDITION AND BY SESSION 

Strategy of               Internationalists            Isolationists 
simulated 
other player          Session 1     Session 2     Session 1     Session 2 
Cooperative           [illegible]   [illegible]      9.44         18.56 
Competitive             11.22         16.33          8.56         21.00 

the advantage by chance. They were told further 
that the experiment would be halted halfway 
through, and that they would then trade booths and 
reverse the advantage. In actuality, both subjects 
received the same treatment in both sessions. 

The subjects were then seated in partitioned 
booths. They could not see each other or the experi- 
menter. Each subject’s game board was equipped 
with two response buttons and four lights to indicate 
mutual choices. The game was then explained and 
the equipment demonstrated. The instructions were 
designed to be as neutral as possible. Each subject 
was given $1.40, and told that they could keep 
whatever they had at the end of the experiment. 
They were then handed the matrices, which slipped 
over the game board, in such a way that they 
were not aware that they both had identical matrices. 
Payoffs were made after each trial by slipping coins 
through slits in the partitions. 

At the end of 30 trials the subjects were told to 
hand the Session 1 matrices to the experimenter, to 
gather up their money, and to trade booths. When 
they were settled, Session 2 matrices were handed 
to them. After 30 more trials, the experiment 


TABLE 2 

RESULTS OF ANALYSIS OF VARIANCE 

Source                                    df        MS          F 
Between groups 
  Internationalism-isolationism (A)        1        .89         - 
  Strategy of the “other player” (B)       1       3.56         - 
  A × B                                    1      26.88         - 
  Error (b)                               32      62.86 
Within groups 
  Session 1 versus Session 2 (C)           1   2,112.50      29.83** 
  A × C                                    1        .05         - 
  B × C                                    1      76.05         - 
  A × B × C                                1     352.38       4.98* 
  Error (w)                               32      70.81 

* p < .05. 
** [significance level illegible] 

was concluded. The subjects then completed the 
Internationalism-Isolationism Scale. When they had 
completed this task, the experiment was explained 
to them, and they were awarded $2.50 for having 
participated. 


RESULTS 


The choice behavior of internationalistic 
and isolationistic subjects by condition and 
by trial is presented in Table 1 in terms 
of the mean number of cooperative responses. 
Table 2 includes the results of a mixed 
design analysis of variance which takes into 

Fig. 3. Mean number of “cooperative” responses 
made by internationalists and isolationists in Ses- 
sion 1 and Session 2 under conditions of “coopera- 
tive” other and “competitive” other. 


consideration the main effects of internation- 
alism versus isolationism, cooperative versus 
competitive simulated other, and Session 1 
versus Session 2, as well as relevant inter- 
action effects. 
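
The bookkeeping behind this mixed design can be sketched as follows (Python; a minimal illustration with names chosen here, not the authors’ computation). With 36 subjects in a 2 × 2 between-subjects arrangement measured in two sessions, the degrees of freedom split into a between-subjects part and a within-subjects part, each with its own error term, and each F ratio is the effect mean square over the matching error mean square.

```python
n_subjects = 36
a, b, c = 2, 2, 2  # levels of A (personality), B (other's strategy), C (session)

# Between-subjects sources: effects of A and B are tested against Error (b).
df_between = {
    "A": a - 1,
    "B": b - 1,
    "A x B": (a - 1) * (b - 1),
    "Error (b)": n_subjects - a * b,
}

# Within-subjects sources: every term involving C is tested against Error (w).
df_within = {
    "C": c - 1,
    "A x C": (a - 1) * (c - 1),
    "B x C": (b - 1) * (c - 1),
    "A x B x C": (a - 1) * (b - 1) * (c - 1),
    "Error (w)": (n_subjects - a * b) * (c - 1),
}

# Together the two parts account for all n_subjects * c - 1 degrees of freedom.
total_df = sum(df_between.values()) + sum(df_within.values())

# F ratios for the two significant within-subjects effects in Table 2.
ms_error_within = 70.81
f_session = round(2112.50 / ms_error_within, 2)
f_three_way = round(352.38 / ms_error_within, 2)

print(df_between)
print(df_within)
print(total_df, f_session, f_three_way)
```

Both error terms come out at 32 degrees of freedom, and the recomputed ratios agree with the F values of 29.83 and 4.98 reported in Table 2.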

In addition to the main analysis of vari- 
ance, a supplementary analysis of variance 
was performed on the choice behavior 
of the 10 subjects in each experimental 
group scoring highest and lowest on the 
Internationalism-Isolationism Scale. The re- 
sults of the supplementary analysis are quite 
similar to the results of the main analysis. 
The relative position of the means that enter 
into the three-way interaction is preserved. 

It can be seen from these tables that in 
this particular game situation, international- 
ists did not demonstrate an overall tendency 
to be more cooperative than isolationists. In 
line with previous findings, the cooperative- 
other and competitive-other strategies did not 
have an effect on the overall number of co- 
operative choices made by the subjects. How- 
ever, the interaction between personality, 
strategy of the other, and parts of the game 
is significant. This interaction, using the data 
from all subjects, is presented visually in 
Figure 3. Inspection of this interaction indi- 
cates that the two-way interaction between 
parts of the experiment and strategy of the 
other player was significantly different for 
internationalists as compared to isolationists. 
Both groups made significantly more coopera- 
tive responses in Session 2, which is not 
surprising since a cooperative response in this 
session also maximizes the subjects’ absolute 
gains. 

It appears that the strategy of the other 
player had little differential effect upon the 
isolationists in Session 1 and Session 2. How- 
ever, the internationalists made more coopera- 
tive responses in Session 1 when faced with 
a competitive opponent than when playing 
against a cooperative opponent. This pattern 
was reversed in Session 2 with the interna- 
tionalists being more likely to allow a previ- 
ously cooperative opponent to win as much 
or more than their previously competitive 
opponent. 

Finally, it is apparent that even though 
a greater proportion of cooperative responses 
were made in Session 2 than in Session 1, 
there was still a substantial number of com- 
petitive responses made across subjects in 
Session 2. These responses occurred even 
though the subjects knew they would receive 
$.05 if they made a cooperative response, 
and only $.01 if they made a competitive 
response. Thus, it appears that even when 
it is substantially more profitable in terms 
of absolute winnings to make a cooperative 
response, subjects continue to make a sur- 
prising proportion of competitive responses 
to ensure that they win more than the other 
player. 

Discussion 


In the present experiment two major modi- 
fications of the Prisoner’s Dilemma game 
were made to determine whether differences 
in the strategy of the other player differen- 
tially affected the responses made by interna- 
tionalistic and isolationistic subjects. First, 
the nature of the interdependent relationship 
between players was modified so that the sub- 
jects were first in a position where they had 
a minimal effect upon the outcome of the 
game (Session 1), and then they assumed a 
position where they could control the outcome 
(Session 2). In terms of the theoretical 
paradigm of Thibaut and Kelley (1959), one 
could say that at the beginning of the game 
the other player had “fate control” over the 
subject, that is, “A can affect B’s outcomes 
regardless of what B does,” and during the 
second half of the experiment the subject had 
fate control over the other player. Second, 
the structure of the payoff matrix was modi- 
fied so that when one player had fate control 
over the other, he could select one of two 
behavioral alternatives, a response which 
maximized his own personal gain, but which 
permitted the other subject to win as much 
or more than himself, or a response which 
provided the player with a markedly smaller 
absolute payoff, but assured him that the 
other player would receive an even smaller 
one. 

During the first half of the experiment 
when the subject was under the fate control 
of the other player, the subject was still 
able to exercise an effect upon his own out- 
come under one condition. Namely, if the 
other made a cooperative response, that is, 
one which maximized the other’s outcome but 
which also permitted the subject to win as 
much or more, the subject could select a 
response which would enable him to gain 
equivalent winnings, or a response which 
would enable him to win twice as much as 
the other player. If, however, the other player 
selected a competitive response, that is, one 
which maximized the difference between his 
own and the other player’s score, the subject’s 
response did not affect the outcome. During 
this phase of the experiment, a determination 
was made as to whether the subjects 
given a simulated competitive or cooperative 
strategy by the other player would opt for 
equivalent or higher payoffs should the simu- 
lated other select a cooperative response. 
That is, would the subject attempt to inform 
the other player who had fate control over 
him that he was attempting to achieve equiv- 
alent rewards (cooperation), or that he was 
attempting to achieve the highest rewards, 
and to maximize the difference between his 
own and the player’s winnings (competition). 

The data from Session 1 indicate that a 
substantial proportion of the responses 
(70%) made by the subject when under the 
fate control of a simulated other player were 
ones which reflected an attempt by the subject 
to achieve the highest reward possible 
for himself, one substantially higher than that of 
the other player. There were no significant 
differences in the proportion of such responses 
selected between subjects who were “co- 
operated” with or “competed” against by the 
other player, nor were there significant differences 
between internationalists and isolationists. 
In comparing groups (internationalists 
versus isolationists) by treatments (coopera- 
tive versus competitive other), we also find 
no significant difference in Session 1. We 
can note that the internationalists who play 
with a cooperative other tend to select the 
highest proportion (76%) of competitive 
responses, whereas the internationalists who 
play with a competitive other select the lowest 
proportion (63%) of competitive responses. 
The two isolationistic groups fall between 
these two proportions. Thus, the internation- 
alists in the cooperative conditions seem to 
be more likely to play to maximize their own 
gain and the difference between their own and 
the other player’s score—perhaps under the 
assumption that when the positions of fate 
control are reversed they will support similar 
behavior on the part of the other player. The 
internationalists under the competitive condi- 
tion made the largest proportion of coopera- 
tive responses—perhaps to indicate to the 
other player that they would settle for 
equivalent rewards if he would shift from a 
competitive to a cooperative strategy. 

When the positions of power of the players 
were reversed in the second part of the experiment, 
we observe that indeed the internationalists 
who were cooperated with during Session 1, 
and who most frequently selected 
responses which would enable them to maximize 
their own gains, show a greater increment 
in cooperative responses in Session 2. 
That is, they most frequently select responses 
which permit themselves the highest absolute 
reward while permitting the other player to 
win as much or more than themselves. Conversely, 
the internationalists who were competed 
against in Session 1 show the smallest 
increment in cooperative responses in Session 2. 
It would appear that when the other 
player has fate control over the internationalistic 
subjects, and selects competitive 
responses, the internationalists attempt to 
modify the other’s behavior by making a 
greater proportion of responses that indicate 
they are willing to settle for equivalent 
rewards. When this fails, and when they 
assume fate control in Session 2, the internationalists 
retaliate by selecting the highest 
proportion of competitive responses, even 
when such responses markedly reduce their 
own absolute payoffs. In contrast to the internationalists, 
the isolationists appear to be 
insensitive to differences in the strategy of the 
other player, both in Session 1 and Session 2. 
In general it appears that variations in the 
strategy of a simulated other player 
produce differences in subjects’ responses 
when those subjects are internationalistic in 
orientation, and when the Prisoner’s Dilemma 
game is modified in the ways described above. 
It seems that for internationalistic subjects 
“cooperation” fosters “cooperation” 
and “competition” fosters “competition.” At 
this point in research on game-playing behavior, 
it seems necessary to investigate more 
intensively the differences between two 
dispositions or player goals, namely, the selection 
of a strategy which maximizes personal 
gains regardless of the outcomes afforded 
the other player, and a strategy which maximizes 
the differences between own and 


two strategies. 
RECOVERY OF A DISCRETE MEMORY¹ 


PHEBE CRAMER 


Barnard College 


Using a technique of indirect priming it was hypothesized that the occurrence 
of specific responses on both an association test and a test of recall could be 
increased. The results indicated that although the associative response frequency 
was unchanged, indirect priming did affect the kinds of errors made in 
recall. Primed-for words, although never overtly presented to S, were imported 
with significant frequency. Correspondences between this study and certain 
aspects of Freud’s theory of memory are discussed. 


In the Psychopathology of Everyday Life, 
Freud (1951) outlined a theory of memory 
and forgetting which postulated that forget- 
ting is a motivated event—that is, that lapses 
in memory are due to repression. However, 
although less accentuated in his writing, 
Freud also indicated that there are two addi- 
tional factors which are important in bringing 
forgotten events (or words) to consciousness: 
first, the concentration of attention, and 
second, the identification of the associative 
links from the conscious derivative memory 
to the forgotten event (or word). 

These ideas had been presented previously 
by Freud (1954). There, Freud discussed 
rational thought as based on a network of 
discrete (verbal) memories linked together 
by associative bonds, whose strength depends 
in large part on the frequency of their simul- 
taneous occurrence in past experience. 

This theory is quite similar to that held by 
a number of contemporary psychologists 
studying verbal behavior (Deese, 1959, 1961, 
1962; Jenkins & Russell, 1952; Russell, 1955, 
1959). By careful study of associative bonds, 
as determined from word-association data, 
they have been able to make predictions about 
the more complicated cognitive functions 
represented in transfer, clustering, memory, 
and recognition threshold phenomena. There 
is a further link between the two views, as 
seen in their approach to the modification of 
associative processes. Both conceive of verbal 




1 This paper is based on a doctoral dissertation 
submitted at New York University. The author 
expresses appreciation to Charles N. Cofer (chair- 
man) and to Edith Neimark and Donald P. Spence 
for their help. Donald P. Spence offered many 
helpful suggestions about the present manuscript. 
memories as organized into an associative 
network; and both hypothesize that the rela- 
tive strength of bonds within the network 
may be temporarily changed through, for 
example, experimentally focusing the indi- 
vidual’s attention on certain aspects of the 
network. Storms (1958), and subsequently 
Segal and Cofer (1960), have demonstrated 
that word-association responses of low fre- 
quency can be increased significantly if those 
response words are presented to the subject 
just prior to the association test. For ex- 
ample, if CABBAGE elicits HEAD as a response 
with a frequency of 16% under standard con- 
ditions, the prior presentation of HEAD can 
increase that frequency. 

Besides this direct priming, is there an- 
other means to increase the response fre- 
quency? Freud also suggested that the direc- 
tion of attention to associatively related words 
should do this. Deese (1959) has demonstrated 
that this relationship will predict which 
words appear as intrusions in recall. The pres- 
ent study attempts to influence word-as- 
sociation responses through knowledge of as- 
sociative networks (Russell & Jenkins, 1954), 
by directing the attention of the subject 
toward a discrete verbal memory. It is hy- 
pothesized that these two factors, as Freud 
suggested, will increase the likelihood of the 
memory being recovered. 

To illustrate this technique of indirect 
priming, let us again consider the CABBAGE- 
HEAD example. This time, we present HAIR, 
FOOT, SHOULDERS, NECK, and BODY prior to 
the association test, thus directing the subject’s 
attention to words which are associatively 
related to the primed-for word, HEAD. 
It is assumed that the excitation of these 
words will spread to the hub word (HEAD), 
thereby increasing the probability of it being 
given as a response. Theoretically, this tech- 
nique may also be applied to increase the 
infrequent reverse association, HEAD-CABBAGE. 
In this case, FOOD, SALAD, SOUP, LETTUCE, and 
VEGETABLE would be presented prior to the 
association test. 

The following specific hypotheses were in- 
vestigated: 

I. The strength of an associative bond can 
be temporarily increased through indirect 
priming. 

II. Indirect priming of a hub word will 
cause it to be incorrectly recalled as being 
part of the priming list (the list of associative 
constellations presented for one-trial learn- 
ing). 

III. The presentation of a list of words for 
indirect priming (the peripheral members of 
the associative constellation) will increase the 
probability of those words occurring as re- 
sponses to various stimuli on the association 
test. In terms of the definitions provided 
above, this would be a demonstration of a 
kind of direct priming. 
EXPERIMENT I 
Method 


Subjects. The subjects were 220 undergraduate 
students in elementary psychology classes at New 
York University, School of Education.² They were 
run in groups during their regular classes. 

Procedure. The subjects were divided into two 
experimental groups and two control groups. Each 
group was asked to learn a list of 26 words. The 
words, read by the experimenter at the rate of one 
per 2 seconds, were presented simultaneously on 
printed cards. Following this, subjects were given 
a word-association test. Then they were given 
5 minutes for written recall of the first (priming) 
list. 

The lists of words presented for learning varied 
across the groups. One experimental group (Group 1) 
learned words which formed the four constellations 
around four different B words (for example, the 
constellation HAIR, FOOT, SHOULDERS, NECK, and 
BODY surrounds the hub word HEAD). The other 
experimental group (Group 2) learned words which 
formed the constellations around four different A 
words (for example, CABBAGE is the hub word and 
LETTUCE, VEGETABLE, FOOD, SALAD, and SOUP make 
up the constellation). 

Two control groups (Groups 3 and 4) learned a 
third list of priming words. This list also consisted 


2 S. Lepato, J. A. Piekarz, I. L. Smith, and H. 
Turkel generously made their classes available. 


TABLE 1 

EXPERIMENT I: PRIMING LISTS, ASSOCIATION-TEST STIMULI, AND PRIMED-FOR RESPONSES 

Association-test stimuli      Priming lists                           Primed-for words 
Groups 1 and 3 Groups 2 and 4 Group 1     Group 2      Groups 3 and 4 Group 1   Group 2   Groups 3 and 4 

CABBAGE        HEAD           HAIR        LETTUCE      HILL           HEAD      CABBAGE   MOUNTAIN 
                              FOOT        VEGETABLE    HIGH 
                              SHOULDERS   FOOD         PEAK 
                              NECK        SALAD        VALLEY 
                              BODY        SOUP         CLIMB 

COMFORT        CHAIR          TABLE       BED          MAD            CHAIR     COMFORT   ANGER 
                              SIT         EASE         FEAR 
                              LEGS        HOME         HATE 
                              SEAT        DISCOMFORT   RAGE 
                              DESK        REST         TEMPER 

WORKING        HARD           SOFT        LOAFING      LAMB           HARD      WORKING   MUTTON 
                              ROCK        PLAYING      SHEEP 
                              EASY        LABOR        MEAT 
                              EGG         LAZY         CHOPS 
                              BOILED      SLEEPING     BEEF 

BABY           BOY            MAN         CHILD        STEAL          BOY       BABY      THIEF 
                              SCOUT       CRY          ROBBER 
                              DOG         MOTHER       CROOK 
                              FRIEND      SMALL        BURGLAR 
                              BROTHER     INFANT       MONEY 
TABLE 2 

EXPERIMENT I: EFFECTS OF INDIRECT AND DIRECT PRIMING 

                              Mean occurrence B              Mean occurrence A 
Indirect priming              Norm    Group 1   Group 3      Norm    Group 2 
CABBAGE-HEAD                  16%     6%        9%           2%      3% 
COMFORT-CHAIR                 12%     16%       13%          0       [illegible] 
WORKING-HARD                  13%     28%       4%           0       0 
BABY-BOY                      16%     9%        11%          0       0 

Direct priming                        Group 1   Group 3              Group 2 
List 1 (or 2) words only              45        8                    14 
List 3 (or 4) words only              7         8                    4 
Both 1 and 3 (or 2 and 4) 
  or neither                          33        30                   36 
N                                     85        46                   54 


of four different associative constellations of five 
words each, but these constellations were unrelated 
to those used for the experimental groups. (For a 
complete listing of the constellations used, see Table 
1.) Within each group, the constellations were 
chosen so that there was no overlap or interrelatedness 
among them. 

The 20 priming words were presented in the 
order they appear in Table 1. In addition, three 
buffer words appeared at the beginning and at the 
end of the list, to control for primacy and recency 
effects. 

Two different association tests were given. Group 1 
and its control group (3) were given a test con- 
taining the four critical A words (for example, 
CABBAGE) as stimuli. Group 2 and its control group 
(4) were given a test containing the four critical B 
words (for example, HEAD). Both tests began with 
the same two buffer words and concluded with 
the same 22 filler words chosen from the com- 
monality list (Peterson & Jenkins, 1957). These 
latter words were included to give credence to 
the association test and to reduce the apparent 
importance of the four critical stimulus words. (For 
a listing of the critical stimuli, see Table 1.) 

To summarize the procedure: Group 1 was 
primed for A words to occur as responses on the 
association test. Knowing the normative frequency 
of B → A associations (for example, HEAD → CABBAGE 
= 2%), any significant increase in this frequency 
would be taken as evidence of successful 
priming. In addition, Group 3 served as another 
control to Group 1. Since both groups were sub- 
jected to the same general procedure and took an 
identical association test, the only difference be- 
tween them was in the nature of the priming task. 
The priming words in Group 1 were associates to 
the primed-for response (for example, HEAD) while 
the priming words in Group 3 were not related to this 
response, but were related to another word which 
was irrelevant to the purposes of the association test. 
(In Table 1 this word is designated X.) 

Group 2 was primed for B words to occur as 
responses on the association test. Again, the normative 
frequency (for example, CABBAGE → HEAD = 
16%) served as one control; Group 4, who took the 
same association test but was primed for X rather 
than B words, served as another control. If the 
frequency of B words as responses on the association 
test is greater in Group 2 than in either of 
these control conditions, this will be taken as 
evidence of indirect priming. 

Finally, all groups of subjects were requested to 
write down as many words of the first (priming) list 
as they could remember. Of special interest were the 
kinds of errors in memory the subjects would make. 
Specifically, the question was raised if Group 1 
would erroneously recall (that is, import) the 
primed-for B words as part of the original list, if 
Group 2 would import the primed-for A words, and 
if Groups 3 and 4 would import the (irrelevant) 
primed-for X words. 


Results 


Analysis of association-test data. The responses 
to the 28 stimulus words were 
recorded for all subjects, the four groups 
being tabulated separately. 

Indirect priming. Table 2 presents the 
mean occurrence of the primed-for responses 
(B or A) for the experimental, the control, 
and the Minnesota normative groups. A Link-Wallace 
method of allowances test for differences 
between group means (Mosteller & 
Bush, 1954) indicated there were no significant 
differences between groups with 
regard to the effect of indirect priming, or 
across groups among the four A-B pairs. 

Direct priming. Table 2 also presents, in 
absolute frequencies, the number of subjects 
who gave as association responses words 
which appeared on their priming lists. It 
also presents the number of subjects who gave 
as association responses words which appeared 
on the priming list of their corresponding 
control group (but did not appear on their 
own list). A third category includes subjects 
who gave as responses words which appeared 
on both or neither list. The categories are 
mutually exclusive. 

The chi-square values based on these data 
are 15.78 (Group 1 versus 3) and 16.99 
(Group 2 versus 4); for df = 2, both ps < 
.001, showing a significant overall difference 
among the groups. Group 1 had more subjects 
who gave List 1 words as responses on the 
association test than did Group 3 (53% 
versus 17%); and Group 3 had more sub- 
jects who gave List 3 words as responses than 
did Group 1 (17% versus 8%). Furthermore, 
Group 2 had more subjects who gave List 2 
words as responses on the association test 
than did Group 4 (26% versus 3%); and 
Group 4 had more subjects who gave List 4 
words as responses than did Group 2 (37% 
versus 7%). 
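
As a rough check on the direct-priming comparison, the Pearson chi-square for Groups 1 and 3 can be recomputed from the frequencies given in Table 2. The sketch below is an illustration only and is not the original analysis; the function name `pearson_chi_square` is an assumption introduced here. The value it yields is close to the reported 15.78, the small difference presumably reflecting rounding in the original computation.

```python
def pearson_chi_square(table):
    """Return (chi-square, degrees of freedom) for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of group and response category
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# Table 2 counts: own-list words only, control-list words only, both or neither
group_1 = [45, 7, 33]  # N = 85
group_3 = [8, 8, 30]   # N = 46

chi2, df = pearson_chi_square([group_1, group_3])
print(f"chi-square = {chi2:.2f}, df = {df}")
```

The same function accepts any table of counts, so the Group 2 versus Group 4 comparison could be checked in the same way once all cell frequencies are available.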

Analysis of recall data. Correct recall. 
Table 3 shows that the number of words 
correctly recalled in each of the four groups 
was quite similar. An analysis of variance 
yielded an F value of 1.973, which was not 
significant. 
Importation. In addition to words correctly 
recalled, subjects tended to import into recall 
words which were not presented on the orig- 
inal priming list, but which were the “primed-for” 
words. For example, subjects in Group 1 
recalled CABBAGE (or in Group 2 recalled 
HEAD) as being part of the original list, while 
in fact these words were never presented. 
The number of subjects in Group 1 and 
in Group 3 who imported List 1 and List 3 
primed-for words is tabulated in Table 3. 
Fifty-four percent of the subjects in Group 1 
imported List 1 words (B) into recall, while 
no subjects in Group 3 imported these words. 
On the other hand, 48% of the subjects in 
Group 3 imported List 3 words (X) into 
recall, while no subjects in Group 1 imported 
these words. For these frequencies χ² = 45.32, 
which, for df = 2, is significant beyond the 
.001 level. 

Table 3 also presents the comparable data 
for Groups 2 and 4. Seventeen percent of the 
Group 2 subjects imported List 2 words (A) 
TABLE 3 

EXPERIMENT I: ANALYSIS OF RECALL DATA; NUMBER OF SUBJECTS IMPORTING PRIMED-FOR WORDS 

                        Group 
                        1       2       3       4 
Correct recall 
  M                     13.4    12.7    13.8    14.7 
  SD                    3.8     3.6     3.4     4.9 
Importation 
  Primed-for words 
    A                   -       9       -       0 
    B                   46      -       0       - 
    X                   0       0       15      14 
  Number of subjects 
    importing neither   39      45      31      21 
N                       85      54      46      35 


into recall, while no subjects in Group 4 
imported these words. On the other hand, 
40% of the subjects in Group 4 imported 
List 4 words (X), while no subjects in Group 
2 imported them. Due to the small expected 
frequencies, the significance test for importa- 
tion of A words was done separately from 
that for importation of X words. When Groups 
2 and 4 are compared on the number of subjects 
importing A words, χ² = 6.41, which, 
for df = 1, is significant at the .01 level (one- 
tailed test). When these two groups are com- 
pared for number of subjects importing X 
words, χ² = 25.67, which, for df = 1, is significant 
at the .001 level. 
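
The two separate tests for Groups 2 and 4 can be approximated from the Table 3 frequencies with the usual shortcut formula for a 2 × 2 table. The sketch below assumes no continuity correction was applied (the text does not say), and the function name `chi_square_2x2` is introduced here only for illustration. The values obtained are close to the reported 6.41 and 25.67.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (df = 1, no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Rows: Group 2 (N = 54), Group 4 (N = 35); columns: imported, did not import
chi_a = chi_square_2x2(9, 45, 0, 35)   # importation of A words (Table 3)
chi_x = chi_square_2x2(0, 54, 14, 21)  # importation of X words (Table 3)

print(f"A words: chi-square = {chi_a:.2f}")
print(f"X words: chi-square = {chi_x:.2f}")
```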

Thus, although indirect priming did not 
result in changed association-test responses, 
it did show an effect on the recall test. Sub- 
jects imported those primed-for words which 
were predicted to occur as association re- 
sponses. Furthermore, the importation of 
these words occurred only in the group which 
was primed for them, and this occurrence was 
highly significant. 

Experiment II was an attempt to show this 
effect on the association test itself by chang- 
ing the associative relationship between the 
priming and primed-for words. 


EXPERIMENT II 


As mentioned earlier, the associative con- 
stellations in Experiment I consisted of a 
hub word and the responses to that word. In 
using these (response) words for indirect 
priming, a crucial assumption was made: that 
reverse associative bonds existed. That is, it 
was assumed that if there is a bond from 
A to B there is also a bond from B to A. 
A series of studies on the bi-directionality of 
associative bonds, cited by Russell (1955), 
supports this assumption. 

Considering Experiment I in retrospect, it 
appeared that since the priming was indirect 
(and thus probably weaker than direct prim- 
ing), it may have been too much to assume 
that reverse association could result in the 
desired priming effect. Moreover, in Group 2 
of Experiment I the paradigm had built in 
two reverse associative relationships—that of 
the priming words to the critical hub word, 
and that of the stimulus word on the associa- 
tion test to the desired response. 

Because of the preceding considerations, 
the words of the priming list in Experiment 
II were chosen to have a different relation- 
ship to the word they were expected to 
prime. That is, the priming words were 
stimuli (for example, BIRD, MOTH, EAGLE) 
which are known to elicit the desired response 
(FLY) rather than being elicited by it, as was 
true in Experiment I. Furthermore, only forward 
priming (A → B) was investigated. 

An example of one of the 14 associative constellations 
used in Experiment II follows: 
the words BIRD, MOTH, EAGLE, and SPIDER 
(A) are all known to elicit FLY (B) as a 
response. The first three of these A words 
were used for priming, and the last was used 


as the critical stimulus word on the associa- 
tion test. 


Method 
Subjects. The subjects were 141 undergraduate 
students in elementary psychology classes at the 


TABLE 4 

EXPERIMENT II: EFFECTS OF DIRECT PRIMING 

                              Group 
                              1      3      2      4 
List 1 (or 2) words only      15     9      16     4 
List 3 (or 4) words only      4      8      1      5 
Both 1 and 3 (or 2 and 4) 
  or neither                  17     21     14     27 
N                             36     38     31     36 
TABLE 5 

EXPERIMENT II: ANALYSIS OF RECALL DATA; NUMBER OF SUBJECTS IMPORTING PRIMED-FOR WORDS 

                        Group 
                        1       2       3       4 
Correct recall 
  M                     15.6    14.7    15.8    16.0 
  SD                    3.1     3.5     [illegible]  [illegible] 
Importation 
  Primed-for words 
    B1                  10      -       0       - 
    B2                  -       15      -       1 
    Y                   0       0       5       3 
  Number of subjects 
    importing neither   26      16      33      32 
N                       36      31      38      36 


Albany State Teacher’s College. They were run in 
groups during their regular classes. 

Procedure. The procedure is essentially the same 
as in Experiment I, Four groups of subjects (two 
experimental and two control) were read (and 
shown) three different lists of 25 words for one- 
trial learning. Following this, they took a 28-item 
word-association test, and then gave a written re- 
call of the first (priming) list. 

The priming lists consisted of seven associative 
constellations of three words each (plus two buffer 
words at the beginning and end of the list). Each 
experimental group had a different list presented, and 
the two control groups learned yet a third list. The 
priming words were chosen so that none of them 
was directly associated to any of the critical stimuli 
on the association test. 

Two different association tests were given. Group 
1 and their control group (3) were given one test 
containing seven critical A stimuli, and Groups 2 
and 4 were given another test with seven different 
A stimuli. Both tests began with the same two 
buffer words and concluded with the same 19 words 
chosen from the commonality list. 

Thus, each experimental group was primed for dif- 
ferent B words to occur as responses on the associa- 
tion test. Each had a control group which took the 
same association test but was irrelevantly primed. 
The normative data (for example, SPIDER → FLY 
= 4%) on the associative frequency of A words 
eliciting B words as responses provided a second 
control. 


Results 


Analysis of association-test data. Indirect 
priming. As before, the mean occurrence of 
the primed-for B responses was determined 


3 H. E. Eson, N. Greenfeld, and E. D. Lawson 
generously made their classes available. 
for the experimental, control, and normative 
groups. Again there were no significant dif- 
ferences between groups with regard to in- 
direct priming. 

Direct priming. Table 4 presents the data 
for direct priming. Group 1 had more sub- 
jects who gave List 1 words as responses on 
the association test than did Group 3 (42% 
versus 24%); and Group 3 had more subjects 
who gave List 3 words as responses than did 
Group 1 (21% versus 10%). Furthermore, 
Group 2 had more subjects who gave List 2 
words as responses on the association test 
than did Group 4 (52% versus 11%); and 
Group 4 had more subjects who gave List 4 
words as responses than did Group 2 (14% 
versus 3%). 

Although these differences are in the ex- 
pected direction, only the chi-square of 
13.64 for Groups 2 versus 4 is significant 
(.002 level, df = 2). 

Analysis of recall data. Correct recall. 
Table 5 shows that the four groups had 
very similar recall scores. An analysis of 
variance yielded an F value of 1.044, which 
was not significant. 

Importation. As in the previous experiment, 
these subjects also imported the primed-for 
words into the test for recall. 

Table 5 presents the number of subjects in 
Group 1 and Group 3 who imported List 1 
and List 3 primed-for words, as well as the 
number of subjects in both groups who im- 
ported neither. Of the subjects in Group 1, 
28% imported List 1 words (B1) into recall, 
while no subjects in Group 3 imported these 
words. Moreover, 13% of the Group 3 sub- 
jects imported Y words into recall, while no 
subjects in Group 1 imported them. 

The significance test for importation of B1 
words and Y words was done separately, due 
to the small expected frequencies. When 
Groups 1 and 3 are compared on the number 
of subjects importing B1 words, χ² = 12.03, 
which, for df = 1, is significant at the .001 
level. When these two groups are compared 
for number of subjects importing Y words by 
Fisher’s exact probability test, p = .03. 

Table 5 also presents the data for Groups 2 
and 4. Forty-eight percent of the Group 2 
subjects imported B2 words, while only 3% 
(one subject) in Group 4 imported a B2 
word. On the other hand, 8% of the subjects 
in Group 4 imported Y words, while no sub- 
ject in Group 2 imported these. Again, the 
significance tests were done separately. When 
Groups 2 and 4 are compared for importation 
of B2 words, χ² = 19.44, which is significant 
at the .001 level. When these two groups are 
compared by Fisher’s test for importation of 
Y words, p = .149. 
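
The two Fisher probabilities reported for the Y words can be reproduced from the Table 5 frequencies. The sketch below is an illustration rather than the original computation: it assumes a one-tailed test, the function name `fisher_one_tailed` is introduced here, and the Group 3 count of 5 importers is taken as N = 38 less the 33 subjects importing neither (13% of the group).

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """One-tailed Fisher exact probability for the table [[a, b], [c, d]]:
    the chance of a or fewer cases in the first cell, margins held fixed."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)
    favorable = sum(comb(row1, k) * comb(row2, col1 - k) for k in range(a + 1))
    return favorable / total


# Rows: experimental group, control group; columns: imported Y, did not import Y
p_groups_1_3 = fisher_one_tailed(0, 36, 5, 33)  # Group 1 (N = 36) vs. Group 3 (N = 38)
p_groups_2_4 = fisher_one_tailed(0, 31, 3, 33)  # Group 2 (N = 31) vs. Group 4 (N = 36)

print(f"Groups 1 vs. 3: p = {p_groups_1_3:.3f}")
print(f"Groups 2 vs. 4: p = {p_groups_2_4:.3f}")
```

Under these assumptions the two probabilities round to the values given in the text, .03 and .149.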


Discussion 


Does directing attention to one word in 
an associative constellation activate other 
words in that constellation? Does it do so to 
a degree sufficient to influence overt behavior? 
When the overt behavior studied is measured 
by a word-association test, the answer to 
the second question is negative. To test 
Hypothesis I, a particular word was chosen to 
be primed. This hub word was chosen either 
because it was a stimulus which elicited the 
words of the constellation (Experiment I) 
or because it was elicited by the words of the 
constellation as a response (Experiment II). 
In neither case did presentation of the con- 
stellation words influence the frequency with 
which the primed-for hub word was given as 
a response. 

The answer to the first question above can- 
not be determined. The predicted behavior 
perhaps did not occur because the activation 
of other words in the constellation did not 
occur; or perhaps it did occur, but not to a 
degree sufficient to be picked up by the 
measure used.⁴ If this latter alternative is 
correct, then we should look to other response 
measures. 

The importation data of Hypothesis II 
provide that opportunity. In this case, presen- 
tation of associatively related words caused 
the hub word to be recalled as part of the 
original list. 

The confirmation of Hypothesis III agrees 
with the findings of previous direct priming 
experiments. Directing attention to certain 
words increases the likelihood of their being 
given as responses on a subsequent association 
test. 

These findings have relevance for the theory 
of memory presented at the beginning of this 


4 The low A-B associative strength may be another 
factor involved. 




paper. That theory stated that forgotten 
events or words can be brought to con- 
sciousness if one makes use of the associative 
links which exist between the forgotten event 
and memories which are available to con- 
sciousness. Furthermore, it stated that this 
recall can be facilitated by actively directing 
the individual’s attention to those associatively 
linked memories. 

The present experiments have provided a 
miniature experimental model and method to 
test these assumptions. They have shown that 
directing an individual’s attention (via in- 
structions to learn) to a group of words which 
are associatively related to another, never- 
presented word, causes that latter word to 
be remembered as having been presented. 
That is, use of this method activates an 
internal memory and raises it to conscious- 
ness; in the absence of this method, the 
memory does not become conscious. 
EFFECT OF ALCOHOL ON THE GUILFORD-ZIMMERMAN 


SCALES OF EXTRAVERSION ¹ 


CHARLES A. HAERTZEN AND EDWARD J. MINER 


United States Public Health Service Hospital, Lexington, Kentucky 


According to Eysenck’s theory depressant drugs should increase extraversion. 
80 postnarcotic addicts were given the Guilford-Zimmerman Temperament 
Survey under no-drug and alcohol (3.0 cc/kg of 30% alcohol) conditions. No 
significant differences were found on scales that measure extraversion-introversion. 
Lack of significance for the Rhathymia (R) scale was associated with 
lack of significance for items. The R scale was not significantly correlated with 
the Addiction Research Center Inventory scales for alcohol or amphetamine. 
Therefore, no direct confirmation of Eysenck’s theory was found. However, it 
was suggested that some extraversion (R) items as well as items from other 
scales would be sensitive to effects of drugs if they were rewritten so as to 
suggest reactivity or current psychological status. 


According to Eysenck (1957) central nervous 
system depressants such as the alcohols and the 
barbiturates should “increase cortical inhibition, 
decrease cortical excitation and thereby produce 
extraverted behavior patterns [p. 229].” Furthermore, 
subjects who are more extraverted, or who 
belong to a clinical group that exemplifies extraversion 
should be more susceptible to the effects 
of alcohol than those who are introverted, or who 
exemplify introversion. The basic theory has been 
elaborated (Eysenck, 1960; Franks, Holden, & 
Phillips, 1961) and evaluated in a number of 
ways (Bartholomew & Marley, 1959; Carrigan, 
1960; Honigfeld, 1962). But only a few studies 
are concerned with the measurement of change 
of extraversion on inventories. Thus, Laverty and 
Franks (1956) found that sodium amytal, a barbiturate 
depressant, increased extraversion on the 
R scale of the Guilford-Zimmerman Temperament 
Survey. Haertzen and Hill (1959), however, 
found that pentobarbital did not increase extraversion 
in postaddicts on the MMPI scales 
(Hathaway & McKinley, 1951) that are presumed 
to measure extraversion (Carrigan, 1960). 
Pichot and Reiman (1959) also failed to find a 
change on the R scale using cognac. 

The current study was designed to determine 
whether addicts who are presumed to be extraverted 
would show more extraversion as an effect 

Tr 

Hae research was carried out under NIMH ARC 

for ph: 0. 57-C. D. E. Rosenberg examined subjects 

ae va and mental fitness. J. Gay contracted for 

EDA the GZTS scales. Statistical analyses were 

ons a with the assistance of S. O. Navarro, 
puting Center, University of Kentucky. 


‘of alcohol than the more introverted subjects 
tested by Laverty and Franks (1956). 


METHOD 

Subjects 

The 80 male narcotic postaddict subjects (mean age = 29.0, SD = 4.88)
used in this study were part of a larger group of 219 subjects who were used
to develop empirical drug scales (Hill, Haertzen, Wolbach, & Miner, 1963)
for the Addiction Research Center Inventory (ARCI; Haertzen, Hill, &
Belleville, 1963; Hill, Haertzen, & Belleville, 1958). All are prisoners
[...] narcotic laws. Thirty percent of the subjects were white and the
remainder were Negro. Addicts are average in intelligence and are
predominantly diagnosed as psychopaths (single spike on the Psychopathic
Deviate scale of the MMPI) but additional differentiation is possible with
the remaining MMPI scales. Subjects who had a history of ulcers, hepatitis,
or hostility associated with the use of alcohol were rejected for the
alcohol tests in particular.


Procedure 


The Guilford-Zimmerman Temperament Survey (GZTS; Guilford & Zimmerman,
1949) was given to subjects under no-drug conditions and after ethanol. A
solution of 95% alcohol was diluted to 30% alcohol with water. The no-drug
test was administered on the first day. The GZTS administered under alcohol
was randomized among 14 other tests that were scheduled at weekly intervals
(Hill et al., 1963). At 8 A.M., subjects were given 3.0 cc/kg of body weight
of 30% ethanol which was to be drunk in 15 minutes. Subjects were also given
a maintaining dose of .239 cc/kg of 30% alcohol at intervals of 20 minutes,
or until the GZTS (started at 9 A.M.) was completed. If a subject appeared
overly intoxicated or complained of
TABLE 1
MEANS AND STANDARD DEVIATIONS OF GUILFORD-ZIMMERMAN SCALES

                                                                       No drug a     Alcohol a
Scale  High score                      Low score                       M      SD     M      SD
G      General activity                Inactivity                      17.4   5.1    ...    4.7
R      Restraint                       Rhathymia-impulsive             15.4   4.4    ...    3.9
A      Ascendance                      Submissive                      18.9   3.8    ...    4.5
S      Sociability                     Shyness                         20.8   4.6    ...    ...
E      Emotional stability             Depression                      18.4   4.7    ...    ...
O      Objectivity                     Subjectivity                    18.2   5.1    ...    ...
F      Friendliness                    Hostility                       11.8   5.6    ...    ...
T      Thoughtfulness                  Thoughtlessness-extroversion    19.6   ...    ...    ...
P      Personal relations-cooperative  Criticalness-intolerance        15.2   4.9    ...    ...
M      Masculinity                     Femininity                      18.1   3.9    ...    ...

a Raw scores were used.
* p < .05, df = 1/78.



side effects, the maintaining dose was discontinued. Subjects knew they
were receiving alcohol but they did not know the names of six other drugs
that were given. A no-drug condition was used as a control rather than
placebo because preliminary studies with the ARCI indicated that postaddicts
did not show placebo effects. Subjects were also given Cattell's 16
Personality Factor Questionnaire (16 PF; Cattell, 1957) on the last day of
testing, using standard instructions under no-drug conditions. All tests
were given in an air-conditioned day room. Subjects were allowed to smoke
and make minor comments to each other.
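
The dosing arithmetic in the Procedure can be made concrete with a short sketch. It assumes that the percentages are by volume and that volumes add on mixing (which ignores the slight contraction of alcohol-water mixtures). The 70-kg body weight and the function name `ethanol_dose` are hypothetical illustrations, not values from the study.

```python
def ethanol_dose(weight_kg, cc_per_kg=3.0, strength=0.30, stock_strength=0.95):
    """Return (solution_cc, pure_ethanol_cc, stock_cc) for one loading dose.

    solution_cc     -- volume of the 30% solution the subject drinks
    pure_ethanol_cc -- ethanol contained in that volume
    stock_cc        -- volume of 95% stock needed to prepare it
    Assumes percent by volume and additive volumes (an approximation).
    """
    solution_cc = cc_per_kg * weight_kg
    pure_ethanol_cc = solution_cc * strength
    stock_cc = pure_ethanol_cc / stock_strength
    return solution_cc, pure_ethanol_cc, stock_cc


# Hypothetical 70-kg subject receiving the 3.0 cc/kg loading dose.
solution, ethanol, stock = ethanol_dose(70.0)
print(round(solution, 1), round(ethanol, 1), round(stock, 1))
```

Under these assumptions the loading dose for such a subject is about 210 cc of solution containing about 63 cc of ethanol. Passing `cc_per_kg=0.239` gives the size of each maintaining dose in the same way.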


Statistical Analysis 


All of the GZTS scales were scored. Differences be- 
tween the no-drug and alcohol conditions were tested 
using analysis of variance for paired replicates (Ed- 
wards, 1957). Other incidental analyses will be given 
brief mention in the Discussion. 
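
As a rough illustration of the analysis named above, the following sketch computes the F ratio for two conditions measured on the same subjects. With only two conditions this F equals the square of the paired t statistic. The scores are invented for illustration, and the function name `paired_replicates_f` is hypothetical; this is not the authors' computation.

```python
def paired_replicates_f(no_drug, drug):
    """F ratio for two conditions measured on the same subjects.

    Treats subjects as blocks: the condition effect is tested against the
    subject-by-condition interaction, giving df = (1, n - 1).
    """
    if len(no_drug) != len(drug) or len(drug) < 2:
        raise ValueError("need two equal-length lists with at least 2 subjects")
    n = len(drug)
    diffs = [d - c for c, d in zip(no_drug, drug)]
    mean_diff = sum(diffs) / n
    ss_conditions = n * mean_diff ** 2 / 2.0
    ss_error = sum((x - mean_diff) ** 2 for x in diffs) / 2.0
    if ss_error == 0:
        raise ValueError("all differences are identical; F is undefined")
    df_error = n - 1
    f_ratio = ss_conditions / (ss_error / df_error)
    return f_ratio, 1, df_error


# Invented raw scores for five subjects (not data from the study).
no_drug_scores = [15, 18, 12, 20, 16]
alcohol_scores = [16, 17, 14, 21, 15]
f_ratio, df_num, df_den = paired_replicates_f(no_drug_scores, alcohol_scores)
print(round(f_ratio, 3), df_num, df_den)
```

Because every subject serves as his own control, stable differences between subjects drop out of the error term; only the variability of the within-subject differences counts against the condition effect.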


RESULTS 


In this study no blood alcohol concentrations 
were determined. The effectiveness of the dose 
and time for administering the GZTS was estab- 
lished in a pilot study using a shortened form of 
the ARCI and the Purdue Pegboard (Tiffin, 
1948). Alcohol impairs performance on the Pur- 
due Pegboard. 

Table 1 presents the mean raw scores and the 
significance of the difference between the no-drug 
and alcohol conditions. As can be seen from 
Table 1, the difference between the no-drug and 
alcohol conditions is not significant on the R 
scale nor the G, A, S, and T scales that are 
sometimes associated with extraversion (Carrigan, 
1960). Significant, but nonpractical differences 
were found on objectivity and masculinity. 
Table 1 also gives the label which identifies the meaning of high and low
scores on various scales. High scores on the GZTS generally indicate
positive attributes.


DISCUSSION


According to Eysenck's (1957) theory, depressant drugs such as alcohol
should increase extraversion and extraverted subjects should experience
greater effects from alcohol than introverted subjects. In a preliminary
study employing 30 subjects, alcohol produced no effect on any GZTS scale.
In order to rule out explanation on the basis of an insufficient number of
subjects, 50 new subjects were added to the first 30, but even with the
larger number (80) no significant differences were found on extraversion
scales though nonpractical differences were found on scales for objectivity
and masculinity (less than one half standard deviation).

Attention in this study was focused primarily on the R scale (restraint
or its opposite, rhathymia) as a primary measure of Eysenck's concept of
extraversion. A preliminary attempt was also made to determine why alcohol
did not increase extraversion. Lack of significance could be due to lack of
effect on items, cancellation of effects, or variation in the effect of
alcohol on different subjects. Item analysis of the R scale suggests that
lack of significance was mainly associated with lack of significance of
change on items. When items of the GZTS were classified according to the
typology of Haertzen et al. (1963) it was found that most items fell in the
category of feeling; items in this category are most resistive to change.
Certain subtypes within this category which are sensitive to drugs, such as
cognition, mood, anxiety, and attitudes about the test, are also lacking in
the GZTS. The GZTS is noticeably lacking in items on sensation and
perception and bodily symptoms that are most sensitive to drugs including
depressants such as alcohol or pentobarbital. GZTS items also suggest more
permanent characteristics. Such items are more resistive to change than
those which suggest reactivity in the present. Hence, it would seem that
lack of change on the GZTS is partially due to item characteristics.
Possibly some GZTS items including extraversion items would reflect drug
effects if they were rewritten to suggest more reactivity, current status,
or contrasts with more stable traits. However, such alteration would
possibly also reduce the relationship with extraversion or increase the
relationship with social undesirability, since traits expressed in the
present tense are endorsed less frequently (fewer true responses) than those
in the past (Haertzen et al., 1963).

In evaluating the importance of initial level of 
extraversion on the effect of alcohol, it was found 
that subjects who were classified high or low on 
the Impulsivity scale of the ARCI (Haertzen, 
1963) and the Surgency scale of the 16 PF (Cat- 
tell, 1957) did not exhibit any significant effect 
to alcohol on the R scale. 

It may be argued that although the R scale in its present form may not
reflect alcohol effects, it may either be related to performance on drug
effect scales or drug-placebo differences on drug scales. With regard to the
former possibility, drug-sensitive scales of the ARCI did not have
impressive loadings on a factor of which R was the main marker. Also the
correlations (determined under no-drug conditions) between R and the ARCI
alcohol (r = -.12) and amphetamine (r = -.09) empirical scales were not
significant. Slightly lower negative correlations were found between R
(no-drug condition) and the same empirical scales under alcohol (same dose
as used in this study). Thus, no evidence for an interaction between R and
the effect of alcohol was evident.
Thus, in conclusion, alcohol does not alter extraversion as measured by
the R scale, nor does the R scale predict subjective effects of alcohol. In
this sense, the findings agree with those of Pichot and Reiman (1959). They
provide no confirmation of predictions derived from Eysenck's (1957) theory
or the finding of Laverty and Franks (1956) because neither extraverted nor
introverted subjects showed an increase in extraversion. Lack of change may
be due to item characteristics that are associated with resistance to
change. In this sense the present study confirms the significance of drug
sensitive item characteristics (Haertzen et al., 1963).

The relevance of other possible explanations merits further
investigation. Thus, the present results may be due in part to the
sophistication of the subjects. The customary placebo effect (Beecher, 1959)
was not found on the ARCI scales (Hill et al., 1963). Though sophistication,
here regarded as equivalent to a history of narcotic addiction, apparently
accounts for skill in labeling drugs given in double-blind experiments or
certain qualitative differences in the pattern of subjective effects
(Beecher, 1959), its relevance is discounted in this study because Pichot
and Reiman (1959) failed to find an effect from alcohol on the R scale.
Second, the drug itself
may be a factor. A theory should not stand on 
the basis of the use of one drug such as sodium 
amytal, the one used by Laverty and Franks 
(1956). Thus, Eysenck (1957) framed his theory 
both in terms of depressants and stimulants as 
well as the alcohols and barbiturates. The use of 
alcohol merely broadens the definition of depres- 
sants over that of sodium amytal. There are fac- 
tors of similarity and difference in the subjective 
effects of alcohol and pentobarbital (Haertzen, 
1962), but the scales representing these factors 
are not significantly correlated with the R scale. 
The applicability of Eysenck’s theory in the pre- 
diction of the effect of stimulants and depres- 
sants on the R scale is further weakened by a 
finding of a nonsignificant effect of amphetamine 
(unpublished study). 
JUDGMENT AND FEEDBACK IN A QUASI-CLINICAL SITUATION 1


JOSEPH R. NEWTON 2


University of Colorado


Ss estimated college students' grade averages from 4-sentence descriptions on
2 occasions: in a naive state and after feedback. The feedback consisted of 5
combinations of the correlation coefficients of Brunswik's lens model. Results
show that Ss provided with the ecological correlations increased their
achievement. An examination of the components of achievement indicates that
Ss increased achievement by improving their match between the utilization
correlations and the ecological correlations.


There is substantial evidence showing that 
perceptual-motor performance improves when 
knowledge of results, or feedback, is provided 
for subjects (see, for example, Ammons, 1956). 
More problematical is the effect of feedback when performance consists of a
judgment based on clinical stimulus materials. For example, is immediate
feedback as important for clinical judgment as it is for perceptual-motor
performance? Should feedback be specific to the judgment or may it consist
of an abstract summary statement of performance over a set of trials?

The present study was designed to explore the effects of an abstract form
of feedback on judgment in a quasi-clinical situation. The feedback
consisted of various combinations of correlation coefficients calculated
from the variables of the lens model (Brunswik, 1955, 1956; see Hammond,
1955, for an application of the lens model to the clinical method). The
accuracy of judgment was assessed both before feedback and following
feedback. Moreover, a recent statistical development by Hursch, Hammond, and
Hursch (1964) permits an analytical examination of the components of the
subjects' judgmental accuracy. This examination indicates how the subjects'
use of the stimulus materials changed after feedback, and shows the manner
in which some of the subjects improved their accuracy of judgment.



METHOD 


The study was designed within the framework of Brunswik's (1956)
probabilistic functionalism. Accordingly, the lens model was employed and
the cues had probabilistic relations with both the distal variable (the
criterion) and the subject's responses (see Figure 1).

Subjects estimated grade averages of 53 students from four-sentence
personality descriptions on two occasions. In the first session the subjects
estimated grade averages according to their "first impression." Subjects
then were randomly assigned to one of the feedback conditions indicated in
Figure 2. The feedback conditions consisted of one or more correlation
coefficients of the lens model. For example: Feedback I consisted of a
subject's achievement correlation obtained from the first session; Feedback
IV consisted of a subject's achievement correlation and his four utilization
correlations obtained from the first session, and also the four ecological
correlations which had been previously calculated.

In the second session subjects were provided with the correlation
coefficients and again estimated grade averages of the 53 students. The
effect of the correlation coefficients on the accuracy of the subjects'
inferences was measured by comparing the achievement correlation for each
subject in the prefeedback and feedback conditions. An analytical
examination also was performed by calculating the components of achievement
(see Results).
[Figure 1: diagram of the lens model, relating the distal variable, the
cues, and the estimates of the distal variable through the ecological
correlations, the utilization correlations, and the achievement correlation
(r_a).]

FIG. 1. The lens model. (The achievement correlation shows the relation
between the subject's estimates of the distal variable and the actual values
of the distal variable, and indicates the subject's accuracy of judgment, or
achievement. The utilization correlations consist of the relations between
each cue and the subject's estimates of the distal variable, and are one
indication of the subject's reliance on the cues. The ecological
correlations show the relation between each cue and the distal variable.)


[Figure 2: diagram with panels for Feedback I, II, III, IV, and V; the
Feedback V panel carries the label "set of rules."]

FIG. 2. The lines indicate the feedback for subjects in each of the five
feedback conditions.
TABLE 1

ECOLOGICAL CORRELATIONS, INTERCORRELATIONS, AND
R_e FOR THE 53 APPLICATIONS FOR ADMISSION

        IQ     HSR    CBS    PR     DV
IQ      -
HSR     .54    -
CBS     .54    .46    -
PR      .52    .68    .28    -
DV      .38    .62    .65    .39    -

R_e = .75, R_e^2 = .56

Note. R_e is the multiple correlation coefficient for the
ecology, found by using the distal variable as the criterion and
the cues as predictors.
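
The note to Table 1 can be checked with a small sketch that derives the ecological beta weights and R_e from the correlation matrix. The values below are the entries of Table 1 as read from the scan (the cue order IQ, HSR, CBS, PR is taken from the column headings), and the helper names `solve_linear` and `multiple_r_squared` are hypothetical. It is an illustration of the standard regression identity R^2 = sum of beta_i times r_i, not the author's own computation.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


# Cue intercorrelations (IQ, HSR, CBS, PR) as read from Table 1.
CUE_INTERCORRELATIONS = [
    [1.00, 0.54, 0.54, 0.52],
    [0.54, 1.00, 0.46, 0.68],
    [0.54, 0.46, 1.00, 0.28],
    [0.52, 0.68, 0.28, 1.00],
]
# Correlation of each cue with the distal variable (DV row of Table 1).
ECOLOGICAL_CORRELATIONS = [0.38, 0.62, 0.65, 0.39]


def multiple_r_squared(intercorrelations, validities):
    """Return (R squared, beta weights) for standardized predictors."""
    betas = solve_linear(intercorrelations, validities)
    r_squared = sum(b * r for b, r in zip(betas, validities))
    return r_squared, betas


r_squared, betas = multiple_r_squared(CUE_INTERCORRELATIONS,
                                      ECOLOGICAL_CORRELATIONS)
print(round(r_squared ** 0.5, 2), [round(b, 2) for b in betas])
```

With these entries the multiple correlation comes out at roughly .75, in line with the R_e reported under the table; small discrepancies in R_e^2 are to be expected because the tabled correlations are rounded to two places.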


Subjects 


Subjects were students in introductory psychology 
classes at the University of Colorado. Participation 


in psychological research had been encouraged by 
instructors of the classes. 


Four-Sentence Personality Descriptions 


The personality descriptions consisted of four cues, each having a
different relation with the distal variable. The distal variable was
first-semester grade averages based on the 12-point scale A, A-, ... D-, F.
The cues were: a group IQ test, high school rank, a college board score, and
a high school principal's personality rating. Students' records were
selected from a recent freshman class at the University of Colorado if they
contained all four cues and the distal variable. This resulted in a pool of
106 acceptable records, which was subsequently reduced to 53 by randomly
selecting one half of the records at each of the 12 levels of the distal
variable. The final 53 records, used in both the prefeedback and feedback
conditions, yielded the correlation matrix indicated in Table 1.

For purposes of presentation a brief sentence was written for each of the
four cues, and the 53 students' records were randomly compiled into a packet
of "personality descriptions." A box was provided in the upper right-hand
corner of the descriptions for the subjects' estimates of grade average.


Feedback


All subjects (N = 99) participated in the first, prefeedback (PFB)
session. Subjects' estimates of grade averages from this session provided
the data used to calculate the feedback for the second session. The second
sessions consisted of the following types of feedback: Feedback I (FB_I):
each subject's achievement correlation (n = 21); Feedback II (FB_II): each
subject's achievement correlation and his four utilization correlations
(n = 20); Feedback III (FB_III): each subject's achievement correlation and
the four ecological correlations (n = 19); Feedback IV (FB_IV): each
subject's achievement correlation, his four utilization correlations, and
the four ecological correlations (n = 20); Feedback V (FB_V): same feedback
as FB_IV. However, in FB_V subjects also completed a questionnaire which
required them to consider critically the feedback and form a "set of rules"
with which to estimate grade averages (n = 19).


Procedure


Both PFB and FB conditions were administered in group settings. Subjects
were told the personality descriptions contained information about students
[...]

In the second session 3 weeks later subjects were urged to improve on
their previous estimates. While the instructions differed in detail for the
five feedback conditions, they all contained: a brief explanation of the
concept of a correlation coefficient; an explanation of the specific
correlation coefficient(s) that was provided the subjects of that condition;
the correlation coefficient(s), itself, contained in boxes within the
subjects' instructions (for complete instructions see Newton, 1963). The
subjects were given time to study the instructions and feedback, questions
were elicited by the experimenter, and subjects were told they could refer
to the instructions at any time. Subjects then again estimated grade
averages and when they were finished the study was fully explained to them.


TABLE 2
MEAN, MEDIAN, AND RANGE OF r_a, PFB, AND FB

                               Condition
               I           II          III          IV           V
           PFB   FB    PFB   FB    PFB   FB    PFB   FB    PFB   FB
M          .60   .61   .59   .60   ...   ...   ...   ...   ...   ...
Mdn.       .62   .62   .58   .60   ...   ...   ...   ...   ...   ...
Range a    .68   ...   .67   .70   .67   .74   ...   ...   ...   ...
           .43   .52   .53   .47   .31   .51   ...   ...   ...   ...

a This indicates the highest and lowest value obtained by subjects of a
condition, and not the highest and lowest value obtained by one subject of a
condition.
* p < .05, t test, two-tailed.
** p < .01, t test, two-tailed.


RESULTS 
Effects of Feedback on Judgmental Accuracy 


The achievement correlation (r_a) for each subject, PFB and FB, was
calculated. The correlation coefficients were transformed into Fisher's z
coefficients and a t test for correlated means was performed. Table 2
summarizes the results of this analysis. Small but statistically significant
differences in the accuracy of judgment were manifested by subjects in
FB_III, FB_IV, and FB_V. These conditions, in contrast with FB_I and FB_II,
were similar in that they provided subjects with the ecological
correlations, that is, information regarding the relations between cues and
the distal variable. It is evident, then, that providing subjects with the
ecological correlations resulted in greater judgmental accuracy; an analysis
of the components of achievement indicates the specific effects of this
information.
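
The analysis described above (Fisher's z transformation followed by a t test for correlated means) can be sketched as follows. The achievement correlations in the example are invented for illustration, and the function names are hypothetical; the sketch shows the general technique rather than the author's exact computation.

```python
import math


def fisher_z(r):
    """Fisher's r-to-z transformation (inverse hyperbolic tangent)."""
    return math.atanh(r)


def correlated_t(pre_r, post_r):
    """t test for correlated means on Fisher-transformed correlations.

    pre_r and post_r hold each subject's achievement correlation before and
    after feedback, in the same subject order. Returns (t, df).
    """
    if len(pre_r) != len(post_r) or len(pre_r) < 2:
        raise ValueError("need two equal-length lists with at least 2 subjects")
    diffs = [fisher_z(b) - fisher_z(a) for a, b in zip(pre_r, post_r)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)
    t = mean_d / math.sqrt(var_d / n)
    return t, n - 1


# Invented achievement correlations for five subjects (not study data).
prefeedback = [0.55, 0.60, 0.48, 0.62, 0.51]
feedback = [0.63, 0.66, 0.50, 0.70, 0.58]
t_value, df = correlated_t(prefeedback, feedback)
print(round(t_value, 2), df)
```

The transformation is applied before averaging because the sampling distribution of r is skewed, especially for values far from zero, whereas z is approximately normal.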


Components of Achievement 


Analysis of the components of achievement is made possible by the recent
work of Hursch et al. (1964). They have shown that an achievement
correlation is a function of three different, though related, components
when R_e is controlled as in the present study: the degree of correspondence
between the subject's utilization correlations and the ecological
correlations; the size of R_s^2, which indicates the extent to which the
linear regression model accounts for the variability in the subject's
responses; the extent to which the subject uses some nonlinear pattern in
the configuration of stimuli. The lens model of the present study (Figure 1,
Table 1) is an example of their Case VI, in which R_e^2 ≠ 1.00 and the cues
are intercorrelated. Their Formula 27a is the appropriate formula:


$$
r_a = \frac{R_e^2 + R_s^2 - \sum_i (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i})}{2} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}
$$


where:


r_a = an achievement correlation

Σ (β_e,i - β_s,i)(r_e,i - r_s,i) = the differences between the subject's
beta weights (β_s,i) and the beta weights for the environment (β_e,i),
multiplied by the differences between the subject's utilization correlations
(r_s,i) and the ecological correlations (r_e,i), summed. This term will be
referred to as Σd, representing "sum of the differences."

C = a Pearson product-moment correlation coefficient between the variance
unaccounted for by R_e^2 and the variance unaccounted for by R_s^2. That is,
the correlation between the residuals 1 - R_e^2 and 1 - R_s^2. The size of C
indicates the extent to which the subject is responding to a nonlinear
relationship ("pattern") between the cues and the distal variable.


On the right side of Formula 27a the three terms which may vary with the
subject's estimates are R_s^2, C, and Σd. The remainder of the Results
section considers each of these statistics.
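
To see how the three components combine, the following sketch evaluates the right side of Formula 27a for given values of R_e^2, R_s^2, Σd, and C. Only R_e^2 = .56 comes from Table 1; the other inputs are hypothetical values chosen for illustration, as is the function name `achievement_correlation`.

```python
import math


def achievement_correlation(r_e_sq, r_s_sq, sum_d, c):
    """Evaluate r_a from its components.

    r_e_sq -- squared multiple correlation for the ecology (R_e^2)
    r_s_sq -- squared multiple correlation for the subject (R_s^2)
    sum_d  -- the "sum of the differences" term
    c      -- correlation between the two sets of residuals (C)
    """
    linear_part = (r_e_sq + r_s_sq - sum_d) / 2.0
    pattern_part = c * math.sqrt((1.0 - r_e_sq) * (1.0 - r_s_sq))
    return linear_part + pattern_part


# R_e^2 = .56 is from Table 1; the remaining values are hypothetical.
no_pattern = achievement_correlation(0.56, 0.85, 0.15, 0.0)
with_pattern = achievement_correlation(0.56, 0.85, 0.15, 0.10)
print(round(no_pattern, 3), round(with_pattern, 3))
```

Holding R_e^2 fixed, r_a rises when R_s^2 increases, when Σd shrinks (a closer match between utilization and ecological correlations), or when C is positive. Because the pattern term is scaled by the unexplained variance in both systems, a highly linear subject gains little from it.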

R_s^2. The median and range of R_s^2 for the PFB and FB conditions are
summarized in Table 3. While significant differences in R_s^2 are indicated
for FB_I, FB_III, and FB_V, a comparison of the PFB with FB columns shows
that this finding must be interpreted with caution. This is because the
variability of the values of R_s^2, PFB (extending from .80 to .88) is
considerable relative to the variability of the values of R_s^2, FB
(extending from .88 to .90). This shows that the statistical significance of
R_s^2 is due to the variability in the values of R_s^2, PFB, and not to the
differential effects of feedback. Since the median FB R_s^2 values are
larger than the median PFB R_s^2 values, the only sound conclusion indicated
by Table 3 is that subjects in all FB conditions tended to become more
linear.


TABLE 3
MEDIAN AND RANGE OF R_s^2, PFB, AND FB

                               Condition
               I           II          III          IV           V
           PFB   FB    PFB   FB    PFB   FB    PFB   FB    PFB   FB
Mdn.       ...   ...   ...   ...   ...   ...   ...   ...   ...   ...
Range a    ...   ...   ...   .95   .93   .93   ...   ...   .92   .94
           .61   .73   .55   .65   .68   .78   .64   .75   .56   .76

a This indicates the highest and lowest value obtained by subjects of a
condition, and not the highest and lowest value obtained by one subject of a
condition.
* p < .01, sign test, two-tailed.

C. The mean, median, and range of C, PFB, and FB, are summarized in
Table 4. The correlations were transformed into Fisher's z coefficients and
t tests for correlated means were not significant for any of the PFB versus
FB comparisons.

Σd. The statistic Σd provides an index of the extent to which the
subject's utilization correlations match ecological correlations: the larger
the value of Σd, the greater is the discrepancy between the two sets of
correlations; the smaller the value of Σd, the closer is the match between
the two sets of correlations. The mean, median, and standard deviation of
Σd, PFB, and FB, are summarized in Table 5. A statistically significant
poorer match between utilization correlations and ecological correlations is
indicated for FB_[?]. This shows that some subjects in this condition
matched utilization correlations with ecological correlations better before
feedback than after feedback. For FB_III, FB_IV, and FB_V, however, there is
a statistically significant lower Σd, indicating a better match between the
two sets of correlations. Since FB_III, FB_IV, and FB_V had in common the
information provided by the ecological correlations, this indicates that
subjects in these conditions used such information to increase their match
between the utilization correlations and the ecological correlations.


This interpretation may appear oversimplified since the beta weights also
enter into the calculation of Σd. However, since β_e,i and β_s,i are solely
a function of the ecological and utilization correlations (and the constant
values of the intercorrelations) such an interpretation is warranted.


TABLE 4
MEAN, MEDIAN, AND RANGE OF C, PFB, AND FB

                               Condition
               I           II          III          IV           V
           PFB   FB    PFB   FB    PFB   FB    PFB   FB    PFB   FB
M          ...   ...   ...  -.02   ...   ...   ...   ...   ...   ...
Mdn.       .00  -.08   .02  -.02   ...   ...   ...   ...   ...   ...
Range a    .17   .31   .20   .35   ...   ...   ...   ...   ...   ...
          -.31  -.35  -.20  -.16   ...   ...   ...   ...   ...   ...

a This indicates the highest and lowest value obtained by subjects of a
condition, and not the highest and lowest value obtained by one subject of a
condition.


TABLE 5
MEAN, MEDIAN, AND STANDARD DEVIATION OF Σd, PFB, AND FB

                               Condition
               I           II          III          IV           V
           PFB   FB    PFB   FB    PFB   FB    PFB   FB    PFB   FB
M          ...   ...   ...   ...   ...   ...   ...   ...   ...   ...
Mdn.       ...   ...   ...   ...   ...   ...   ...   ...   ...   ...
SD        .072  .080  .091  .098  .057  .056  .082  .058  .057  .120

* p < .02, Wilcoxon matched-pairs signed-ranks test, two-tailed.
** p < .01, Wilcoxon matched-pairs signed-ranks test, two-tailed.


DISCUSSION


Analysis of the effects of feedback on r_a shows that only subjects who
received the ecological correlations as feedback increased their accuracy of
judgment. The manner in which the subjects' judgments were influenced by the
ecological correlations is indicated by the components of r_a.

The results concerning R_s^2 indicate that subjects became more linear in
each feedback session, regardless of the type of feedback. The extent of the
linearity of subjects is particularly noteworthy. Both in the PFB condition
(lowest value of R_s^2 = .55) and FB conditions (lowest value of
R_s^2 = .65), the multiple regression model accounts for better than half of
the variance of subjects' responses. This result further corroborates the
findings of other investigators, for example, Todd (1954), Hoffman (1960),
Meehl (1960), Anderson (1962), Lee and Tucker (1962), Grebstein (1963), that
a linear additive model, the multiple regression equation, accounts for a
substantial proportion of the variance of subjects' responses. The success
with which the equation predicts response variance, despite dissimilar
instructions and widely varying stimulus materials, points out the unique
and important contribution of this technique for the study of judgmental
processes.

The value of C was not significantly different for any of the PFB versus
FB comparisons. This result shows that subjects' use of pattern in
estimating grade averages did not vary systematically with the different
kinds of feedback. The ranges of C for the different PFB-FB conditions in
Table 4 (for example, from -.35 to .31), however, are large enough to
indicate that pattern effects were not inconsequential. That is, even though
subjects' use of pattern did not vary with types of feedback, the ranges of
C in both the PFB and FB conditions indicate that some subjects relied on
pattern to an important extent. This finding suggests several problems for
future investigations of judgment: What are the conditions which result in
the subject basing his judgments on pattern in the configuration of stimuli?
What are the conditions which differentiate between judgments based on
pattern (C) and judgments based on linearity (R_s^2)? Are the above
conditions a result of the statistical relations among the stimuli, are they
owing to different instructions for the subjects (for example, instructions
stressing an intuitive or analytical mental set), or is there a subtle
interplay between stimuli and instructions which prompts the subject to
resort to pattern and linearity?

The results of Σd provided the clearest evidence for the specific effect
of (that is, feedback of) the ecological correlations: Σd was significantly
smaller for subjects who received this information, and those same subjects
manifested significantly more accurate estimates of grade averages. It is
evident, then, that subjects interpreted the ecological correlations and
adjusted their use of cues to correspond more closely with the values of
those correlations. (This finding obtains despite the fact that all subjects
were naive with respect to correlation coefficients before this study, and
only received an 8-minute explanation of the meaning of correlation
coefficients!) In sum, these results show that correlation coefficients are
a meaningful mode for conveying information about the relations between cues
and the distal variable.

The findings of this study also hold relevance for the problem of clinical
training. The results indicate that we can expect clinicians to improve
their judgment by providing them with the correlation coefficients between
the tests and the criterion (that is, the ecological correlations),


even when the variables are grossly measured. 
While Holt’s (1958) efforts in this direction were 
unsuccessful, the present results indicate that the 
procedure requires further consideration. The 
disparity between the findings of Holt and the 
present investigation may be owing to the much 
more complex stimulus materials (for example, 
Rorschach, TAT) employed by Holt. If this is the 
case it suggests that knowledge of the ecological 
correlations improves clinical judgment, when 
that judgment is based on simple, well-structured 
diagnostic materials. The use of complex diag- 
nostic materials, however, introduces so many 
variables that the beneficial effects of the ecolog- 
ical correlations may be neutralized and judg- 
ment fails to improve. 

In contrast to the naive subjects of the present 
study, judgments rendered by statistically so- 
phisticated clinicians should benefit from both 
the ecological correlations and feedback of the 
utilization correlations. The former provides in- 
formation regarding the relations between the 
“criterion and the tests”; the latter provides in- 
formation regarding the relations between the 
“clinician and the tests.” Clinical judgment 
should be facilitated by providing a clinician not 
only with the test-criterion information but also 
with a past record of his use of the tests. This 
paradigm would make available to a clinician a 
measure of the correspondence between the 
validity of each test and his use of each test. 
Thus, a training situation is made possible in 
which a clinician may regulate his reliance on a 
test to coincide with the validity of the test. 
COMPETITIVE AND NONCOMPETITIVE PERFORMANCE IN RELATION
TO ACHIEVEMENT MOTIVE AND MANIFEST ANXIETY


E. DEAN RYAN AND W. L. LAKIE


University of California, Davis


This study was concerned with competitive performance of Ss differing in
n Ach [...] (French [...]


Tn the past 10 years a number of studies have 
attempted to state explicitly how individual dif- 
ferences in strength of various motives influence 
behavior in competitive situations, or in situa- 
tions where achievement is emphasized. Much of 
this work has centered around two motives, the 
achievement motive and anxiety. The strength 
of the first of these motives, need for Achieve- 
ment (n Ach), has typically been inferred from 
the content of imaginative thought and is con- 
ceived as a relatively stable disposition to strive 
for success. In general the achievement motive 
has been positively related to performance 
(Atkinson, 1958; McClelland, Atkinson, Clark, 
& Lowell, 1953). The strength of the second
motive, anxiety, has usually been inferred from
scores on the Mandler-Sarason (1952) Test
Anxiety Questionnaire (TAQ), the Taylor (1953)
Manifest Anxiety (MA) scale, or more recently
the Alpert-Haber (1960) Achievement Anxiety
Test. Typically, anxiety scores have been nega-
tively related to performance in stressful or
achievement situations (Farber & Spence, 1953;
Mandler & Sarason, 1952; Taylor & Spence,
1952). A theoretical model of how these two
motives influence risk-taking behavior has been
presented by Atkinson (1957). It has been sug-
gested that measures of n Ach indicate the
strength of a motive to approach success, while
measures of anxiety indicate the strength of a
motive to avoid failure. Subjects who score high
in n Ach and low in anxiety are assumed to be
higher in the motive to achieve success than in
the motive to avoid failure. Subjects who score
low in n Ach and high in anxiety are assumed
to be higher in the motive to avoid failure than
in the motive to achieve success (Atkinson &
Litwin, 1960; Brody, 1963).

Previous research has supported this model. 
Individuals in whom the motive to succeed was 
stronger than the motive to avoid failure worked 
longer on a final examination and got better 
grades than those persons in whom the motive to 
avoid failure was the stronger motive (Atkinson 
& Litwin, 1960). 

The present study is concerned with differences 
in performance between a noncompetitive, or 
neutral situation, and a competitive, or arousal 
situation. Atkinson (1950) in studying achieve-
ment motivation has shown that regardless of
n Ach scores, subjects completed more tasks
as the atmosphere became more achievement
orientated, with the high achievers showing the
increase most markedly. Under relaxed condi-
tions, however, the high need achievers performed
significantly worse. In studying anxiety Mandler
and Sarason (1952) found that under neutral
conditions subjects with high anxiety performed
better on the Kohs Block Design Test than did
the low-anxiety subjects. Under both failure and
success conditions, however, this situation was
reversed, with the low-anxiety group performing
best.

Thus it would be hypothesized that under 
neutral, or noncompetitive conditions the persons 
in whom the motive to avoid failure is stronger 
than the motive to succeed would perform better,
while in a competitive situation the person with
a higher motive to succeed would perform
better.

The purpose of the present study was to test 
this hypothesis by comparing face-to-face com-
petitive performance of subjects differing in need
achievement and manifest anxiety. Two groups,
one composed of subjects for whom the motive
to achieve success was assumed to be greater
than the motive to avoid failure, and the other
composed of subjects for whom the motive to
avoid failure was assumed to be stronger than
the motive to succeed, were compared on
perceptual-motor performance.


METHOD 
Subjects 


During the fall semester the Taylor MA scale was
administered to approximately 300 male university
undergraduate students enrolled in a voluntary physi-
cal education program. The following spring 20
subjects with the highest MA scale scores and 20 
subjects with the lowest MA scale scores were asked 
to participate in an experiment in motor learning. 
None of these subjects, when questioned later, were 
aware that their participation was in any way 
associated with the anxiety questionnaire. 


Apparatus 


A ring-peg test was used to measure performance 
(Lambert, 1951). The test consisted of a board 
30 X 48 millimeters in which two identical patterns
of pegs were placed opposite each other. The pegs
were 10 millimeters high. The far pattern was viewed
through a mirror, direct vision being obscured by a
15-millimeter plywood shield. On each peg were
placed 10 washers, alternately colored black and
white. A switch, mounted on the front of the board,
operated a standard electric .01 second timer.

The subject sat in front of the board in such a 
manner that the pegs on the far side were obscured 
from view by the shield, but were reflected in the 
mirror. Using only the preferred hand throughout 
the test, the subject placed the index finger on the 
switch and when ready proceeded in a specified 
movement sequence. The task was to take the top 
washer off each of the near set of pegs, and by 
looking in the mirror, place each washer on its cor- 
responding peg on the far side as quickly as possible. 
When the top washer had been removed from all 10 
pegs, the subject touched the switch, stopping the 
electric timer. This constituted one trial. The score 
for each trial was the time required to move the 
top washer from all 10 pegs to the corresponding 
pegs on the mirror-image side of the shield. 


Procedure 


The subject was seated in front of the board and 
instructions were administered. The subject was told 
the experiment was to see how people learned a 
motor skill. After questions were answered, the 
subject was given 10 trials. (The uncorrected odd- 
even reliability coefficient for the first 10 trials 
was .91.) 
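
As an illustration of what an "uncorrected" odd-even coefficient involves, the sketch below correlates each subject's mean time on odd-numbered trials with his mean time on even-numbered trials, and then applies the standard Spearman-Brown step-up for a test of double length. The function names and the trial layout (one list of trial times per subject) are assumptions made for the example; the report itself gives only the uncorrected value of .91 and does not state a corrected one.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def odd_even_reliability(trials_by_subject):
    """Uncorrected odd-even coefficient.

    trials_by_subject: one list of trial times per subject (assumed layout).
    """
    odd = [mean(t[0::2]) for t in trials_by_subject]   # trials 1, 3, 5, ...
    even = [mean(t[1::2]) for t in trials_by_subject]  # trials 2, 4, 6, ...
    return pearson_r(odd, even)


def spearman_brown(r_half):
    """Step a half-test correlation up to the full-length test."""
    return 2.0 * r_half / (1.0 + r_half)


# Applying the correction to the reported coefficient of .91:
print(round(spearman_brown(0.91), 3))  # 0.953
```

Under these assumptions the reported half-test value of .91 would correspond to a full-length reliability of roughly .95.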

At the completion of the 10 trials, a confederate, 
who acted as an assistant, told the subject that his 
scores were among the best of any person tested 
and in fact were almost identical to scores made by 
the assistant earlier in the day. Commenting on this 
“interesting and unusual” situation, the experimenter 
suggested that if both the subject and the assistant 
had time perhaps they would like to take another 
5 trials, this time actually competing against each 
other. In every case, the subject consented and
seemed eager to continue. To control for possible 
interactive effects of success or failure during com- 
petition, the subject was allowed to win 2 of the 
first 4 trials during this series. Afterward, the subject 
was administered the Test of Insight, Form I 
(French, 1958), a test of 10 short statements about 
people (i.e., “Bill always lets the other fellow win”).
Subjects were asked to explain why the people in 
TABLE 1
MEAN PERFORMANCE SCORES OF SUBJECTS CLASSIFIED ACCORDING TO n ACHIEVEMENT
AND MANIFEST ANXIETY

Group                     N    Initial   U    Competitive   U    Change   U

High n Ach               14     32.47    …       22.39      …    10.09   59*
Low n Ach                15     30.99            23.18            7.79

High anxiety             13     30.53    …       23.21      …     7.32   59*
Low anxiety              16     32.66            22.46           10.20

High anxiety-low n Ach    6     29.46    7*      23.17      …     6.26    3*
Low anxiety-high n Ach    7     33.49            21.52           11.97

Note. The initial test score is the mean of the first 10 trials. The competitive test score is the mean of the subject's competitive
trials. The change score is the mean of the difference between each subject's initial mean score and competitive mean score.

*p < .05, two-tailed.


these statements behaved as they did. The tests 
were scored by an experienced scorer according to 
the method of content analysis described by Atkinson
(1958).1


Treatment of Data 


Because of incomplete data, 5 of the 40 subjects 
tested were eliminated from the analysis. The 
remaining 35 subjects were ranked from high to 
low on n Ach, then dichotomized at the median 
(Md = 10, M = 10.82). The 6 subjects who scored
at the median were eliminated, leaving 29 subjects
for further analysis.

Performance scores for these subjects were ana-
lyzed under three conditions: high n Ach versus
low n Ach, high anxiety versus low anxiety, high
n Ach-low anxiety versus low n Ach-high anxiety.
For the noncompetitive conditions the performance
score reported is the mean of the first 10 trials,
the competitive score is the mean of the subjects’
5 competitive trials, and the change score is the
mean of the difference between each subject’s non-
competitive mean score and competitive mean score.
A two-tailed Mann-Whitney U test was used to test
the significance of all the observed differences.
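
The treatment of data described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' computation: the function names and the dictionary layout are hypothetical, ties between groups are counted as half a pair, and significance is left to published tables of critical U values (or to a library routine such as `scipy.stats.mannwhitneyu`) rather than computed here.

```python
from statistics import mean, median


def median_split(scores):
    """Split subjects at the median of `scores` (dict: subject id -> score).

    Subjects scoring exactly at the median are dropped, as in the report.
    """
    md = median(scores.values())
    high = [s for s, v in scores.items() if v > md]
    low = [s for s, v in scores.items() if v < md]
    return high, low


def change_score(initial_trials, competitive_trials):
    """Noncompetitive mean time minus competitive mean time for one subject.

    Times are in seconds, so a larger positive value means more improvement.
    """
    return mean(initial_trials) - mean(competitive_trials)


def mann_whitney_u(a, b):
    """Smaller of the two Mann-Whitney U statistics for samples a and b."""
    u_a = sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in a for y in b)
    u_b = len(a) * len(b) - u_a
    return min(u_a, u_b)


# Hypothetical change scores for two small groups (not data from the study):
group_high = [10.1, 12.4, 9.8, 11.0]
group_low = [6.2, 7.9, 8.1, 5.5]
print(mann_whitney_u(group_high, group_low))  # 0.0, the groups do not overlap
```

The smaller U is the value conventionally compared against the tabled critical value for the two group sizes; a smaller U indicates less overlap between the groups.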


RESULTS 
n Ach 


Division by n Ach resulted in 14 high achievers 
(M n Ach = 15.07, range 11-22) and 15 low 
achievers (M n Ach = 7.20, range 5-9). Mean 
performance scores for the two groups are shown 
in Table 1. All the mean performance scores 
were in the direction predicted, with the low 
n Ach group performing better on the non-
competitive test, the high n Ach group perform-
ing better on the competitive test and making
greater mean changes. Only the change score was
significant (U = 59, p < .05).

1 We wish to acknowledge the assistance of Jean
G. Peterman, University of California, Berkeley,
who scored the French Test of Insight.


Manifest Anxiety (MA) Scale 


Of the 29 subjects analyzed, 13 had high MA 
scale scores (M = 22.38, range 19-27), and 16 
had low MA scale scores (M = 4.63, range 1-10). 
Mean performance scores are shown in Table 1. 
Again, while all performances are in the direc- 
tion predicted, with high MA scale subjects 
better on the noncompetitive test, low MA scale 
subjects better in competition and having the 
greater change, only the change from initial to 
competition was significant (U = 59, p < .05). 


High n Ach-Low Anxiety versus Low n Ach- 
High Anxiety 


When subjects were classified on n Ach and
MA simultaneously, four groups resulted:
high MA-low n Ach (N = 6, M MA = …,
M n Ach = 7.0), low MA-high n Ach (N = 7, M
MA = 4.42, M n Ach = 14.72), low MA-low n
Ach (N = 9, M MA = 4.78, M n Ach = 7.33),
and high MA-high n Ach (N = 7, M MA = 22.14,
M n Ach = 15.43). This study, however, was
concerned only with those subjects scoring high
on one variable and low on the other. Mean
performance of the two groups is shown in
Table 1. Performance was in the direction pre-
dicted, with the high MA-low n Ach group per-
forming significantly better on the noncompeti-
tive test (U = 7, p < .05), and the low MA-high
n Ach group better (but not significantly so) in
competition. The change from the noncompeti-
tive session to the competitive session was sig-
nificantly greater for the high achievement group
(U = 3, p < .05).


DISCUSSION


The results of this study substantiate the 
hypothesis that an individual whose motive to
avoid failure, as measured by the Taylor MA
scale, is greater than his motive to succeed, as
measured by n Ach, will perform better in non- 
competitive situations than an individual whose 
motive to succeed is stronger. In competitive 
situations the reverse is true, with the individual 
whose motive to succeed is stronger performing 
better. 

It should be noted that the studies investi- 
gating Atkinson’s risk-taking theory have used 
the Sarason TAQ rather than the Taylor MA 
scale. The MA scale is regarded as an indicator 
of general anxiety while the TAQ is more specific 
to testing situations. While the two scales are 
positively correlated (r = .53; Raphelson, 1957) 
the TAQ might be more appropriate as a measure 
of the fear of failure. As Raphelson stated, “An
assessment technique constructed to reflect re-
sponse tendencies peculiar to a certain limited
class of situations will probably have a predictive
edge over more generally oriented methods
[p. 17].”

One other point should be noted. A possible
risk of confounding exists because the MA scale
and n Ach scores were not obtained under
identical conditions. The MA scale was adminis-
tered under neutral conditions several months
prior to the testing situation, while the n Ach
was part of the experimental test session. It has
been pointed out (McClelland et al., 1953) that
stories written following competitive conditions
should reflect both a heightening of the intensity
of n Ach and also other specific effects related
to success or failure experiences. On the other
hand the MA scale, as noted earlier, is assumed
to reflect a relatively constant level of internal
anxiety or emotionality and should be an ade-
quate predictor of the presence and effects of
anxiety responses in any situation. The point
remains, however, that the results might have
been different if the tests had been adminis-
tered at the same time, either before or after
performance.

Taken individually both anxiety, as measured
by the MA scale, and n Ach appear to follow
a trend which is consistent with other
studies in this area. The more anxious indi-
vidual appears to do well in a noncompetitive
situation when he is not threatened or under
pressure, but when placed in competition his
anxiety or fear of failure tends to interfere
with performance. In contrast, the competitive
situation seems to motivate the individual with
the high desire to succeed, energizing and im-
proving his performance. It is only when the
two measures are combined, however, that the
results are most clear. Thus it would appear that
a knowledge of both motivational variables
should enhance the possibility of predicting
success in competitive situations.
RACE AND EXAMINER AS SIGNIFICANT INFLUENCES ON
BASAL SKIN IMPEDANCE 1


ALVIN S. BERNSTEIN 
Franklin D. Roosevelt Veterans Administration Hospital, Montrose, New York 


In agreement with a previous study, basal skin impedance was higher for
Negroes than for comparable white Ss among normal samples. This was true
whether they were seen by a white or Negro examiner and among schizo-
phrenic samples as well. An examiner effect was also demonstrated, with Ss
seen by one examiner yielding significantly higher base impedance levels than
those seen by the other. Neither S’s race nor examiner effects were due to
differential reaction by white or Negro Ss to either examiner. Race differences
seem restricted to basal impedance level since differences in spontaneous electro-
dermal fluctuations were not significant. Such fluctuations showed considerable
intergroup stability with regard to both mean frequency and variability of
occurrence over a 6-min. period.


Johnson and Corah (1963) recently reported
higher skin resistance for Negroes than for a
comparable white population. When this report
appeared the writer was involved in a study of
arousal level, combining white and Negro sub-
jects in the samples, and using basal resistance
as one measure of arousal. A problem had arisen
because of an unusually pronounced intersubject
variability apparent within each sample on this
measure. Examination revealed that this was, in
fact, chiefly due to substantially higher resist-
ances recorded for Negro subjects. Accordingly,
the present study was undertaken to provide
independent validation of the earlier report, and
to extend it by determining whether a similar
race difference exists among schizophrenic pa-
tients; whether such differences could be at-
tributed in any way to a differential reaction
by white and Negro subjects to the race of the
examiner; and whether such differences simply
represent an instance of broader racial differences
in arousal level, or in electrodermal phenomena
generally.

1 The author is indebted to Jacob Cohen for
advice as to the proper statistical procedures; Jesse
Johnson for examining two of the samples; John P.
Bonsal who helped collect the data and carry out
the statistical tests; and Anne Leonard who typed
the manuscript.


METHOD
Subjects


A total of eight different samples were examined.
One sample each of normal white and Negro sub-
jects was examined by the writer, while two addi-
tional such samples were seen by a Negro examiner.
In selecting patients for the schizophrenic samples,
a distinction was maintained between clear, well-
oriented, essentially remitted patients, and those who
were confused, disoriented, and showed considerable
deterioration. All patients were chronic, with an
average hospitalization of 9.2 years (SD = 8.9) for
the clear samples, and 15.7 years (SD = 6.5) for the
confused. Patients were selected for these categories
on the basis of: the judgment of the ward phy-
sician and/or psychologist that the patient concerned
belonged at either extreme of the “clear, well-
oriented to confused, deteriorated” continuum; score
on the Montrose Rating Scale (MRS; Rackow,
Napoli, Klebanoff, & Schillinger, 1953) filled out
by the physician or psychologist on each patient.
With possible MRS scores ranging between 0-28,
all the confused patients’ ratings ranged between 0-9,
all clear patients between 13-28. Two samples each
of white and Negro patients were examined: one
clear, and one confused for each race. Number of
subjects in each group and their average ages are
presented in Table 1. None of the patients had a
diagnosis or history of CNS damage, and none had
received any tranquilizing medication for at least
30 days preceding examination. All subjects in the
study are male.


Procedure 


Subjects were comfortably seated in a semi-
darkened, sound-deadened room. A whirring fan
provided low-intensity masking noise. The first and
third fingers of the preferred hand were cleaned
with methyl alcohol, and electrodes applied using
Cambridge paste. The electrodes were stainless steel
discs, 9.5 millimeters in diameter, set in a lucite
base. Measurement of both basal impedance level
(BIL) and spontaneous electrodermal fluctuation
(SF) was obtained through an Airborne Instruments
Model 152 Tissue Resistance Monitor, recorded on
a Texas Instrument single-channel Recti/Riter. The
instruments were calibrated anew for each subject
using a Precise Model 468 Resistance Decade Box.
The Resistance Monitor utilizes an ac generator,
producing an 8 cycles-per-second square wave, and
imposing a constant current of 20 microamperes.
While an ac circuit minimizes polarization problems,
TABLE 1
MEAN BASE IMPEDANCE AND SPONTANEOUS FLUCTUATION MEASURES, AND DESCRIPTIVE DATA
FOR EACH GROUP

                  White                                    Negro
        Normal  Normal  Schizo-  Schizo-       Normal  Normal  Schizo-  Schizo-
                        phrenic  phrenic                       phrenic  phrenic

Basal skin impedance
M        46.0    87.3    61.3     78.2         100.1   142.1   147.9    139.5
SD       13.4    33.5    24.1     28.9          46.0    54.6    74.9     70.2

Number of spontaneous fluctuations
M          …       …       …       9.9            …      9.7     9.2     10.7
SD         …       …      6.8       …             …      8.1     7.3      7.1

Age
M        41.8    44.4    40.7     43.5          32.1    34.3    44.3     41.9
SD       13.3    11.3    10.5      9.1           3.6     9.9     6.2     10.2

N         12      24      20       20            12      24      10       10

Note. Base impedance values are given in kilohms.
a Samples administered by a Negro examiner.
b Samples administered by a white examiner.


it measures impedance (a combination of resistance
and capacitative reactance) rather than the dc
“apparent resistance” employed by many writers
(including Johnson and Corah). Since these are
related phenomena, however, similar results may be
expected.
Subjects sat facing a rear-projection screen. After
a 5-minute adaptation period, lights of varying in-
tensity were flashed for 1-second periods at irregular
intervals between 15-60 seconds. An assistant sat
in a corner facing the subject, and signaled the
writer (in an adjacent room with the instruments)
by means of an amber light whenever subjects turned
their heads away from the screen or closed their
eyes. This procedure was part of separate studies
of the orienting reflex and arousal level in which
some of the present subjects also participated.
BIL was measured at the end of the initial
5-minute adaptation period, immediately before onset
of the first light. Total number of SFs was summed
for each subject over a 6-minute period beginning
10 seconds before onset of the first light.
SF was defined as a deflection of at least 500 ohms
not occurring within 1-3 seconds following onset
of any light. To distinguish between consecutive
spontaneous responses and continuous, complex ones,
separate SFs were scored only where deflections were
separated by at least 2 seconds.2
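
The scoring rule for spontaneous fluctuations described above lends itself to a short sketch. The version below is an illustration under assumptions rather than the scoring procedure actually used: it supposes that deflections have already been read off the record as (time in seconds, amplitude in ohms) pairs, that the 6-minute window opens 10 seconds before the first light, and that a deflection following any other qualifying deflection by less than 2 seconds is treated as part of the same complex response. The function and parameter names are hypothetical.

```python
def count_spontaneous_fluctuations(
    deflections,
    light_onsets,
    min_ohms=500.0,
    evoked_window=(1.0, 3.0),
    min_gap=2.0,
    period=360.0,
    lead=10.0,
):
    """Count SFs from (time_s, amplitude_ohms) pairs under the stated rule."""
    if not light_onsets:
        return 0
    start = min(light_onsets) - lead      # window opens before the first light
    end = start + period                  # and runs for 6 minutes
    count = 0
    last_time = None
    for t, amp in sorted(deflections):
        if not (start <= t < end):
            continue                      # outside the scoring period
        if amp < min_ohms:
            continue                      # too small to score
        if any(evoked_window[0] <= t - onset <= evoked_window[1]
               for onset in light_onsets):
            continue                      # follows a light, so not spontaneous
        if last_time is None or t - last_time >= min_gap:
            count += 1                    # a separate SF
        last_time = t                     # close deflections chain together
    return count


# Hypothetical record: lights at 10 s and 50 s, six candidate deflections.
lights = [10.0, 50.0]
record = [(5.0, 600), (6.0, 700), (11.5, 800), (20.0, 400), (30.0, 500), (400.0, 900)]
print(count_spontaneous_fluctuations(record, lights))  # 2
```

In the hypothetical record, the deflection at 6.0 s merges with the one at 5.0 s, the one at 11.5 s is excluded as a response to the first light, the one at 20.0 s is below 500 ohms, and the one at 400.0 s falls outside the 6-minute period, leaving two SFs.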


RESULTS AND DISCUSSION 


The average measures of both BIL and SF for
each sample are shown in Table 1. By analysis
of variance, BIL for Negroes was significantly
higher than that for whites both among normal
subjects in the groups seen by a white examiner
as well as in those examined by a Negro
(p < .01).3 Similarly, BIL was significantly
higher (p < .01) for Negroes among the schizo-
phrenic samples.

2 Agreement between two judges independently
scoring SF for an additional set of 30 records was
… by product-moment correlation.

The findings of Johnson and Corah (1963) 
are thus strongly supported. Negroes have reli- 
ably higher impedance or resistance base levels 
than whites, and this difference holds for both 
normal and schizophrenic subjects. The similar 
results with a Negro examiner, and the absence 
of any interaction effect between subject’s race 
and examiner (that is, the same difference was 
found between white and Negro subjects whether 
they were seen by a white or Negro examiner) 
indicates that this cannot be readily explained 
as an artifact due to different qualities of 
response by white and Negro subjects to a white 
examiner. 

While the two Negro normal samples are some 
10 years younger than the remaining groups, 
observation of Table 1 readily indicates that 
there is no regular relationship of any sort 
between age and BIL.

The possibility remains, however, that these 
findings represent broader racial differences in 
electrodermal responsiveness, or, since BIL is a 
function of arousal level (Woodworth & Schlos- 
berg, 1961), that the races may differ still more 
generally in level of manifest arousal. Examina-
tion of the SF measure provides relevant in-
formation here, since, in addition to representing
another, independent (Corah & Stern, 1963)
electrodermal measure, SF has been substanti-
ated also as an index of arousal (Burch &
Greiner, 1958, 1960). Analysis of variance re-
veals that none of the differences in SF between
any pair of white and Negro samples approached
significance, making both these explanations un-
likely. This also is in general agreement with
Johnson and Corah (1963).

3 Because the heterogeneous variance of the BIL
data brought the assumptions of the analysis of
variance into question here, a log transformation
was performed. The transformed data satisfied the
requirement for homogeneous variance, but did not
alter the outcome of the analysis of variance in
any way.
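
The log transformation mentioned in Footnote 3 is a common remedy when group standard deviations rise with group means, as the BIL values in Table 1 appear to do. The sketch below shows why it helps, using hypothetical kilohm readings (not data from this study) in which one group is an exact threefold multiple of the other. The function names are assumptions for the example, and the ratio of largest to smallest group variance is used only as a rough index of heterogeneity, not as the test the author applied.

```python
import math
from statistics import variance


def variance_ratio(groups):
    """Largest group variance divided by the smallest (rough heterogeneity index)."""
    vs = [variance(g) for g in groups]
    return max(vs) / min(vs)


def log_transform(groups):
    """Apply a base-10 log to every reading in every group."""
    return [[math.log10(x) for x in g] for g in groups]


# Hypothetical readings in kilohms: the second group is 3 times the first,
# so its spread is 3 times as large and its variance 9 times as large.
low_group = [30.0, 40.0, 50.0, 60.0, 70.0]
high_group = [3.0 * x for x in low_group]

raw = variance_ratio([low_group, high_group])
logged = variance_ratio(log_transform([low_group, high_group]))
print(round(raw, 2), round(logged, 2))  # 9.0 1.0
```

Because log(3x) = log 3 + log x, multiplying a group's readings by a constant only shifts its log values, leaving their variance unchanged. Real data are never this tidy, but the same mechanism is what brings proportional spreads closer to homogeneity.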

Thus, race differences seem restricted to this 
single, specific phenomenon—base level of skin 
impedance or resistance. While the origin of 
this difference remains obscure, a recent report 
by Darrow (1962) may be related to it. Though 
stressing “two distinctly different electrophysio- 
logical response processes occurring respectively 
at high and low levels of resistance,” Darrow 
indicates that high resistance level is apparently 
due to electrode contact with the relatively 
nonconductive surface layers of the skin and the 
outer tissue of the sweat-gland openings; low
levels of resistance result from the filling of the
tubules with highly conductive sweat, shorting
out effects from outside the glands and providing
low resistance contact with the interior of the
depolarizing sweat glands. This would suggest the
possibility of decreased activity, and perhaps 
number, of sweat glands in Negroes. 

In addition to subject's race, BIL was also 
strongly affected by the particular examiner 
involved. That is, impedance levels for the two 
groups, white and Negro, seen by one examiner 
were significantly different (p<.01) from those 
of the comparable groups seen by the other.
Again, this was not due to any differential re-
action by white or Negro subjects to either
examiner, but was generally true of all subjects
seen by either. While it is not known whether
this is due to the difference in race between
examiners or to other differences in appearance
or manner, the presence of such an effect is not
surprising. An examiner effect has long been
known with regard to a variety of psychological
tests, and a similar effect might have been
anticipated with regard to measures of autonomic
function.


ference between orienting reflex, 
Sampling problems, particularly in selecting a
sufficient number of Negro patients, may have
reduced the effectiveness of the distinction 
between clear and confused schizophrenic groups. 

Thus, the data indicate that both subject’s 
race and still undefined differences between ex-
aminers significantly affect basal skin resistance
phenomena. Comparisons across different racial 
or examiner groups therefore appear to be of 
questionable validity, and moreover, the pres- 
ence of subjects of different race, or those seen 
by different examiners within a sample, should 
increase intragroup variability, possibly becloud- 
ing meaningful intergroup differences. Research- 
ers should include appropriate control in studies 
involving basal resistance or impedance, and 
possibly, in work involving autonomic function 
generally. While the present data and those of 
Johnson and Corah (1963) suggest that the 
subject-race effect is restricted to basal resist- 
ance, the possibility of examiner-associated vari- 
ability across a broad band of autonomic function 
requires study. 

The present data do indicate that differences 
in BIL between the groups seen by the two 
examiners were not accompanied by significant 
differences between them in SF. A similar 
specificity of effect may therefore be suggested 
for both subject-race and examiner variables; 
however, additional tests on other functions 
are required. Thus, it is possible that the present
findings may be due to marked vulnerability of
basal skin levels to external influence; to marked
invulnerability of SF to such influence; or both.
The stability of SF through time, within any
subject, has been demonstrated by others (Lacey
& Lacey, 1958; Wilson & Dykman, 1960). In
particular, the Laceys cite such stability in
suggesting that SF originates not in “adventitious
stimuli” but “in physicochemical processes at
the level of localized cell aggregates.” The
present data, however, go further, suggesting a
degree of intergroup stability not reported
heretofore. That is, both the mean frequency of
occurrence of spontaneous activity and its at-
tendant variability remained essentially un-
changed across the samples, despite differences
among them in race, psychiatric condition, and
examiner. The basis and meaning of this finding
are not understood at present. This, and the
curious divergencies between these two measures,
BIL and SF, both arising from electrodermal
reactions, and both related to arousal, deserve
further study. In particular, the manner in which
each relates to different aspects of arousal
requires closer attention.
SOME SITUATIONAL AND COGNITIVE DETERMINANTS
OF DISSONANCE RESOLUTION 1


O. J. HARVEY 
University of Colorado 


10 representatives of each of the 4 main levels of abstractness posited by 
Harvey, Hunt, and Schroder (1961) argued in opposition to their own 
beliefs about philosophy, under the instructions that their arguments would 
never be heard by anyone other than themselves (private) or that their 
arguments would be heard by a university curriculum committee (public). 
Opinions were retested immediately after argumentation and again 1 wk. 
later. Opinion change at both retests was affected significantly by concreteness- 
abstractness and by interaction of this variable with the public-private variation. 
The latter variable significantly affected change only at Retest 2. Effects of 
cognitive structure and conditions of commitment on resolution of dissonance 


were demonstrated. 


Despite the impressive number of studies
seeking to extend the application of the theory
of cognitive dissonance (Brehm & Cohen, 1962;
Festinger, 1957), only scant attention has been
given to the potential effects of varied media-
tional systems on responses to incongruous or
dissonant events. Theory and evidence abound,
however, which suggest the possibility that varia-
tion in the cognitive structures of individuals,
the mediational prism through which events are
psychologically processed and transmuted into
personal meaning, may result in widely different
reactions to comparably dissonant situations.
This study was aimed at testing this possibility
by inducing the experience of dissonance in per-
sons who varied on the dimension of cognitive
structure which we have termed concreteness-
abstractness (Harvey, Hunt, & Schroder, 1961;
Harvey & Schroder, 1963; Schroder & Harvey,
1963).

1 This study was supported by the Group Psychol-
ogy Branch, Office of Naval Research, under Contract
… with the University of Colorado.
Thanks are extended to ONR for its support; to
… Rector, Catherine Felknor, Darrell Adams,
and Doyle Bishop for their aid in collecting the
data; and to William Scott for his editorial sug-
gestions. (Reproduction of this study in whole or
in part is permitted for any purposes of the United
States Government.)

Greater concreteness may be inferred from 
a host of states and behaviors, such as: a simpler 
cognitive structure, comprised of poorer dif- 
ferentiation and less complete integration; a 
greater tendency toward bifurcated evaluations, 
viz, good-bad, right-wrong, black-white, etc.; 
a narrower latitude of acceptance, and a wider 
latitude of rejection; a 
external cues, especially 
perceived as emanating f 
institutional authority; a 
ambiguity, expressed in the tendency to form
judgments more quickly and on less information, 
even at the expense of accuracy and susceptibility 
to false but salient cues; a greater inability to 
change set, and hence greater stereotypy in the 
solution of novel or more complex problems; a 
poorer delineation between means and ends, and 
hence a paucity of different routes to a common 
goal; a poorer capacity to “act as if,” to assume 
the role of the other, or to think and act in 
terms of a hypothetical situation; a less intrinsi- 
cally anchored self, and hence a lessened effect 
of the self as a causal agent in transaction with 
the environment; and, as a consequence of these 
and other properties, a more closed and rigid 
conceptual system, resulting in a stronger pre- 
dilection toward keeping potentially conflicting 
inputs out of the system and a greater tendency 
for the system to “go to pieces” or be dis- 
rupted in its activity by failure to ward off the 
incongruous events. Greater abstractness implies 
the opposite quantity on these behavioral cri- 
teria. A more abstract system is comprised of 
both highly individualized and synchronously 
integrated parts, organized in such a way that, 
depending on the demands of the situation, each 
part may operate independently or the totality 
may be engaged in interdependent unison.2 

Differences such as those above make it 
reasonable to expect that the person with a 
more abstract cognitive system would be more 
capable than the more concretely functioning 
individual of absorbing or assimilating conflict 
and inconsistency; and owing to a state of 
lower strain, tension, and discomfort from in- 
congruous inputs, to be less motivated toward 
resolving the dissonance and restoring cognitive 
consistency. This possibility is strengthened by 
such findings as those of Campbell (1960) and 
Scott (1963) in which the cognitively simpler 
subjects were shown to have a higher tendency 
toward sociometric balance and cognitive con- 
sistency than the cognitively more complex 
individuals. 

The assumption that more abstractly function- 
ing individuals are more tolerant of dissonance 
than persons of more concrete structures presupposes 
that the incongruous inputs have gained 


2 Experimental documentation of these and other 
characteristics that distinguish between more con- 
crete and more abstract functioning is being detailed 
in O. J. Harvey, System Structure and Adaptability, 
nearing completion. Evidence for most of these distinctions 
has already been presented, but mainly in 
theses and unpublished technical reports (Brook, 
1962; Felknor & Harvey, 1963; Harvey, 1963a, 
1963b, 1963c, 1963d; Juers & Harvey, 1963). 

entrance into the subject’s conceptual system 
and have been experienced as conflicting or 
inconsistent with other of his cognitions. Otherwise, 
if dissonance is defined in terms of experimental 
operations instead of experience, the 
more concretely functioning person, because of 
stronger tendencies toward warding off potentially 
conflicting events, transforming them into 
congruence with his own interpretive schemata, 
or in some way neutralizing their deviance, would 
appear to be more tolerant of cognitive 
incompatibility than the person with the more 
abstract structure. 

An induction method was sought which would 
reduce the likelihood of distorting or neutralizing 
the incongruity and which, consequently, would 
increase the probability of all subjects experiencing 
cognitive incompatibility. A kind of 
role-playing technique, requiring the subject to 
generate and orally present arguments that ran 
counter to his own opinion, was used to satisfy 
these requirements. Arguments were made under 
one of two situational variations. In the private 
condition, designed to create less involvement 
in and less commitment to the arguments, subjects 
were instructed that no one other than 
themselves would hear or ever know the arguments 
they presented. In the public condition, 
aimed at producing greater involvement in and 
greater commitment to the arguments, subjects 
were instructed that the tape recordings of their 
arguments and comments would later be heard 
by a curriculum committee of the university 
concerned with the issue being argued. 

It was expected that opinion change would 
be among the more reliable effects produced by 
the individual’s negation of his own beliefs. 
Interest in the effects of the treatment on more 
enduring change as well as on shorter range 
modification resulted in the subjects being 
retested on their opinions about the target issue 
immediately following argumentation and, without 
further experimental intervention, again 1 
week later. 

Predictions, especially concerning the effects of 
concreteness-abstractness, were complicated by 
the likelihood that the more abstractly constituted 
persons would play the negating role better 
and consequently surpass the more concrete subjects 
in the volume of incongruous inputs to 
which they were exposed. This could result in the 
more abstract subjects changing their opinions 
to a greater extent than the more concrete subjects 
even if the latter are more intolerant of 
dissonance and are more inclined to change from 
equivalent amounts of experienced cognitive 
incompatibility. Or, because of no relationship 
between quality of role playing and opinion 
change, the more abstract subjects could play 
the role better while the more concrete subjects 
changed their opinions more. This latter occur- 
rence would mean that the amount of change per 
item of stimulation (change divided by volume 
of stimulation) was greater for the more concrete 
than for the more abstract individuals, since the 
former changed more than the latter and from 
fewer conflicting inputs, as objectively defined. 
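
The ratio described in the preceding paragraph (change divided by volume of stimulation) can be sketched as follows. This is only an illustration: the function name and the numbers are hypothetical and are not taken from the study.

```python
def change_per_item(opinion_change, stimulation_volume):
    """Return opinion change divided by volume of stimulation."""
    if stimulation_volume <= 0:
        raise ValueError("stimulation volume must be positive")
    return opinion_change / stimulation_volume


# Hypothetical values chosen only to illustrate the argument; not study data.
concrete = change_per_item(opinion_change=0.30, stimulation_volume=10.0)
abstract = change_per_item(opinion_change=0.20, stimulation_volume=20.0)

# The concrete subject changed more and did so from fewer inputs,
# so the per-item figure is larger for the concrete subject.
print(concrete > abstract)
```

With these made-up figures the more concrete subject shows both the larger raw change and the larger change per item, which is the pattern the paragraph describes.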

Better role playing by the more abstract sub- 
jects should not alter the anticipated effects of 
the public-private variation. Argumentation 
under the public condition should be expected 
to produce greater involvement and hence more 
dissonance and greater opinion change than 
arguing in private (Brehm & Cohen, 1962). And 
this public-private difference should be greater 
for the more concrete than for the more abstract 
subjects. 

The following hypotheses, applicable to opinion 
change existing at both Retests 1 and 2, were 
formulated: 


1. If volume of negating stimulation does 
relate significantly to opinion change, there will 
be no significant difference between the more 
concrete and the more abstract subjects in 
opinion change; but there will be a significant 
difference between these subjects in the amount 
of change per item of stimulation. 

2. If volume of negating stimulation does not 
relate to opinion change, the change among 
the more concrete subjects will be greater than 
change among the more abstract subjects. (If 
the more abstract subjects should play the role 
better and still there was no difference between 
them and the more concrete subjects in amount 
of change, this would mean necessarily that the 
amount of change per item of stimulation was 
greater for the more abstract subjects.) 

3. The opinion change produced by argumen- 
tation in public will be greater than change 
effected by argumentation in private. 

4. The public-private and concreteness- 
abstractness variations will interact, the concrete 
subjects being influenced more in the public 
than in the private condition. 


METHOD 
Assessment of Concreteness-Abstractness 


This was done by the “This I Believe” (TIB) 
Test, devised by the author specifically as a measure 
of conceptual systems or levels of concreteness- 
abstractness. The TIB Test, found in several studies 
to have high predictive and construct validity,3 
requires the subject to indicate his beliefs about 
a number of socially and personally relevant concept 
referents by completing in two or three sentences 
the phrase “This I believe about ______,” 
the blank being filled successively by one of 
the following referents: friendship, the American 
way of life, guilt, marriage, myself, religion, sin, 
majority opinion, people, and compromise. 

The concept referents, one to a page, along with 
items of subject identification, are presented in a 
small booklet under the title of “Opinion Survey.” 
Two minutes, timed by the experimenter, are 
allowed for each completion. 

In this study subjects were classified into one 
of the four principal conceptual systems or levels 
of abstractness posited by Harvey et al. (1961). 
Without elaborating the assumptions under- 
lying the classifications, suffice it to indicate that 
individuals are categorized as representing predomi- 
nantly System 1, the more concrete end of the 
continuum, if their TIB completions manifest such 
characteristics as high absolutism of thought and 
belief, high positive dependence on or cathexis with 
representatives of institutional authority, high fre- 
quency of platitudes or conventional statements, 
and high ethnocentrism and evaluative assertions of 
American superiority. In many respects, this mode 
of functioning is closely related to the syndrome of 
authoritarianism. In several studies subjects 
classified as representing System 1 have been found 
to score the highest of the four systems on 
authoritarianism. 

Subjects are classified as representing System 2 
functioning, the next assumed level of abstractness, 
if their responses indicate high negativism and de- 
nunciation of institutional authority, high abso- 
lutism, and a high drive toward autonomy and 
avoidance of dependency on authority-related cues, 
including God, tradition, and most of the referents 
that serve as positive guidelines for System 1 repre- 
sentatives. Next to System 4 representatives, System 
2 subjects score the lowest on the scales of authori- 
tarianism. In terms of directionality the responses 
of System 2 individuals tend to be opposite to those 
of System 1 subjects, at least in relation to authority 
referents; but structurally, i.e., in the absolutism 
with which certain classes of objects are construed, 
representatives of Systems 1 and 2 are more similar 
to each other than to the other systems. 

Subjects are classified into System 3, the next 
to the highest level of abstractness, if their TIB 
responses indicate less avidity and absolutism than 


3 In addition to relating appropriately to such tests 
as Edwards’ Personal Preference Schedule, the F 
Scale, and the Anomie scales of Srole and Nettler, 
the TIB instrument has been validated in studies 
concerned with such variables as differentiation and 
integration, conformity, decision making, stereotypy, 
probability learning, concept formation, risk taking, 
creativity, values, and self-esteem. References in 
Footnote 2 are illustrative of these studies. 

shown by representatives of Systems 1 and 2, an 
emphasis on man’s universal goodness, an emphasis 
on the importance of friendship and mutual 
interpersonal dependencies, and reference to the 
importance of avoiding loneliness and social isolation. 

Subjects are categorized as representing System 4, 
the highest level of abstractness described by Harvey 
et al. (1961), if their TIB completions indicate a 
high degree of novelty and appropriateness, inde- 
pendence without negativism, high relativism and 
contingency of thought, and the general usage of 
multidimensional instead of unidimensional interpre- 
tive schemata. These individuals score the lowest 
of the four systems on scales of authoritarianism. 

In categorizing a subject into a particular system, 
responses to all of the TIB referents are considered 
in totality and one overall score assigned instead of 
each response to each referent being scored inde- 
pendently and a summed value taken to represent 
the conceptual system. While lacking somewhat in 
precision, this more global approach has been found 
to produce a higher reliability and validity than 
single item analyses because, presumably, a context 
is provided against which a single response can more 
stably and accurately be interpreted. The interjudge 
reliability among three or four trained judges has 
been .90 or above for nine different samples of 
subjects. Moreover, the conceptual systems, scored 
in this more general way, have been found to differ 
appropriately on such attributes as evaluativeness, 
categoricalness, etc., derived from separate and independent 
scorings of the TIB completions. 


Subjects 


Twenty representatives of each of the four con- 
ceptual systems, selected from larger samples of 
subjects who had been classified identically by three 
independent graduate student judges, received the 
experimental treatment. To achieve this number of 
subjects from each system, TIB booklets were ad- 
ministered to 220 male undergraduates attending the 
University of Colorado during the summer of 1961. 
Of the 220, 41 were unanimously categorized into 
System 1, 31 into System 2, 37 into System 3, and 
21 into System 4; the remaining 90 were scored as 
representing heavy admixtures of two or more systems 
(N = 78) or were not classified into the same 
system by all three judges (N = 12). 


Procedure 


Synopsis. Twenty male subjects from each of the 
four conceptual systems argued in opposition to 
their earlier measured opinion on the question of 
whether or not all college undergraduates should be 
required to minor in Philosophy. Half of the sub- 
jects from each conceptual system played this role 
of argumentation under a private condition while 
the other half argued in the public condition. 

In both role-playing conditions the subject tape 
recorded his arguments while in a room by himself. 

In the private condition, the subject was led to 
believe that no person other than himself, including 
the experimenter, would ever hear his recorded 
arguments. The assumption of privacy was conveyed 
to the subject by having him erase his tape as he 
played it back to himself in isolation. 

In the public condition the subject also recorded 
and played back his arguments in isolation, but, 
unlike subjects in the private condition, he was 
urged strongly at the outset to exercise care that 
he did not erase any of his arguments since a uni- 
versity curriculum committee concerned with the 
place of philosophy in undergraduate education 
would later hear it. 

Immediately following argumentation, all subjects 
were readministered the scale on opinions toward 
philosophy; and they were readministered the same 
scale a second time 1 week later. 

Sequential steps. The steps in the experiment, 
following selection of subjects, were in the following 
order: 


1. Determine the subjects’ prerole opinions toward 
philosophy: Each responded to 28 statements about 
philosophy (such as “Philosophers are more creative 
than nonphilosophers,” and “Training in philosophy 
provides a basis for better understanding of other 
areas”) by checking on a 6-inch scale his degree of 
agreement with each statement. A subject’s score on 
the questionnaire was the mean of his 28 responses 
measured to the nearest .25 inch from the anti- 
philosophy end of the scale. Those individuals whose 
mean score was less than 3 were termed “antiphilosophy” 
while those who scored above 3 were 
labeled “prophilosophy.” By happenstance, the num- 
ber of pro- and antiphilosophy subjects within each 
of the conceptual systems was almost identical. 

2. Argumentation: Prophilosophy subjects were 
required to argue against requiring philosophy as 
an undergraduate minor while the antiphilosophy 
subjects were required to argue in favor of philos- 
ophy as a minor. In both the public and private con- 
ditions subjects were required to make their argu- 
ments from four major premises or starting points 
assigned to them by the experimenter. Depending 
on whether the subject was to argue in favor of or 
in opposition to philosophy, the four premises from 
which he had to argue were that training in philos- 
ophy was in accord with or in opposition to: 
religious principles, the best interests of the family, 
the advancement of freedom, and the best interest 
of minority groups. The assignment of premises 
was aimed at creating fairly standard levels of 
difficulty for all subjects. To achieve this, premises 
were selected that were assumed to be system- 
relevant, one premise being more relevant for each 
of the four conceptual systems. 

The study was described as being concerned with 
the effects of self-insight into the ability to argue 
on either side of an issue; a playback of the 
recorded arguments was depicted as a means by 
which self-insight might be enhanced. Subjects were 
asked to assume that in this instance they were in 
a college debate and it had fallen their lot to argue 
either in favor of or opposition to the proposition 
that philosophy should be required as a minor for all 
college undergraduates. They were then given the 
four premises from which they had to argue orally, 
advised that their arguments had to be based exclusively 
on the assigned assumptions, which could 
be used in any order chosen, and told that they 
would have 5 minutes for argumentation from 
each premise. 

To insure that all subjects had the same amount 
of time to use in argumentation, they were told, 
following instructions in the use of the recorder, to 
wait until all arguments from all four premises had 
been completed before attempting to listen to the 
recording. Subjects were assured in both the public 
and private treatments that while recording their 
arguments and playing them back they would be 
heard by no one but themselves. In the private 
condition they were instructed to erase their tape as 
they played it back so that it could never be heard 
by anyone else, while in the public treatment subjects 
were cautioned not to erase their tape in playback in 
order that it could later be heard by the curriculum 
committee. 

During role playing and tape playback, the sub- 
ject, in both the public and private conditions, sat 
alone in a lighted and soundproofed room. By means 
of an intercom system he could ask questions of 
procedural clarification from the experimenter, 
seated in an adjoining room, and could be instructed 
by the experimenter at the end of 5-minute intervals 
to begin arguments from a new premise. In keeping 
with the aim of convincing each subject that he 
would be unheard by the experimenter while making 
his arguments and playing them back, he was led 
incorrectly to believe that the intercom carried his 
voice only when he depressed the transmitter 
switch. In reality, however, the transmitting channel 
from subject to experimenter always remained open. 
This provided a second recording of the subjects’ 
arguments in the private condition, necessary for 
another aspect of this study (Harvey, 1963b); at 
the same time it allowed the experimenter to 
determine if any subject departed from the prescribed 
procedure, which none did owing to the 
clarification given to those who asked questions. 
A small light in front of the experimenter was 
activated when the subject depressed the transmitter 
switch; and to strengthen the subject’s belief that 
he was heard by the experimenter only when the 
transmitter switch was pushed, the experimenter re- 
sponded only when the light appeared. 

All indications, from taped remarks and post- 
experimental comments, suggested that the subjects 
accepted the instructions and procedure as genuine 
and valid. 

3. Measure of opinion change: Immediately following 
argumentation subjects were readministered 
the scale on opinions toward philosophy. To deter- 
mine endurance of change, the same scale was 
readministered a second time, 1 week later. 
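
The scoring rule in Step 1 and the side assignment in Step 2 can be sketched as below. This is a minimal illustration under stated assumptions: the function names are hypothetical, and the handling of a mean score of exactly 3 is an assumption, since the text classifies only scores below or above 3.

```python
from statistics import mean

SCALE_LENGTH_INCHES = 6.0   # each statement is checked on a 6-inch scale
N_STATEMENTS = 28           # number of statements about philosophy


def score_questionnaire(marks_in_inches):
    """Mean of the 28 marks, each taken to the nearest .25 inch,
    measured from the antiphilosophy end of the scale."""
    if len(marks_in_inches) != N_STATEMENTS:
        raise ValueError("expected 28 responses")
    rounded = []
    for mark in marks_in_inches:
        if not 0.0 <= mark <= SCALE_LENGTH_INCHES:
            raise ValueError("mark lies outside the 6-inch scale")
        rounded.append(round(mark * 4) / 4)  # nearest quarter inch
    return mean(rounded)


def classify(score):
    """Label a subject from his mean score."""
    if score < 3:
        return "antiphilosophy"
    if score > 3:
        return "prophilosophy"
    return "unclassified"  # assumption: the text does not cover exactly 3


def assigned_side(label):
    """Subjects argue against their own measured opinion."""
    return {"prophilosophy": "con", "antiphilosophy": "pro"}[label]


# Hypothetical subject who marks every statement at 2.1 inches.
score = score_questionnaire([2.1] * N_STATEMENTS)
print(score, classify(score), assigned_side(classify(score)))
```

Because scores are measured from the antiphilosophy end, a higher mean indicates a more prophilosophy stand; the later change measures depend on that orientation.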


RESULTS 


In a phase of this study concerned with cog- 
nitive correlates of role-playing ability (Harvey, 
1963b) the more abstract subjects were found 
to play the role of argumentation significantly 
better than the more concrete individuals. This 
means, at least theoretically, that the volume of 
cognitively incongruous stimulation was greater 
for the more abstract than for the more concrete 
subjects. Even if our assumption is correct that 
more concrete individuals are less tolerant of 
dissonance, the greater stimulation of the more 
abstract subjects could result in these latter in- 
dividuals changing their opinions as much or 
more than the more concrete subjects. 

To determine the effects of differences in 
volume of dissonant stimulation on opinion 
change, the amount of change in opinion toward 
philosophy at Retest 1 and Retest 2 were cor- 
related separately with a measure of stimulation 
volume, represented by the product of the overall 
quality of the subject’s role playing and the 
time, out of 20 minutes, he used for actual 
argumentation or was not inappropriately silent. 
(Some of the more concrete subjects remained 
completely and inappropriately silent for more 
than half of the role-playing period of 20 
minutes.) Neither of these correlations was significant. 
This meant that the effects of role 
playing on opinion change could be analyzed 
without adjustments for differences in quality 
of the argumentation. 

Moreover, as may be noted from Table 1, 
the preargumentation stand of the more concrete 
subjects (Systems 1 and 2 combined) and 
the more abstract subjects (Systems 3 and 4 
combined) on the issue of philosophy was so 
highly similar that the possible effects of differences 
in initial position on opinion change could 
appropriately be ignored. 


TABLE 1 

MEAN POSITION OF THE MORE CONCRETE AND THE MORE ABSTRACT SUBJECTS 
ON THE ISSUE OF PHILOSOPHY INITIALLY AND AT RETESTS 1 AND 2 

Conceptual    Preargumentation            Retest 1                    Retest 2 
structure     Public        Private       Public        Private       Public        Private 
              Pro    Con    Pro    Con    Pro    Con    Pro    Con    Pro    Con    Pro    Con 
Concrete      [cells not reliably recoverable; values as extracted: 2.68, 3.21, 2.22, 4.07, 224, 4.41, 2.41, 3.64, 2.33, 4.08] 
Abstract      [?]    [?]    [?]    4.26   2.17   4.29   2.32   4.01   2.12   4.16   2.44   3.83 

Note. All responses on the opinion scale were measured to the nearest .25 inch. 


Opinion Change at Retest 1 


This is the difference between the mean of a 
subject’s initial position and the mean of his 
position immediately following role playing. 
Positive change is change in the same direction 
as that argued, i.e., becoming more prophilosophy 
under the pro arguments and/or more antiphilosophy 
under the con arguments. Negative 
change, conversely, is change in the opposite 
direction to that argued, i.e., becoming more 
antiphilosophy under the pro arguments and/or 
more prophilosophy under the antiarguments. 
The mean opinion change of the more concrete 
and the more abstract subjects under each of 
the conditions is presented in Table 2. 


TABLE 2 

MEAN OPINION CHANGE AT RETEST 1 

Conceptual    Public                  Private 
structure     Pro    Con    M PC      Pro    Con    M PC 
Concrete      .27    .39    .33       .09    .03    .06 
Abstract     -.05   -.13   -.09       .17    .25    .21 


A t test indicated that no significant difference 
existed between the effects of the pro and the 
con arguments (t = .91). A subsequent two-way 
analysis of variance (by the concreteness-abstractness 
and public-private variations) showed, 
however, that while the public-private variation 
had no significant effect on opinion change 
immediately following role playing (F = .18), 
differences in concreteness-abstractness did (F = 
4.01, p < .05), as did the interaction of this 
variable with the public-private manipulation 
(F = 10.77, p < .01). As a main effect, the more 
concrete subjects changed their opinions more 
than did the more abstract subjects; and as an 
interaction, the more concrete subjects changed 
their opinions more in public while the more 
abstract subjects changed more in private. 


TABLE 3 

MEAN DIFFERENCE BETWEEN OPINION CHANGE 
AT RETESTS 1 AND 2 

Conceptual    Public                  Private 
structure     Pro    Con    M PC      Pro    Con    M PC 
Concrete      .27    .43    .35      -.11   -.01   -.06 
Abstract     -.07   -.15   -.11       .12    .18    .15 



Maintenance of Change 


This was determined by subtracting the subject’s 
mean position on the opinion scale at 
Retest 1 from his mean position on the scale at 
Retest 2. A positive difference would mean that 
during the 1-week interval the subject had in- 
creased in positive opinion change or had de- 
creased in negative change. A negative differ- 
ence would mean that the subject had moved in 
the direction opposite to that argued, back 
toward his preargumentation position. The mean 
differences between opinion change at Retests 1 
and 2 for the several conditions are presented 
in Table 3. 
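
The sign conventions for opinion change and for maintenance of change can be sketched as a single helper. The function name is hypothetical. The three positions used for illustration are the group means legible in Table 1 for the more abstract subjects who gave con arguments in private; applying a per-subject rule to group means is done here only for illustration.

```python
def directed_shift(earlier, later, side_argued):
    """Shift in scale position between two testings, signed so that
    movement in the direction argued is positive.

    Positions are measured from the antiphilosophy end, so a higher
    score is more prophilosophy.
    """
    raw = later - earlier
    if side_argued == "pro":
        return raw       # moving up the scale agrees with pro arguments
    if side_argued == "con":
        return -raw      # moving down the scale agrees with con arguments
    raise ValueError("side_argued must be 'pro' or 'con'")


# Group means as read from Table 1 (abstract subjects, private, con).
initial, retest1, retest2 = 4.26, 4.01, 3.83

change_at_retest1 = directed_shift(initial, retest1, "con")
maintenance = directed_shift(retest1, retest2, "con")
print(round(change_at_retest1, 2), round(maintenance, 2))
```

A positive maintenance value means the subject moved further in the direction argued during the week; a negative value means movement back toward the preargumentation position.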

While the majority of subjects in all treatment 
categories moved back toward their initial stands 
during the week, differences existed between the 
more concrete and the more abstract subjects 
and between the public-private variations in the 
extent to which this was the case. The F value 
for differences in concreteness-abstractness was 
4.43 (p < .05); F for the public-private variation 
was 7.70 (p < .01); and F for the interaction of 
these variables was 5.01 (p < .05). As predicted, 
the more concrete subjects maintained the 
change to a greater extent than did the more 
abstract individuals; argumentation in public 
produced more lasting change than did role 
playing in private; and the more concrete sub- 
jects maintained the change produced by the 
public condition more than the change produced 
in private, while for the more abstract subjects 
the reverse was true. 
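
The direction of the main effects and of the interaction can be read descriptively from the four cell means of a table. The sketch below uses the M PC values reported in Table 2 and assumes equal cell sizes, which the design implies. It computes only marginal means and an interaction contrast; it does not reproduce the reported F tests, which require the individual scores. The function name is hypothetical.

```python
def summarize_2x2(cells):
    """Marginal means and interaction contrast for a 2 x 2 table of
    cell means keyed by (structure, condition); equal cell sizes assumed."""
    structures = ("concrete", "abstract")
    conditions = ("public", "private")
    structure_means = {
        s: sum(cells[(s, c)] for c in conditions) / 2 for s in structures
    }
    condition_means = {
        c: sum(cells[(s, c)] for s in structures) / 2 for c in conditions
    }
    # Public-minus-private difference for concrete subjects,
    # minus the same difference for abstract subjects.
    interaction = (
        (cells[("concrete", "public")] - cells[("concrete", "private")])
        - (cells[("abstract", "public")] - cells[("abstract", "private")])
    )
    return structure_means, condition_means, interaction


# M PC cell means from Table 2 (opinion change at Retest 1).
table2 = {
    ("concrete", "public"): 0.33,
    ("concrete", "private"): 0.06,
    ("abstract", "public"): -0.09,
    ("abstract", "private"): 0.21,
}

structure_means, condition_means, interaction = summarize_2x2(table2)
print(structure_means, condition_means, round(interaction, 2))
```

For Table 2 the marginal mean is larger for the concrete than for the abstract subjects, the public and private means are nearly equal, and the contrast is positive, which agrees in direction with the reported analysis. The same function can be applied to the M PC values of Table 3.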

The fact that despite a lower volume of cog- 
nitively conflicting stimulation the more con- 
crete subjects both changed their opinions more 
and maintained the change to a greater extent 
than the more abstract subjects seems to suggest 
clearly that differences in cognitive structure 
affect both shorter and longer ranged reactions 
to dissonance. In addition, these results cor- 
roborate earlier findings that greater and more 
lasting opinion change is fostered by public 
commitment. 


EVALUATION OF PERFORMANCE AS A FUNCTION OF PERFORMER’S 
REWARD AND ATTRACTIVENESS 


MELVIN J. LERNER 1 


Department of Behavioral Science, University of Kentucky 


Ss in this experiment observed 2 people working together at an anagrams task. 
Ss were told, at the outset, that 1 of the 2 workers was selected by chance to 
be paid a sizable amount of money for his efforts whereas the other worker 
was to get nothing. Ss also learned that both workers, though ignorant of 
their fate, had agreed to do their best. In addition, the 2 workers differed in 
attractiveness for the Ss: one was considerably more attractive than the 
other. The major hypotheses of the study were confirmed. Once the outcome 
was known to the observers they tended to persuade themselves that the 
person who had been awarded the money by chance had really earned it, 
after all. Also, when the less attractive worker was selected for payment the 
performance of the entire group was devalued. 


It is functional as well as less dissonance 
arousing (Festinger, 1957) for a person to believe 
that his efforts will lead to a desired outcome. 
Few people would engage in extended 
activities if they believed that there were a 
random connection between what they did and 
the rewards they received. Since a person is 
usually not in complete control of all factors 
which affect the fruitfulness of his efforts, it is 
likely that he will be surprised, at times, by 
apparently unmerited failures or successes. 
Whenever such a surprise occurs the person 
should be motivated to rearrange his cognitions 
so that the discrepancy between the effort and 
the outcome is explained away. 

1 The author would like to express his appreciation 
to Mary Frances Baker, Kay Dickerson, Ron 
[illegible]nroth, and Dave Hall for their assistance in 
gathering and analyzing the data for this study. 

At least three experiments support the notion 
that a person will tend to construe the outcome 
of his actions to match the degree and nature of 
his efforts. Yaryan and Festinger (1961) demon- 
strated that the greater the effort exerted by 
someone in preparing for an event the more 
likely the person is to persuade himself that 
the event will actually occur. An experiment by 
Aronson and Mills (1959) showed that a person 
is likely to enhance the attractiveness of a goal 
if he has gone through considerable stress to 
achieve it. In a third study, Aronson and Carl- 
smith (1962) found that subjects who had been 
convinced that they had poor skills for a per- 
ceptual task were motivated to reduce the value 
of an unusually good performance in order to 
match their stable beliefs concerning the worth 
of their efforts. 

In these studies the person was responding 
to cognitions concerning his own efforts. The 
study reported here was designed to determine 
whether a person will also construe the actions 
of people he observes to fit this pattern of 
beliefs. If it is more comfortable for a person 
to believe that his own productive efforts will 
lead to a desired outcome then he should also 
be more comfortable if he can perceive this 
causal relation between personal effort and out- 
come operating in the world at large.2 It would 
certainly be difficult for him to maintain this 
system of beliefs for himself if he did not receive 
support for its validity in what he saw going on 
around him (Festinger, 1954). 

The strategy employed to test this general 
hypothesis was to have subjects observe two 
people working at a shared task. The subjects 
were to be aware at the outset that one of the 
two workers was selected by some chance factor 
to receive a sizable amount of money for his 
efforts and that the other worker was to get 
nothing. 

The pretest of the experimental design in- 
dicated that one of the two workers was con- 
siderably more attractive than the other to the 
subjects selected for the study—female college 
students. 

The original hypothesis predicted that observers 
would judge the worker selected for 
payment as having contributed more to the solution 
of the problem. If the worker selected for 
payment were also personally attractive this 
should fit well with the other cognitions: “he [was] 
selected for payment” and “he has made [a] 
greater contribution.” However, if the worker 
selected were the less attractive one this should [create] 
dissonance for the observers. The subject might 
resolve this by deciding that the nonrewarded 
worker was not more attractive, after all. The 
pretest indicated that this would be a difficult 
resolution, since the two workers created quite 
distinct impressions. Thus if the attractiveness 
of the two workers remained stable and the outcome 
was clear then the subjects would have 
little choice but to arrange their cognitions concerning 
the performance of the workers so as to 
reduce the impact of the discrepant outcomes. 
In general, the observers probably would prefer 
to believe that the attractive worker performed 
better at the task than the unattractive one; 
however, the information that the attractive 
person will not be paid would make any cognitions 
of his excellent performance relatively painful. 
As a consequence the subjects in this condition 
should judge the attractive worker as 
less productive than when he is to be rewarded. 
In addition, the observers would also be inclined 
[...] of the workers did very well. Their cognitions 
might be represented as “I like Tom much 
better than Bill.” “Although Bill is getting paid 
[...] would make the discrepancy between the efforts of 
Tom (attractive) and his lack of reward even” [...] 
the worker not selected; however, when the observers 
believe the unattractive person has been 
selected for payment they should lower the 
ratings of the productivity of the entire group. 

2 There are probably circumstances when a person 
prefers to believe that there is not a connection between 
productive effort and outcome. Although it 
is not the purpose of this paper to explore these 
circumstances thoroughly it is obvious that in trying 
to ward off feelings of guilt, shame, or envy, a 
person may be motivated to feel that he is not 
responsible for his own misdeeds or that another 
person did not earn his rewards. 


METHOD 
Subjects 


The subjects were 22 freshman and sophomore 
female college students. They were exposed to the 
experimental situation in small groups, assigned 
alternately to one of the two experimental condi- 
tions. There were no systematic differences among 
the groups or between conditions in terms of age 
or educational level of the subjects. 


Procedure 


The subjects were seated in a small darkened 
auditorium facing a one-way mirror. From their 
seats they could see into a well-lit room containing 
a table with two chairs. They were led to believe 
that the room would soon be occupied by two 
other subjects who had volunteered to work on an 
anagrams task. Immediately after being seated the 
subjects were asked to wait while the experimenter 
and his assistant completed checking the microphone 
in the workroom which was connected to the 
speaker in the auditorium. After establishing the 
impression that the sounds coming out of the loud-
speaker originated from the workroom, the ex-
perimenter closed the curtains in front of the viewing
window and explained the purposes of the experi- 
ment to the observers. 

The subjects were led to believe that the experi-
menter was investigating the productivity of groups
working at an intellectual task. The experimenter 
explained that 


much of the research in this area had been 
stymied because of the unavailability of adequate
methods for assessing the contribution of an in-
dividual member to the solution of the problems
which faced the group. [At this point the ex- 
perimenter introduced the specific situation.] In 
order to study this, I need to develop some 
reliable and meaningful rating scales. The best 
way I know of doing this is to ask a number of 
people to observe various groups working on an 
intellectual task. The observers are merely to de- 
velop impressions and hunches concerning what 
they observe, then by a process of pooling these 
various individual impressions I can begin to 
develop some categories to be used in more 
specific judgments. 


The subjects were then told they were to observe
two students who volunteered to work on an anagrams
task for 15 minutes. To establish a rationale for the dif-
ferential payoff, the experimenter told the subjects
that since it was important for the workers to take
the task seriously, funds available from a grant would
be distributed among the workers as added payment.
However, the experimenter explained further, the
funds were so limited that only half of the workers
could be paid an amount that would insure a 
difference in effort. The experimenter then said that 
the fairest way he knew of selecting a worker for 
payment was by chance—that is, by letting the 
workers draw numbers from a hat. The experimenter 
told the subjects that all workers who picked an 
[even/odd] number would be paid $3.50 at the 
end of the experimental session; the experimenter 
also pointed out to the subjects that the workers 
would not be told in advance whether they had 
chosen a payment number because it was im-
portant that both workers do their best during the
work period. 

At this point the experimenter remarked that the 
workers were probably waiting for him. Before
leaving, the experimenter stated that previous re- 
search had indicated it was better if the subjects 
(observers) did not actually see the workers but 
merely heard their ideas, since the observers might 
be influenced in their judgments by such irrelevant 
factors as the clothing or physical appearance of the 
workers. As the experimenter left the room the as- 
sistant threw a hidden switch that began a tape 
recording which played through the same speaker 
previously established as being connected to the 
microphone in the workroom. According to inter-
views conducted after the experiment, the subjects
believed they were actually listening to two people
working at the anagrams task. 

All subjects heard the same tape which began 
with the experimenter introducing the workers to 
one another and to the task. The experimenter ex- 
plained the nature of the “anagrams task” and the 
method of assigning payment. After the experi-
menter’s introduction the subjects heard the workers
agree to do their best. The workers then selected 
their numbers out of a hat. The subjects heard one 
worker, “Tom,” tell the experimenter that he picked 
Number 19 and the other worker, “Bill,” tell the 
experimenter that he picked “26.” The experimenter
repeated the numbers and mentioned that he was 
recording them. 

The experimenter then interviewed both workers 
so the subjects could become familiar with their 
voices. Both workers identified themselves as fourth- 
year engineering students who, as it turned out, had 
roughly the same preferences in music, TV programs, 
and the movies. Following the conversation, the 
workers performed the task for 15 minutes. An 
actual word count and the ratings of two independ- 
ent judges indicated that the two workers did 
about equally well at the task. Actually the workers 
were two graduate assistants, one of whom, Tom, 
had been a professional radio announcer and had 
an unusually deep and resonant voice. In the post- 
experimental interviews the subjects indicated a 
great deal of unanimity concerning the physical 
descriptions of the two workers. Tom was described 
as “tall, dark, and handsome” and Bill as “thin, 
with glasses.” 

After the subjects heard the tape they filled out 
some “tentative rating scales” using their “impres-
sions and hunches.” Finally the experimenter con-
ducted open interviews with each group of subjects.


Rating Scales 


The first scale on which the subjects rated the 
workers was the most important in terms of the 
hypotheses of the study. The subjects were asked: 


1. How much of a real contribution did each 
subject make to the group task of forming words? 


The scale was represented graphically by a line 
divided into 40 equal units. The subjects then rated
the workers on two other scales concerned with 
more specific aspects of performance: 


2. How much effort did each subject seem to 
put forth in trying to do his task? 
3. How creative were the ideas of each subject? 


The next question was designed to obtain informa- 
tion about the expectations ascribed to the workers 
by the subjects: 


4. What is your impression of the expectations 
of the subjects? How certain or uncertain was 
each subject that he was one of those who were 
going to receive the $3.50 at the end? 


The subjects then rated both workers on a number 
of polar dimensions concerned with personal char- 
acteristics such as likable-unlikable, mature-im- 
mature, etc. Scores on each of these dimensions 
were combined to yield a general index of pref- 
erence for each of the “workers.” Finally, the sub- 
jects were asked which of the two they would choose 
to work with in a similar situation. 

A question concerning the subjects’ comfort was 
added after one “Tom-paid” group had been run. 
Seven of the subjects in the Tom-paid condition and 
11 in the “Bill-paid” condition were asked, 


5. How do you feel about your participation
in this kind of task?


The extremes of this scale were “I like it very much,
very comfortable” and “dislike it very much, very
uncomfortable.”


Experimental Conditions 


The experimenter’s initial instructions to the sub-
jects indicated that the number a worker picked
out of a hat would determine whether the worker
would be paid or not. In one condition the sub-
jects were told that the odd number was the pay-
ment number, indicating that Tom would be paid
at the end. In the other condition, the subjects were
told that the even number was the payment
number, indicating that Bill would be paid. The
subjects were expected to make these connections
for themselves after hearing the workers tell the
experimenter what number they picked. The designa-
tion of “odd” or “even” by the experimenter con-
stituted the only difference between the two con-
ditions.


RESULTS

Table 1 presents a summary of the major 
findings of the study. The ratings of the real 
contribution of the workers provide the most 
direct test of the hypotheses. An examination 


for payment he was rated as having contributed 
more than Bill (Tom M = 7.90, Bill M = 11.45).
When the subjects were led to believe that Bill
had been selected for payment he was rated as
having contributed more to the common task
than Tom (Tom M = 16.72, Bill M = 12.72).


TABLE 1

A COMPARISON OF THE RATINGS OF THE “WORKERS” BY THE OBSERVERS
IN BOTH “PAYMENT” CONDITIONS

                        Condition 1: Tom paid       Condition 2: Bill paid        t
                        T rating      B rating      T rating      B rating        Condition 1    T versus B(b)  Condition ×
Rating(c)               M      SD     M      SD     M      SD     M      SD       versus                        Worker(b)
                                                                                  Condition 2(a)

Real contribution       7.90   5.38   11.45  5.03   16.72  7.82   12.72  6.5[?]   2.41**         [illegible]    [illegible]
Effort                  6.45   5.54   7.81   7.05   10.27  4.93   12.36  1.43[?]  1.86*          1.09           .20
Creativity              8.72   5.11   20.54  4.54   17.63  7.29   21.81  8.27     2.17**         4.66***        2.22**
Expectation of money    14.27  5.87   12.45  7.19   17.09  7.51   8.72   4.67     .07            2.15**         1.30
Overall preference      37.81  14.48  56.09  12.86  38.54  11.35  58.63  18.74    .25            5.53***        .21

Note. An analysis of variance was computed for each of the ratings in order to
arrive at the appropriate “error” term against which to test the predicted
differences between the means.
a The standard error was derived from the between-subjects error term yielding
df = 20 (see Lindquist, 1953, p. 226, Type I design).
b The standard error was derived from the within-subjects error term yielding
df = 20 (see Lindquist, 1953, p. 226, Type I design).
c All scales were scored so that the lower the score the more “positive” the rating.
* p < .05, one-tailed test.
** p < .02, one-tailed test; p < .05, two-tailed test.
*** p < .001, one-tailed test.

This interaction effect is significant at the .05
level. Another way of looking at the data is to 
count the number of subjects in each condition 
who rated the person selected for payment as 
having made a greater contribution. When Tom 
was selected for payment 9 out of the 11 sub- 
jects rated him as having made the greater con- 
tribution; and when Bill was selected for pay- 
ment 7 out of 11 rated him higher. A Fisher 
exact test (Siegel, 1956) of these two distribu- 
tions yields a p= .04. These findings confirm 
Hypothesis 1. 
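
The Fisher exact probability reported above can be checked from the two counts given in the text. The following is only a sketch under stated assumptions: it assumes that no observer gave tied ratings, so that the 2 × 2 table is 9 and 2 for the Tom-paid condition and 4 and 7 for the Bill-paid condition, and that the reported p is one-tailed. The function name is illustrative and does not come from the original report.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """One-tailed Fisher exact p for the 2 x 2 table [[a, b], [c, d]].

    Returns the probability, with all margins fixed, of observing a count
    of `a` or larger in the top-left cell.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)
    upper = min(row1, col1)
    return sum(
        comb(row1, k) * comb(row2, col1 - k) for k in range(a, upper + 1)
    ) / total


# Rows: payment condition. Columns: which worker was rated the greater contributor.
# Tom paid:  9 rated Tom higher, 2 rated Bill higher (assumed: no ties).
# Bill paid: 4 rated Tom higher (assumed: no ties), 7 rated Bill higher.
p_value = fisher_one_tailed(9, 2, 4, 7)
print(f"one-tailed p = {p_value:.3f}")
```

Under these assumptions the probability comes out close to .04, in line with the value reported in the text.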

It is clear from the overall preference rat-
ings that Tom created a much more at-
tractive impression than Bill (t = 5.53, p < .001)
and that this impression was not affected by the
payment manipulation (Condition 1 versus Con-
dition 2, t = .25). The subjects’ choice of a
potential work partner confirmed their general
preference for Tom. Ten out of the 11 subjects
in the Tom-paid condition selected him as some-
one with whom they would prefer to work and
9 out of 11 in the Bill-paid condition also selected Tom.

According to Hypothesis 2, the combined 
ratings for both workers’ contribution to the 
task should be lower when Bill (the less at- 
tractive person) was selected for payment than 
when Tom was selected. The comparison of the 
combined ratings in Condition 1 (Tom paid) 
with the ratings in Condition 2 (Bill paid) 
yields a t of 2.41, p < .02, with the means in the 
predicted direction. 

No serious predictions could be made as to 
what aspects of the overall contributions of the 
workers were most likely to be affected by the 
assigning of payment. Apparently both the
amount of effort and the creativity of the ideas
ascribed to the workers were reliably affected by
whether the attractive person (Tom) or the
unattractive person (Bill) was selected for pay-
ment (effort, Condition 1 versus Condition 2,
t = 1.86, p < .05; creativity, Condition 1 versus
Condition 2, t = 2.17, p < .02). Even though
Tom was rated as more creative by the subjects
in both conditions (T versus B, t = 4.66, p <
.001) his rating was affected much more by the
payment than was Bill’s. Tom’s mean creativity
score worsened from 8.72 in the condition in
which he was selected for payment to 17.63 in
the condition in which the other worker was
selected, whereas Bill’s worsened very slightly
from 20.54 to 21.81 (the interaction effect t =
2.22, p < .02). This interaction did not appear
in the effort ratings (t = .20).

The subjects in Condition 2 (Bill paid) rated 
the workers lower on all three estimates of per- 
TABLE 2
SUBJECTS’ COMFORT IN THE EXPERIMENTAL SITUATION

                                         N     M(a)    SD
Condition 1: Tom paid (attractive)       7     3.30    1.28
Condition 2: Bill paid (unattractive)    11    5.54    4.12

a The lower the score the greater the comfort.


formance than did the subjects in Condition 1 
(see the first three ratings in Table 1). If the 
initial rationale for this difference is correct then 
the subjects in Condition 2 should be less com- 
fortable than those in Condition 1 (Tom paid). 
The subjects should also attempt to explain 
away their lower ratings of the attractive worker 
(Tom) in Condition 2 by ascribing to him less 
interest in the money (see Rating 4). By and 
large these expectations were borne out by the 
data. 

The data provide some support for the idea 
that the selection of the attractive worker or 
unattractive worker for payment made some 
personal difference to the subjects. The subjects 
in the Tom-paid condition (attractive) rated 
themselves as somewhat more comfortable in 
the situation than did those in the Bill-paid 
condition (unattractive) (see Table 2, t = 1.77, p
< .05). Apparently the selection of Bill (unat-
tractive) created a less desirable state of affairs
for the subjects than when Tom (attractive) was 
selected for payment. The difference between 
the mean ratings in these two conditions must 
be interpreted with caution, in light of the 
marginal value of t and the difference between
the variances of the ratings (F = 10.36, df
= 11/7, p < .01). This difference in variances,
though unpredicted, would follow from the idea 
that the Bill-paid condition (unattractive) pre- 
sented the subjects with a conflict. The greater 
range of expressed comfort in the situation could 
be attributed to the individual differences in 
resolving the conflict. Although no statistical 
test of this explanation was feasible, it is inter- 
esting to note that the two least comfortable 
subjects in the Bill-paid condition rated Tom as 
having contributed more to the task than Bill. 
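
The variance ratio quoted in this paragraph can be recovered from the two standard deviations in Table 2. The sketch below assumes that the Table 2 entries are 1.28 (Tom paid) and 4.12 (Bill paid), as they are read from the table, and that F was formed as the larger variance over the smaller; the variable names are illustrative.

```python
# Standard deviations of the comfort ratings as read from Table 2 (assumed values).
sd_tom_paid = 1.28
sd_bill_paid = 4.12

# F is the ratio of the larger variance to the smaller variance.
f_ratio = (sd_bill_paid ** 2) / (sd_tom_paid ** 2)
print(f"F = {f_ratio:.2f}")
```

The ratio agrees with the F of 10.36 given in the text, which also supports this reading of the two standard deviations.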

The subjects perceived Tom and Bill quite 
differently in terms of their expectations of 
being paid (see Table 1: expectation of money, 
T versus B, t = 2.15, p < .02). When Tom (at-
tractive) was selected for payment both workers
were rated around the midpoint signifying un- 
certainty about whether they expected to be 
paid or not (Tom M=14.72, Bill M=12.45, 
midpoint = 15). When Bill was selected for pay- 
TABLE 3
RATINGS OF WORKERS ON “EASY-GOING–BOSSY” DIMENSION

                   Experimental conditions
              1: Tom paid        2: Bill paid
Worker        M       SD         M       SD
Tom           3.72    1.76       2.09    1.16
Bill          2.18    1.46       4.63    2.56

Analysis of variance
Source                         df     MS       F
Experimental condition (A)     1      1.84     .71
Error (between subjects)       20     2.57
Worker (B)                     1      2.75     .58
A × B                          1      46.02    9.83*
Error (within subjects)        20     4.68

* p < .01.


ment his mean score went up to 8.72, indicating 
that he was perceived as expecting to be paid, 
while Tom’s went down to 17.09. Only in this 
Bill-paid condition is there a significant difference 
between the ratings of Tom and Bill on this 
scale, with Tom rated as less expectant of the 
payment (t = 2.52, df = 20, p < .02).

There is some additional support for the idea 
that the subjects in the Bill-paid condition per- 
ceived Tom as less motivated by the potential 
reward. An examination of the 15 scales which
were combined to derive the overall preference
score revealed that the only scale affected sig-
nificantly by the experimental conditions was
the one whose extremes were defined by easy-
going and bossy. Apparently the assignment of
payment led the subject to perceive the selected
worker as more “bossy” and the other worker
as more “easy-going” (see Table 3: Condition
× Worker, F = 9.83, p < .01). This effect held
whether Tom or Bill was selected for payment. 
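
The F ratios in Table 3 follow from dividing each mean square by its error term: the between-subjects error for the condition effect, and the within-subjects error for the worker effect and the interaction. The sketch below uses the mean squares as read from Table 3; the dictionary layout and names are illustrative only.

```python
# Mean squares as read from Table 3.
ms_between_error = 2.57   # Error (between subjects), df = 20
ms_within_error = 4.68    # Error (within subjects), df = 20

effects = {
    "Experimental condition (A)": (1.84, ms_between_error),
    "Worker (B)": (2.75, ms_within_error),
    "A x B": (46.02, ms_within_error),
}

# Each F is the effect mean square divided by the matching error mean square.
f_ratios = {name: ms / error for name, (ms, error) in effects.items()}

for name, f_value in f_ratios.items():
    print(f"{name}: F = {f_value:.2f}")
```

The interaction ratio reproduces the tabled 9.83, and the two main-effect ratios agree with the tabled .71 and .58 to within rounding of the printed mean squares.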


DISCUSSION 


The overall findings of this study support the 
notion that a person takes into account the 
outcome of a social event in making sense out 
of what he has observed, or is about to observe. 
Apparently this is true even when the outcome 
is established as being fortuitously related to 
the behavior of the actors in the situation. The
observers in this experiment construed a social
event to fit rather simple and understandable 
processes. One of these can be paraphrased as 
“people deserve what happens to them” or 
“once I know what has happened to someone 
I will be more comfortable if I can believe that 
he has earned it.” In this same vein, it is prob- 
ably more comfortable to believe that people 
in a seriously deprived state earned their condi- 
tion by some personal failing than to believe 
that deprived people are fortuitous victims of 
some social processes over which they have little 
personal control. 

There were distinct differences in the attrac- 
tiveness of the two actors in these experimental 
social scenes. The overall attraction to the two 
actors was not affected by the outcome, or 
payment. One worker, Tom, was equally pre- 
ferred whether he was selected for payment or 
not. This suggests that the assignment of pay- 
ment to the workers operated in a meaningful 
and rather selective fashion on the cognitions 
of the observers, rather than in terms of some 
general enhancement of the stimulus value of 
the paid worker. This rather selective effect of 
the cognition of payment suggests that the 
subjects were trying to bring some sort of order 
into their ideas about what was happening. When 
they knew that the attractive person was not 
selected for payment they were less comfortable 
in the task and were less favorably impressed 
with the performance of both workers. 
SELF-DESCRIPTION OF HOSTILITY AND ITS CORRELATES 1
University of Washington


A series of 3 studies dealing with the correlates of hostility was carried out.
The 1st study involved the correlation of scores on a hostility scale (H) with
performance on a sentence unscrambling task. This task permitted S to solve
problems in either connotatively hostile or nonhostile ways. It was found that
high H scorers made more connotatively hostile responses than did low H Ss.
In the 2nd study, high H scorers remembered more hostile characteristics
attributed to a hypothetical individual than did low H scorers. The 3rd study
compared high and low H Ss with respect to reports of childhood experiences.
High H Ss described their parents as being more punitive and more dissatisfied
with them than was the case for low H Ss.


This paper reports a series of studies dealing 
with the behavior of individuals who differ in 
hostility scores obtained on a paper-and-pencil 
test. The measure of individual differences was 
a Hostility (H) Scale developed by Sarason 
(1958, 1961). This scale has been used in two 
studies in which the subjects’ verbal responses 
have been reinforced by the experimenter 
(Ganzer & Sarason, 1964; Sarason & Minard, 
1963). The findings of these experiments sug- 
gested that high H scorers may be less responsive 
to the experimenter’s reinforcements than are 
low H scorers. These studies used response 
classes which did not specifically deal with 
hostility. In another experiment (Sarason, 1962), 
in which the response class was usage of mildly 
hostile verbs, high H subjects run by high H 
experimenters emitted a greater number of 
mildly hostile verbs than did all other subjects. 

The present research seeks to explore implica- 
tions of these findings in two ways: first, by 
comparing high and low H subjects on a task 
that permits the subject to solve problems in 
either connotatively hostile or nonhostile ways; 
and, second, by comparing high and low H sub- 
jects on their ability to recall material which is 
connotatively hostile in nature. If high and low 
H scorers differ in the tendency to emit responses 
which are hostile in character, then definite pre- 
dictions might be generated for the present 
studies. Specifically, high H scorers should make
more connotatively hostile solutions than low
H scorers. It would also be expected that high
H scorers would recall more hostile material
than would low H scorers.

If these results were to occur, it would be
appropriate to begin to entertain interpretations
of the obtained differences between high and
low H scorers. One question that comes to mind
in light of formulations such as the frustration-
aggression hypothesis is: do high and low H
scorers differ in their perceptions of frustrations
and conflicts in their own lives? The last study
in the series reported here seeks to provide data
related to this question by obtaining from the
subjects retrospective responses relating to frus-
trations and problems experienced in their own
development.

1 This research was supported in part by a grant
from the Graduate School Research Fund of the
University of Washington.


STUDY I


Two experiments were carried out in this 
study. In each of them subjects were adminis- 
tered the H Scale, and, at a later date, a sentence 
unscrambling task. This task involved presenta- 
tion to the subject of a series of four-word 
groupings. The subject was asked to select three 
words out of the four so that a sentence would 
result. Each four-word group could be used to 
form either a neutral sentence or one which had 
hostile connotations (for example, the swat see fly).


Method 


Subjects. The subjects in the first of the two 
experiments were 37 students enrolled in a summer 
introductory psychology course at the University of 
Washington. The subjects in the second experiment 
were 75 students enrolled in a spring quarter in- 
troductory psychology course at the University of 
Washington. 

Procedure. In each experiment, prior to and 
independent of administration of the sentences task, 
the subjects had been administered the H Scale. 
The sentences task was the 40-item one developed 
by Watson, Pritzker, and Madison (1955). The 
measure was the number of sentences constructed 
with hostile connotations. Both the H Scale and 
sentences task administrations were in groups.


Results and Discussion 


For the first experiment, the correlation be- 
tween the H Scale and number of hostile 
sentences was .40 (p < .02). For the second
experiment, the correlation was .22 (p < .05).
Examination of the data of both experiments in 
terms of sex indicated that: males, regardless of 
H scores, constructed more hostile sentences than 
did females, and male and female high H scorers 
constructed more hostile sentences than did low 
H scorers. 

The obtained positive correlations between 
H scores and hostile sentences support the hy- 
pothesis that the H Scale is related to the 
tendency of subjects to emit responses which are 
hostile in connotation. 


STUDY II


The results of Study I and of previous re- 
search (Sarason, 1962) showed that the H Scale 
was related to behavior in situations in which 
the subject has an opportunity to either emit 
or not emit responses which are connotatively 
hostile. The three experiments in Study II dealt 
with errors by subjects differing in H scores in 
recall of hostile characteristics ascribed to a
hypothetical individual. 


Method 


Subjects. All of the subjects in this series of 
experiments were students enrolled in introductory 
psychology courses at the University of Washington. 
The Ns for the three experiments were 56, 120, and 
148. Prior to and independent of performance on 
the experimental task, the subjects had been ad- 
ministered the H Scale. In the first experiment high 
H scorers were defined by scores of 9-18; low scorers 
had scores of 0-8. In the second and third experiments
high H subjects had scores of 8 and above, and 
low scorers had scores of 7 and below. 

Procedure. These experiments were carried out 
during regular classroom sessions. In the first ex- 
periment, at the beginning of the hour the ex- 
perimenter distributed to each subject a mimeo- 
graphed description of a hypothetical person named 
John. This description consisted of five paragraphs 
in which there were 25 descriptive statements. Five 
of these statements related to John’s hostility (for 
example, “At times John feels like smashing things”; 
“When someone makes a rule he doesn’t like, John 
is tempted to break it”). The remaining statements
dealt with John’s physical characteristics, interests,
family relationships, and anxieties.2

The subjects were instructed to read the descrip-
tion of John twice. When this was done, the de-
scriptions were collected and the experimenter then
withdrew from the classroom. Towards the end of
the class hour the experimenter returned and dis-
tributed to the subjects a sheet containing a series
of items dealing with John’s characteristics. The
task for the subjects was to check one of six alter-
natives in terms of the description of John which
they had previously read. Examples of these rating
scale items are: How well does John like baseball?
When he doesn’t agree with a rule, how does John
react? For this latter item, the subject checked
one of the following alternatives: always breaks it,
ignores it, tempted to break it, keeps quiet, feels
badly, conforms.

2 For most of the subjects, scores on the Test
Anxiety Scale (Sarason, 1958) were available. These
scores were found to be unrelated to the subjects’
recall of John’s anxiety characteristics.

The second and third experiments were identical 
to the first one except that the description of John 
was not presented in paragraph form but rather as 
a list of descriptive statements, and three additional 
hostility characteristics were added to the descrip- 
tion. In the third experiment the subject’s recall of 
John’s characteristics was obtained either at the 
end of the class hour (N = 48) or 24 hours later 
(N = 100).


Results and Discussion 


The measure employed in these experiments 
was the number of errors made by the subject 
in describing John. An error was made when- 
ever the subject checked any alternative other 
than the one which was actually presented in the 
description of John. 

In all three experiments, error-score compari- 
sons were made between male and female sub- 
jects. In every case the obtained differences did 
not approach significance. Hence, our results 
will be presented in terms of difference between 
high and low H scorers. 

In the first experiment, the mean number 
of errors made by high H subjects (N = 26)
concerning John’s hostility was .73. The com-
parable low H mean (N = 30) was 1.70. The
difference between these means was significant
at the .01 level. The correlation between H
scores and errors was −.29 (p < .05).

In the second experiment, the mean number 
of errors made by high H subjects (N = 61) 
was 1.50. The comparable low H mean (N = 68)
was 2.30. The difference between these means
was significant at the .05 level. The correlation
between H scores and errors was −.21 (p < .05).
It should be noted again that the number of 
hostility items was increased from 5 to 8 in the 
second and third experiments. In this second 
experiment high and low H groups were also 
compared in terms of errors other than those in-
volving John’s hostility characteristics. No sig-
nificant differences were obtained for these non-
hostility characteristics.

In the third experiment, comparisons were 
made between high and low H scorers for 40-
minute and 24-hour recall intervals. The former
interval is the same as the one employed in the
first and second experiments. For the 40-minute
group, the mean number of errors made by high
H subjects was 1.48 (N = 29). The comparable
mean for low H subjects was 2.24 (N = 29).
The difference between these means was sig-
nificant at the .05 level. The correlation between
H scores and errors was −.38 (p < .01). For
the 24-hour comparisons, the high and low H
groups (Ns = 60 and 40, respectively) did not
differ significantly in errors. Their means were
2.56 and 2.55, respectively. The correlation be-
tween H scores and errors was nonsignificant
(r = −.06). The difference between the high H
40-minute and 24-hour groups was statistically
significant (p < .05). For the low H 40-minute and
24-hour groups there was not a significant
difference.


On the basis of the present findings and those
of previous research it seems reasonable to con-
clude that individuals who receive high scores
on the H Scale have a tendency not only to
describe themselves as possessing more hostile
characteristics than do low H scorers, but they
also seem more likely to emit more connotatively
hostile responses in a variety of situations than
do low H scorers. To what might one attribute
these obtained differences? There are, of course,
many possibilities. For example, high H scorers
may simply have a stronger response set than
do low H scorers to emit connotatively hostile
verbal responses. However, regardless of the
hypothesis offered one still wonders: what are
the antecedents of this set or tendency? The
one way in which to get at antecedent con-
ditions would be to carry out longitudinal
research. Thus, if one worked within the frame-
work of the frustration-aggression hypothesis,
one would want to know to what extent high and
low H scorers differ in their life histories gen-
erally and, more specifically, their histories
with regard to significant family relationships
(Berkowitz, 1958; Dollard, Doob, Miller,
Mowrer, & Sears, 1939).

In the absence of longitudinal data, it was felt
that a useful procedure would be to obtain from
subjects differing in H scores retrospective ac-
counts of their own development. These ac-
counts clearly would not permit firm conclusions
about causal relationships. However, they might
prove valuable in providing indications of differ-
ences between high and low H scorers in terms
of their perceptions and interpretations of de-
velopmental events and experiences. It was to
this problem that our third study was addressed.


STUDY III


The purpose of this study was to compare the 
responses of high and low H subjects on rating- 
scale items dealing with childhood, family, and 
school experiences. We hypothesized that in- 
dividuals with relatively strong hostile tendencies 
would retrospectively regard their earlier experi- 
ences as having been less satisfactory and more 
stressful and, to a greater degree than low hostile 
individuals, would perceive their parents as having
been more rejecting and more punitive. We were, 
then, adopting the view that hostile responses 
on the H Scale are, in some sense, a function 
of perceived stimulus conditions in development. 
It was felt that data obtained concerning in- 
dividuals’ subjective assessments of these con- 
ditions would be useful in ultimately empirically 
studying them. 


Method 


Subjects. The subjects were 119 students enrolled 
in introductory psychology courses at the University 
of Washington. There were 59 females and 60 males. 
The females had a mean H score of 8.0; the mean 
for males was 9.5. High H females had scores of 9 
and above (N=29); low females had scores of 
8 and below (N=30). High H males had scores 
of 11 and above (N = 30); low H males had scores 
of 10 and below (N = 30). 

Procedure. Subsequent to, and independent of, the 
administration of the questionnaire which provided 
the H Scale scores, the subjects were group ad- 
ministered a questionnaire dealing with a sampling 
of their earlier life experiences. The instructions 
were: 

This is a questionnaire to find out some of the 
experiences that college students have had during 
their childhoods. The questions pertain to the 
period of your life from birth to high school 
graduation and should be answered according to 
your experiences during that period. 


There were 96 items, to each of which the subject 
responded on a 5-point rating scale. 


Results and Discussion 


The data were analyzed separately for males
and females. For each item a t test was performed
for males and for females. Thus, there
were 192 t tests. The number of these which were
significant at the .05 level or better was 40.
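
To put the count of 40 significant tests in perspective, the following minimal sketch computes how many of 192 tests would be expected to reach the .05 level by chance alone, and how improbable 40 or more such results would be. It assumes, purely for illustration, that the 192 tests are independent; the items and the two sexes were not in fact independent samples, so the tail probability is a rough guide only. The function name `binomial_tail` is hypothetical.

```python
from math import comb


def binomial_tail(n: int, k: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_tests = 192        # 96 items, each tested separately for males and females
alpha = 0.05         # significance level used in the report
n_significant = 40   # number of tests reported as significant

# Number of "significant" results expected if every null hypothesis were true.
expected_by_chance = n_tests * alpha

# Probability of 40 or more chance results under the independence assumption.
tail_probability = binomial_tail(n_tests, n_significant, alpha)

print(f"expected by chance: {expected_by_chance:.1f}")
print(f"P(at least {n_significant} by chance): {tail_probability:.2e}")
```

Under this simplifying assumption roughly 10 of the 192 tests would be expected to reach significance by chance, so the observed 40 is well in excess of that figure.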

Fourteen items dealt with parents’ degree of
satisfaction with the child and his behavior (for
example, “How disappointed were your parents
when you didn’t live up to their expectations of
you”). In every case, for both males and females,
the obtained ratings suggested that high H
subjects perceived their parents as being less
satisfied with them than was the case for low H
subjects. Seven of the t’s were significant beyond
the .05 level.

There were 22 items dealing with the roles of 
parents as models for the child and with the par- 
ents as source of support, advice, and pleasure 
for the child (for example, “How often did you 
ask for help, advice, and information from your 
father?”). There were 11 significant t’s. The results
suggested that high H scorers see their parents
as less approachable than do low H scorers.

There were 24 items dealing with disciplinary
procedures within the family (for example, “How
resentful were you about the restrictions placed
upon you by your mother?”). There were 11 significant
t’s. They suggested that high H scorers
perceived their parents as being more strict than
did low H scorers.

Eleven of the items pertained to reactions to
teachers and school (for example, “How often
did you talk back to the teacher?”). The seven
significant t’s reflected a greater tendency for high
than low H scorers to describe themselves as behaving
aggressively to teachers.

The results of Study III clearly indicate that 
subjects who obtain high scores on the H Scale 
describe their development as being character- 
ized by parental affectional ties which are weaker 
than seems to be the case for low H scorers. The 
parents of high H scorers are described as more 
strict, more prone to criticize, and more dissatisfied
with them than is the case for low H scorers.
In addition, high H subjects describe their par- 
ents as being less approachable and interested in 
them than do low H scorers. They also appear to 
be more reluctant than low H scorers to perceive 
their parents as models which might be emulated. 

Interestingly, the differences observed between 
high and low H subjects in describing their par- 
ents seem to be paralleled by the differences in 
descriptions of their teachers. As was the case 
with parents, high H subjects present descrip- 
tions of their teachers which suggest less satisfy- 
ing interpersonal relationships than in the case of 
low H scorers. Comparing high H subjects with 
low H subjects, it would seem that they were 
relatively less happy in school, and that they 
were relatively more likely to express hostility 
towards their teachers. 

A word should be said concerning sex differ- 
ences in the descriptions of development given by 
subjects. There were many more significant differences
between high and low H females than
between high and low H males. It should be observed,
however, that on every item on which
high and low H females differed significantly,
there was a comparable tendency, approaching
the level of statistical significance, for high and
low H males to differ. It, thus, seems that
differences in results between males and females
are ones of degree rather than of pattern.

Taking stock of the evidence accrued, we conclude
that the H Scale does seem to be related to the
likelihood of emission of responses which seem to be
hostile, at least in connotation. In addition to high H
subjects describing themselves on the H Scale itself as
being more angry and more likely to express hostility
than is the case for low H subjects, high H subjects
when compared with low H subjects: emit more hostile
responses in a simple sentence unscrambling task;
recollect more accurately hostile characteristics
attributed to a hypothetical other individual;
perceive their parents as less satisfying and adequate
identification models, less interested in them, and
more strict disciplinarians; and perceive themselves to
express more easily hostility and dissatisfaction to
teachers. The present findings concerning the generality
of hostility seem to be quite similar to those reported
by Buss, Fischer, and Simmons (1962). Two lines of
investigation seem to be indicated by these findings.
One is the study of high and low hostile subjects under
experimental conditions (Berkowitz, 1962; Feshbach,
1955). The other is the comparison of high and low
hostile scorers in terms of their independently
assessed histories (Sears, 1961; Sears, Maccoby, &
Levin, 1957).


EFFORT AND SUBJECTIVE PROBABILITY 1


HOMER H. JOHNSON anp IVAN D. STEINER 
University of Illinois 


Yaryan and Festinger have reported data which they interpret to indicate that 
the expenditure of effort in preparation for a possible event increases the 
belief that the event will occur. The present study explores alternative 
explanations of Yaryan and Festinger’s results. The findings suggest that it 
was Ss’ perceptions of E’s behaviors, rather than their own efforts, which led 
them to increase their subjective probabilities. 


Yaryan and Festinger (1961) have contended 
that people who have been induced to exert a 
great deal of effort in preparation for a possible 
event will believe more strongly in the likelihood 
of the event than will people who have not been 
induced to engage in effortful preparations. As a 
test of this proposition Yaryan and Festinger
informed a sample of female high-school students
that there was a .5 chance that they would
be required to take a special IQ test, and induced
half of their subjects to learn a long list of
difficult definitions which were said to be relevant
to that test. Persons in the other half of
the sample were asked merely to read the list,
which they were assured would be available to
them if they were asked to take the test. At the
conclusion of the experimental sessions, subjects
who had been induced to make the more effortful
preparations expressed higher estimates
of the probability that they would have to take
the test. According to Yaryan and Festinger,
knowledge that one has expended effort in preparation
for an event is dissonant with the belief
that the event has a low probability of occurrence.
Consequently, subjects who received the
effortful treatment raised their subjective
probabilities.


1 This study was conducted by Johnson as MA
thesis research under the direction of Steiner. It
was supported by Grant M-4460 from the National
Institutes of Health, United States Public Health
Service.


Perhaps it was true that the subject’s knowl- 
edge that she had worked hard was responsible 
for her increased subjective probability, but her 
strong belief that she would be required to take 
the test might also have reflected her awareness 
that the experimenter had worked hard in her 
behalf. In the Yaryan and Festinger study the 
experimenter spent far more time with the sub- 
jects who received the effortful treatment than 
she spent with those who received the easy 
treatment. And, since all subjects were told that 
the experimenter knew who would have to take 
the test, it seems reasonable to surmise that 
subjects interpreted the “strong” or “weak” 
efforts of the experimenter to indicate that they 
had, or had not, been chosen. It is possible, 
too, that subjects who received the effortful 
treatment felt better prepared, and that their 
high subjective probabilities reflected their feel- 
ings of preparedness rather than the efforts 
they had expended. Neither the subjects’ views 
of the experimenter’s concern nor their feelings 
of preparedness were measured or controlled in 
the Yaryan and Festinger study. 

The present research proceeds on the assumption
that it was primarily the subjects’ perceptions
of the experimenter’s concern which was
responsible for Yaryan and Festinger’s findings.
It is hypothesized that anything which increases
the perceived concern of the experimenter will
lead to increased subjective probabilities. Among
the variables which may affect perceived experimenter
concern are the efforts the experimenter
expends in the subject’s behalf, the subject’s
belief that the experimenter knows who has been
chosen and why they have been chosen, and
numerous unspecifiable behaviors of the experimenter
that can be interpreted to indicate concern
or lack of it.


PROCEDURE 


The methodology of this study parallels that of 
the Yaryan and Festinger investigation. Female
college students were told they were assisting in the 
development of a new IQ test which 50% of them 
would take in a subsequent session. Half of the 
subjects were asked to memorize the definitions 
employed by Yaryan and Festinger, while the re- 
maining subjects were asked to read through the 
list which they were promised would be made 
available to them if they were required to take the 
test. Subjective probabilities were measured with the 
same 6-step graphic rating scale as was employed 
by Yaryan and Festinger.

However, the procedures of this study differed
from those of Yaryan and Festinger in certain
respects. Subjects worked in groups of 8-12 instead
of individually. Although this procedure permitted
subjects to see that at least a few other girls received
exactly the same treatment they did, this change
should have no relevance to a test of Yaryan and
Festinger’s proposition, which concerns only a relationship
between the subject’s own efforts and her
subjective probability. Moreover, Yaryan and
Festinger told their subjects that everyone would
receive the same treatment. Second, in the present
study the experimenter’s efforts were deliberately
varied. In “easy-experimenter” treatments, the
experimenter read a magazine while the subjects
read or memorized the definitions. In “effortful-experimenter”
treatments the experimenter had the
subjects read magazines for 30 minutes while he
frantically wrote the definitions which the subjects
merely examined for 5 minutes, or he spent 30
minutes displaying placards on which were printed
the symbols and definitions which subjects were
required to memorize. Thus the experimenter’s effortful
role depended upon whether the subjects
were being required to expend great or little effort.
It would have been desirable to hold this role
constant, but we were unable to devise experimental
situations in which this could be done without
making the experimenter’s behaviors appear unnatural
and arbitrary.

A third innovation involved telling half the subjects
that the experimenter did not know who had
been chosen to take the IQ test. Choices were said
to have been made on a purely chance basis. The
remainder of the subjects were given the same bogus
information Yaryan and Festinger gave their subjects:
choices were said to have been made on the
basis of school records and interviews with teachers,
and the experimenter reported that he knew who
had been chosen.

The three manipulated variables (subject effort,
experimenter effort, and selection criteria) constituted
the basis for a 2 × 2 × 2 design. There were
30 subjects in each of the eight treatment categories.

Because Yaryan and Festinger’s results may have
been due to differences in subjects’ feeling of preparedness
rather than to the efforts they had expended,
all subjects in the present study were given
emphatic assurances that their preparation was completely
adequate. In order that we might obtain
data about the success of our experimental manipulations,
and about subjects’ perceptions of experimenter
concern, the subjects also responded to several
6-step graphic rating scales which were not
employed by Yaryan and Festinger. These additional
scales were administered after subjects had indicated
their subjective probabilities.


RESULTS 


At the conclusion of experimental sessions,
subjects answered a questionnaire item which
asked whether people who were scheduled to
take the IQ test had been chosen on the basis
of chance or on the basis of background information.
Without exception the subjects in this study
checked the alternative which corresponded to
the “facts” they had received from the experimenter.
Thus the manipulation of selection
criteria appears to have been successful.

Table 1 reports the means of the responses
made by subjects in the eight treatment categories
to the graphic rating scales administered
at the conclusion of experimental sessions. Responses
have been transformed into numerical
values by assigning the value of 1 to the ends
of the scales which denoted the weakest responses,
and the value of 6 to the other ends
of the scales. Because the ranges of the scores
were somewhat restricted, and because the distributions
were sometimes badly skewed, parametric
tests were not appropriate. Differences
between categories of subjects were tested by
four-celled chi-square tests, with subjects dichotomized
as closely as possible to the median
on each variable.
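
The four-celled test described above can be sketched as follows. This is an illustration under stated assumptions, not the authors' computation: the ratings are invented, the cut is placed strictly above the pooled median (the report says only that subjects were dichotomized "as closely as possible to the median"), and no continuity correction is applied because the report does not say whether one was used. The function names are hypothetical.

```python
from statistics import median


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square, without continuity correction, for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("every row and column total must be nonzero")
    return n * (a * d - b * c) ** 2 / denominator


def median_split_chi_square(ratings_x, ratings_y):
    """Dichotomize the pooled ratings at their median, then test group by high/low."""
    cut = median(ratings_x + ratings_y)
    a = sum(r > cut for r in ratings_x)  # group X, above the median
    b = len(ratings_x) - a               # group X, at or below the median
    c = sum(r > cut for r in ratings_y)  # group Y, above the median
    d = len(ratings_y) - c               # group Y, at or below the median
    return chi_square_2x2(a, b, c, d)


# Hypothetical 6-point ratings, invented for illustration only.
group_x = [5, 5, 4, 6, 5, 4, 5, 6]
group_y = [3, 4, 2, 3, 4, 3, 2, 4]

chi2 = median_split_chi_square(group_x, group_y)
print(round(chi2, 2))
```

Because the test uses only whether each rating falls above or below the cut, it is insensitive to the skew and restricted range that made parametric tests unsuitable here.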


Subjects who were required to memorize the
definitions rated their work as more effortful
than did subjects who merely read the definitions,
and subjects who received the “strong-experimenter-effort”
treatments rated the experimenter’s
efforts more highly than did subjects
who received the “weak-experimenter-effort”
treatments. These differences are significant at
the .01 level regardless of the selection criterion
which was said to have been employed. Thus
the experimental manipulations were successful.

TABLE 1
MEANS OF THE RESPONSES GIVEN BY EIGHT CATEGORIES OF SUBJECTS

Columns: (1) subjective probability (“Do you believe ...”);
(2) subject’s effort (“How hard was ...”);
(3) experimenter’s effort (“How hard did ...”);
(4) perceived experimenter concern; (5) feelings of preparedness.

Treatment category           N     (1)    (2)    (3)          (4)          (5)
Background selection
  Experimenter strong,
    subject strong           30    4.37   3.13   [illegible]  [illegible]  [illegible]
  Experimenter weak,
    subject strong           30    4.10   3.83   [illegible]  [illegible]  [illegible]
  Experimenter strong,
    subject weak             30    4.10   1.70   [illegible]  [illegible]  [illegible]
  Experimenter weak,
    subject weak             30    4.20   [illegible]  [illegible]  [illegible]  [illegible]
Chance selection
  Experimenter strong,
    subject strong           30    4.27   3.43   [illegible]  [illegible]  [illegible]
  Experimenter weak,
    subject strong           30    4.17   3.70   2.67         4.00         4.10
  Experimenter strong,
    subject weak             30    4.10   1.57   3.27         3.70         [illegible]
  Experimenter weak,
    subject weak             30    4.23   1.33   1.87         4.20         3.83

Note.—High numbers indicate stronger or more affirmative answers.

Column 1 of Table 1 reports the means of
the subjective probability scores received by
subjects in the eight experimental categories.
None of the three manipulated variables was
found to have had an effect on these scores;
the obtained chi-square values did not approach
the .05 level of significance. Subjects’ perceptions
of experimenter concern and their feelings of
preparedness (Columns 4 and 5) were also
unaffected by the three manipulated variables.

However, an examination of the data reported
in Columns 1 and 4 indicates that subjects
who perceived high experimenter concern also
had high subjective probabilities. The means in
these two columns are correlated to a significant
degree (Spearman r corrected for ties = .976,
p < .01). For the 120 subjects who received
background-selection treatments the phi coefficient
relating perceived experimenter concern
to subjective probability was +.19 (p < .05),
but for subjects receiving the chance-selection
treatments it was only +.01. For the former
subjects, feelings of preparedness were marginally
related to subjective probabilities (phi
= +.17, p < .10); but for the latter subjects
the phi value was again only +.01. For all 240
subjects perceived experimenter concern was
positively related to subjects’ feelings of preparedness
(phi = +.20, p < .01).
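
The phi coefficients and significance levels reported above can be cross-checked against one another, since for a four-celled table chi-square equals N times phi squared (1 degree of freedom). The sketch below is an editorial illustration under that identity; the critical values are the standard tabled values for 1 degree of freedom, the function names are hypothetical, and the reported phi values are rounded, so the recomputed chi-squares are approximate.

```python
# Standard tabled critical values of chi-square with 1 degree of freedom.
CRITICAL_DF1 = {0.10: 2.706, 0.05: 3.841, 0.01: 6.635}


def chi_square_from_phi(phi, n):
    """For a 2 x 2 table, chi-square = N * phi squared."""
    return n * phi ** 2


def strictest_level_reached(chi2):
    """Return the smallest tabled alpha whose critical value is met, else None."""
    reached = [alpha for alpha, critical in CRITICAL_DF1.items() if chi2 >= critical]
    return min(reached) if reached else None


# (description, reported phi, N) as given in the text.
reported = [
    ("concern vs. probability, background selection", 0.19, 120),
    ("concern vs. probability, chance selection", 0.01, 120),
    ("preparedness vs. probability, background selection", 0.17, 120),
    ("concern vs. preparedness, all subjects", 0.20, 240),
]

for description, phi, n in reported:
    chi2 = chi_square_from_phi(phi, n)
    print(f"{description}: chi-square = {chi2:.2f}, level = {strictest_level_reached(chi2)}")
```

The recomputed values agree with the significance levels given in the text: the same phi of about .20 reaches the .05 level with 120 subjects but the .01 level with 240.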

TABLE 2
MEAN SUBJECTIVE-PROBABILITY SCORES OF SUBJECTS CATEGORIZED ON THE BASIS OF PERCEIVED
EXPERIMENTER CONCERN AND FEELINGS OF PREPAREDNESS

Subjects’ scores on perceived experimenter concern and feelings of preparedness

                        Above average          Above average on one      Below average
                        on both                but not on the other      on both
Category                N    Subjective        N    Subjective           N    Subjective
                             probability            probability               probability
Chance selection        27   4.52              49   4.08                 44   4.14
Background selection    27   4.48              43   4.21                 50   4.02
All subjects            54   4.50              92   4.14                 94   4.08

Note.—Subjects were dichotomized as close to the median as possible.

Table 2 indicates that subjective-probability
scores varied systematically with beliefs about
experimenter concern and adequacy of preparation.
Subjects who received the background-selection
treatments and who fell above the
median on both of these variables had significantly
higher subjective-probability scores than
did those who fell below the median on both
variables. A four-celled chi-square test produced
a value of 6.23 (p < .02), and a six-celled test
which included the subjects who were above the
median on one variable but below the median
on the other yielded a chi-square of 6.25
(p < .05). For subjects who received the
chance-selection treatments neither test produced
significant results. Chi-square was 2.61 (p < .20)
for the two extreme categories, and 2.92
(p < .30) for all three categories. An analysis
which combined the two selection criteria
yielded a chi-square of 4.87 (p < .05) for the
two extreme categories, and 6.61 (p < .05) for
all three categories.
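
The probability levels attached to these chi-square values can be recovered with closed-form expressions, on the assumption that the four-celled tests carry 1 degree of freedom and the six-celled (two by three) tests carry 2. The following sketch uses only the Python standard library; the function name is hypothetical, and the degrees of freedom are inferred from the table shapes rather than stated in the report.

```python
from math import erfc, exp, sqrt


def chi_square_p_value(chi2, df):
    """Upper-tail probability of a chi-square statistic for df = 1 or df = 2."""
    if df == 1:
        # A chi-square with 1 df is a squared standard normal deviate.
        return erfc(sqrt(chi2 / 2))
    if df == 2:
        # A chi-square with 2 df is exponential with mean 2.
        return exp(-chi2 / 2)
    raise ValueError("this sketch handles only 1 or 2 degrees of freedom")


# (description, reported chi-square, assumed df, reported upper bound on p)
reported = [
    ("background, extreme categories", 6.23, 1, 0.02),
    ("background, all three categories", 6.25, 2, 0.05),
    ("chance, extreme categories", 2.61, 1, 0.20),
    ("chance, all three categories", 2.92, 2, 0.30),
    ("combined, extreme categories", 4.87, 1, 0.05),
    ("combined, all three categories", 6.61, 2, 0.05),
]

for description, chi2, df, bound in reported:
    p = chi_square_p_value(chi2, df)
    print(f"{description}: p = {p:.3f} (reported p < {bound})")
```

Each recomputed probability falls below the bound given in the text, which suggests the reported levels are internally consistent under the assumed degrees of freedom.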


Discussion 


The data of this study fail to support Yaryan 
and Festinger’s contention that efforts spent in 
preparation for an event lead to an increase in 
the subjective probability that the event will 
occur. When subjects’ feelings of preparedness 
were controlled, those who had engaged in ef- 
fortful preparation were no more inclined to 
believe that the event would occur than were 
those for whom preparation had been easy. 

In this study the intensity of the experi- 
menter’s efforts also failed to affect subjective 
probabilities. Even when the experimenter did 
little more than read a magazine, subjects’ post- 
session ratings indicated that they accepted his 
firm assurances that they had been adequately 
prepared. Consequently, subjects perceived no less 
experimenter concern when the experimenter did 
little than when he did much. In either case the 
experimenter’s efforts in their behalf were seen 
as adequate, and subjects had little basis for 
interpreting his behavior as indicating that they 
had or had not been chosen. 

However, subjects who did perceive high
experimenter concern, and who believed the
experimenter knew whether or not they had
been chosen, reported high subjective probabilities.
So also did subjects who felt highly prepared
and who believed that the experimenter
knew whether they had been chosen. This pattern
of findings suggests that it was the subjects’
interpretations of the experimenter’s behavior
which was responsible for Yaryan and Festinger’s
findings. In their investigation all subjects were
told that the experimenter knew who had been
chosen, and subjects were not given firm assurances
that they were adequately prepared. Consequently,
it should have been rather easy for
subjects to conclude that the experimenter’s
strong or weak efforts in their behalf indicated
that they had, or had not, been selected to
take the test.

The dissonance which produced Yaryan and
Festinger’s results probably involved an inconsistency
between the experimenter’s statement
that only 50% of the subjects would take the
test, and her rather considerable efforts in behalf
of the subjects who received the effortful treatment.
People who know whether or not an event
will occur are unlikely to expend efforts in
preparation for the event, or to ask others to
do so, unless it will occur. Yaryan and Festinger’s
effortful treatment undoubtedly provided subjects
with social support for the belief that
they had been chosen to take the test.

An alternative explanation of the correlational
findings of this study would maintain that subjects
who believed they had been chosen to
take the IQ test experienced anxiety which they
subsequently alleviated by perceiving the experimenter
to be highly concerned and by believing
themselves to be well prepared. However, if this
interpretation were correct, relationships between
subjective-probability scores and these two
variables should have been as strong when the
experimenter did not know who would be required
to take the test as they were when he
did know. Since the obtained relationships were
uniformly stronger under the latter condition,
it appears more appropriate to conclude that
subjects’ interpretations of the experimenter’s
efforts affected their subjective-probability
scores. Regardless of which explanation is correct,
the findings of this study provide no
support for Yaryan and Festinger’s conclusions.


PARENTAL ANTECEDENTS OF AUTHORITARIANISM


DONN BYRNE 1
University of Texas 


While a number of investigators have reported substantial relationships be- 
tween authoritarianism and various parental attitudes concerning child-rearing 
practices, interindividual studies utilizing families have been lacking. The 
F Scale and the Traditional Family Ideology Scale were administered to 108 
college students and their parents. It was found that husband and wife pairs 
correlate significantly with one another in their scores on the 2 tests, the 
F Scale scores and the TFI scores of male college students are each a positive 
function of the scores obtained by their fathers on these 2 scales, and there is 
a significant relationship between authoritarianism in mothers and in their 
offspring of both sexes. The low magnitude of the various relationships in- 
dicates that the major portion of the variance in both authoritarianism and 
traditional family ideology is a function of factors other than parental stand- 
ing on these 2 variables. In addition, the greater predictability of the male 
students suggests the possibility of differences in the antecedents of
authoritarianism for the 2 sexes.


The most obvious origins of a system of 
attitudes and beliefs such as authoritarianism 
would be expected to lie in the experiences of an 
individual with parents and others relatively 
early in life. Presumably these influences would 
be of two major varieties (Frenkel-Brunswik & 
Havel, 1953). Of greatest importance would seem 
to be those factors which bring about the de- 
velopment of the underlying characteristics which 
make antidemocratic beliefs acceptable: repres- 
sion, denial of aggressive and sexual impulses, 
and the use of externally oriented problem- 
solving mechanisms like projection and displace- 
ment. Second, there would of necessity be contact 
with the specific ideas and beliefs that form the 
content of authoritarian ideology. 

In the original Berkeley studies (Adorno,
Frenkel-Brunswik, Levinson, & Sanford, 1950), a
number of clinical investigations were conducted
with subjects high and low in prejudice. The
conclusions drawn from these observations led
to a multitude of hypotheses concerning the
differential family orientation and parental
characteristics of individuals who differ in
anti-Semitism, authoritarianism, ethnocentrism, etc.
Briefly, those high on such dimensions reported
a relatively harsh and threatening home discipline,
clearly defined roles of dominance and submission,
early suppression of unacceptable impulses,
an emphasis on the social acceptability of values,
stress on the importance of duty and obligations,
and a dichotomous conception of sex roles.
Subjects low on the prejudice-authoritarianism-ethnocentrism
continuum reported more
or less an opposite type of family structure: less
emphasis on obedience, a freer acceptance of
emotional expression, less anxiety about conformity,
and more tolerance of socially unacceptable
behavior. Theoretical descriptions of the
etiology of authoritarianism have been based
primarily on these observations.

1 The author wishes to thank Jerry Lackey and

Quantitative investigations of parental antecedents
of authoritarianism-equalitarianism have
been generally supportive of the foregoing formulations.
Authoritarian scores have been found
to correlate positively with a scale measuring
Traditional Family Ideology (Conventionalism,
Authoritarian Submission, Exaggerated Masculinity
and Femininity, Extreme Emphasis on
Discipline, and Moralistic Rejection of Impulse
Life) in a series of studies with adult subjects
(Levinson & Huffman, 1955), the frequency with
which non-love-oriented disciplinary techniques
were espoused by 126 mothers as their response
to a series of specific situations involving children
(Hart, 1957), perception of parental punitiveness
by fifth-grade children as expressed in
incomplete sentences (Lyle & Levitt, 1955),
scores on the Authoritarian-Control factor of the
Parental Attitude Research Inventory in a group
of female psychiatric patients and a group of
nurses (Zuckerman & Oltean, 1959), the holding
of restrictive versus permissive child-rearing attitudes
among fathers in a group of military
officers (Block, 1955), and scores on the
Dominant, Possessive (females), and Ignoring
(males) subscales of the USC Parent Attitude
Survey in a group of undergraduates (Kates &
Diab, 1955).




In discussions of the role of parental attitudes 
and behavior in the development of authori- 
tarianism and equalitarianism, a rather consistent 
picture has emerged, based primarily on the fore- 
going investigations. Authoritarian-equalitarian 
traits in parents lead to the establishment of 
characteristic types of family structures and the 
utilization of particular disciplinary techniques. 
Children reared in these different ways develop 
authoritarian or equalitarian ideology as a conse- 
quence, and, on reaching adulthood, continue the 
cycle with respect to their own family structures 
and disciplinary techniques. The major flaw in 
this representation is that the data are based en- 
tirely on intraindividual studies in which scores 
on the F Scale are found to be related to some 
other behavior within a group of subjects. In 
order to be able to generalize the findings from 
such research to the actual development of 
authoritarianism-equalitarianism, it is necessary 
to make two as yet untested assumptions. First, 
husbands and wives are assumed to be similar 
in both authoritarianism and child-rearing prac- 
tices. Second, parental authoritarianism and 
child-rearing practices are assumed to have a 
direct relationship with both authoritarianism 
and child-rearing practices in their offspring. 

The present investigation is designed to test 
both of the foregoing propositions. Specifically, 
it is hypothesized that there is a positive rela- 
tionship between authoritarianism of husbands 
and of their wives, there is a positive relation- 
ship between the acceptance of traditional family 
ideology by husbands and by their wives, parental 
traditional family ideology is positively related 
to authoritarianism and the acceptance of traditional
family ideology by their offspring, and
parental authoritarianism is positively related to 
authoritarianism and the acceptance of traditional 
family ideology by their offspring. 


PROCEDURE 


The subjects consisted of 108 students (49 males 
and 59 females) enrolled in the introductory psy- 
chology course at the University of Texas and their 
parents, thus making a total of 324 subjects. The 
students were tested in small groups as part of a 
course requirement for participation as research sub- 
jects; their parents were contacted by mail. From 
the original pool of 165 students, 143 were selected 
as research subjects. The only criteria for selection 
were that they had not been adopted, that both 
parents were living, and that their parents were 
neither separated nor divorced. Of the 143 families 
fitting these requirements, 35 were ultimately dis- 
carded because of the failure of one or both parents 
to participate or because of errors in responding to 
the directions for completing the tests. The students
whose parents participated in the investigation did
not differ significantly in their scores on either
test from the students whose parents did not
participate.

The measure of authoritarianism was a 32-item
scale,2 partially balanced for acquiescent response set.
The Traditional Family Ideology (TFI) Scale was 
the 35-item instrument developed by Levinson and 
Huffman (1955). 


RESULTS 


As in previous investigations, both the F Scale 
and the TFI Scale were found to have adequate 
split-half reliability for research purposes. On 
the student sample, a corrected (Brown- 
Spearman formula) coefficient of internal con- 
sistency of .81 was found for each scale. In their 
original article, Levinson and Huffman (1955) 
reported an average correlation of .67 between 
the two tests. In the present investigation, the 
two scales correlated .62 (p<.01) among the 
students, .61 (p<.01) among the fathers, and 
.61 (p<.01) among the mothers. Thus, the
relatively substantial intraindividual relationship 
between authoritarianism and traditional family 
ideology was again supported. In this respect, 
the present sample would appear to be compa- 
rable to groups previously studied. 
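
The corrected reliability of .81 mentioned above comes from stepping up the correlation between two half-tests to the reliability of the full-length test. A minimal sketch of that correction follows, assuming the usual Spearman-Brown formula for doubled test length; the uncorrected half-test correlation is not given in the report, so the value printed here is only what the formula implies, and the function names are hypothetical.

```python
def spearman_brown(r_half):
    """Step a half-test correlation up to the reliability of the full-length test."""
    return 2 * r_half / (1 + r_half)


def implied_half_test_r(r_full):
    """Invert the correction: the half-test correlation implied by a corrected value."""
    return r_full / (2 - r_full)


r_corrected = 0.81  # corrected coefficient reported for each scale

# Half-test correlation implied by the corrected value (not reported in the text).
r_half = implied_half_test_r(r_corrected)
print(f"implied half-test correlation: {r_half:.3f}")

# Applying the correction to that value recovers the reported coefficient.
print(f"corrected reliability: {spearman_brown(r_half):.2f}")
```

On this reading the two halves of each scale would have correlated at about .68 before correction.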

The first two hypotheses dealt with the simi- 
larity of married couples on the two variables. 
The correlation for husband-wife pairs was ob- 
tained for each scale. The obtained Pearson 
product-moment r’s were .30 (p<.01) for the
F Scale and .26 (p<.01) for the TFI Scale. 
Both hypotheses were confirmed in that a greater 
than chance relationship exists between marital 
partners on these variables, but neither relation- 
ship is of substantial magnitude. 
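
Both halves of this statement, that the husband-wife correlations are significant yet modest, can be illustrated with the standard t test for a Pearson correlation and the squared correlation as the proportion of shared variance. The sketch below assumes N = 108 couples, taken from the Procedure section, and an approximate two-tailed .01 critical value of t of about 2.62 for 106 degrees of freedom; the function name is hypothetical.

```python
from math import sqrt


def t_from_r(r, n):
    """t statistic for testing a Pearson r against zero, with n - 2 degrees of freedom."""
    return r * sqrt(n - 2) / sqrt(1 - r * r)


n_couples = 108              # husband-wife pairs in the sample
approx_critical_t_01 = 2.62  # approximate two-tailed .01 value for 106 df (assumed)

for scale, r in [("F Scale", 0.30), ("TFI Scale", 0.26)]:
    t = t_from_r(r, n_couples)
    shared_variance = r * r
    print(
        f"{scale}: r = {r:.2f}, t = {t:.2f}, "
        f"exceeds critical value: {t > approx_critical_t_01}, "
        f"shared variance = {shared_variance:.1%}"
    )
```

Each correlation clears the .01 criterion, but less than a tenth of the variance in one spouse's score is shared with the other's.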

The second two hypotheses concerned the rela- 
tionships between parental scores on the two 
variables and those of their offspring. The 16 
correlations are shown in Table 1. It can be 
seen that the hypothesized positive relationships 
were confirmed for the father-son combinations. 
The F Scale scores and the TFI Scale scores
of male college students are each a positive
function of the scores obtained by their fathers
on these two scales. The only other positive
finding is that of a significant relationship
between authoritarianism in mothers and in their
offspring of both sexes.


2The items were 1, 2, 4, 6, 8, 9, 12, 13, 16, 18, 
19, 21, 23, 25, 26, 27, 29, 31, 34, 35, 37, 39, 41, and 
42 from Adorno, Frenkel-Brunswik, Levinson, and 
Sanford (1950, pp. 255-257, Table 7); 66 and 67 
from Adorno et al. (pp. 226-227, Table 1); 41 from 
Adorno et al. (pp. 248-250, Table 4); and 5, 8, 
[illegible], and 16 from Christie, Havel, and Seidenberg 
(1958). 
DISCUSSION 


As hypothesized, husbands and wives tend to 
resemble one another with respect to authoritarianism 
and traditional family ideology. As with 
other findings of husband-wife similarities (for 
example, Byrne & Blaylock, 1963; Newcomb & 
Svehla, 1937; Schooley, 1936), the relationships 
are of modest magnitude. Among other implica- 
tions of this finding, it should be noted that the 
rather simple picture which may be drawn to 
contrast authoritarian and equalitarian home 
atmospheres is probably considerably less straight- 
forward than has hitherto been suggested. For 
example, in the present sample, there were 14 
families in which the father and mother were on 
opposite extremes of a trichotomized F Scale 
distribution. The postulated direct influence of 
parental authoritarianism and its correlates on 
the ideology of offspring would quite possibly 
be affected by the magnitude and direction of 
differences between the parents, dominance-sub- 
mission patterns of the husband and wife, differ- 
ential amounts of contact between father and 
child versus between mother and child, differ- 
ential identification with the two parents, etc. 
It would appear that future work on the parental 
antecedents of authoritarianism will need to in- 
clude additional variables of this variety. 
As a way of exploring the effects of parental 
similarity in authoritarianism on the authoritarianism 
of their offspring, the families in the 
present sample were divided in terms of low, 
medium, and high for each parent as shown in 
Table 2. Within each cell, the number of offspring 
with low, medium, and high F Scale 
scores is shown along with the mean scores of 
those in each cell. Even though this analysis of 
the data represents an after-the-fact approach, 
some interesting findings emerge. For example, 
low authoritarianism in the offspring is more likely 
if at least one of the parents is low F than 
if neither parent is low F (χ² = 14.00, n = 2, 
p [value illegible]). This tendency holds whether it is the 
same-sexed parent (χ² = 8.68, n = 2, p < .02) 
or the opposite-sexed parent (χ² = 6.98, n = 2, 
p [value illegible]) who is low in authoritarianism. It is 
possible that the presence of one equalitarian 
parent in the home ameliorates the effects of 
the restrictiveness, punitiveness, etc., of 
the authoritarian parent. Or, one might speculate 
about the special characteristics of the medium 
or high F individual who is attracted to a low F 
individual as a marital partner. An additional 
finding is that the presence of a high F parent 
significantly affects authoritarianism in the offspring 
only if it is the same-sexed parent who is 
high (χ² = 11.07, n = 2, p < .01). Thus, the 
TABLE 1 

CORRELATIONS BETWEEN PARENTS AND OFFSPRING 
IN AUTHORITARIANISM AND TRADITIONAL 
FAMILY IDEOLOGY 

Male offspring        F of son                  TFI of son 

F of father           .38**                     .30* 
TFI of father         .33 [marker illegible]    [illegible] 
F of mother           .30*                      .25 
TFI of mother         .14                       .22 

Female offspring      F of daughter             TFI of daughter 

F of father           .13                       -.08 
TFI of father         .10                       .03 
F of mother           [illegible]               .22 
TFI of mother         .14                       [illegible] 

*p < .05. 
**p < .01. 


development of a high F offspring is most likely 
to occur in families in which neither parent is 
low F and the same-sexed parent is high F; in the 
34 families fitting this pattern there were 20 high 
F offspring and 5 low F offspring. The develop- 
ment of a low F offspring is most likely to occur 
in families in which at least one parent is low F 
and the same-sexed parent is not high F; in the 
48 families of this description there were 21 low 
F offspring and 8 who were high F. It may prove 
fruitful in subsequent research to explore the 
factors responsible for the initial attraction of 
husbands and wives who differ in these various 
ways with respect to authoritarianism and to 
investigate the differing parent-child interactions 
in these families. 
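
The probability levels attached to the chi-square values in this discussion can be checked with a short calculation. The sketch below assumes that the "n = 2" printed with each chi-square denotes two degrees of freedom, which the article does not state outright; for two degrees of freedom the upper-tail probability has the closed form exp(-x/2), so no statistical library is needed. The dictionary labels are descriptive shorthand, not the authors' wording.

```python
import math


def chi2_upper_tail_df2(x: float) -> float:
    """Upper-tail probability of a chi-square variate with 2 degrees of freedom."""
    return math.exp(-x / 2.0)


# Chi-square values as printed in the discussion (df = 2 is an assumption).
reported = {
    "at least one parent low F": 14.00,
    "same-sexed parent low F": 8.68,
    "opposite-sexed parent low F": 6.98,
    "same-sexed parent high F": 11.07,
}

p_values = {label: chi2_upper_tail_df2(x) for label, x in reported.items()}
for label, p in p_values.items():
    print(f"{label}: p = {p:.4f}")
```

On this reading, the values 8.68 and 11.07 fall beyond the .02 and .01 levels printed with them.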

The most disconcerting finding is that it is 
considerably easier to find consistent relationships 
between authoritarianism and child-rearing vari- 
ables in intraindividual than in interindividual in- 
vestigations. Correlations are found in the .60s 
and .70s between the F Scale and the TFI Scale 
administered to various groups of subjects. When, 
however, the TFI is administered to the parent 
and the F Scale to the offspring, the only sig- 
nificant relationship found was an r of .33 be- 
tween fathers’ TFI score and authoritarianism of 
sons. In terms of predicting the F Scale scores of 
college students from knowledge about their 
parents, the most useful information to have is 
the F Scale scores of the parents. If both father 
and mother are considered, a multiple r of .40 
for male offspring and .33 for female offspring is 
found. Obviously, a great deal of the variance 
remains to be accounted for. Perhaps other 
antecedents of authoritarianism should be sought, 
as, for example, in the characteristics of other 
family members besides parents, in schoolroom 
experiences (Levitt, 1955), and in other contacts 
with authoritarian and equalitarian ideology 
outside of the home (Christie, 1952). It should 
perhaps be noted that other investigators have 
reported correlations of low magnitude and also 
insignificant correlations between parental 
child-rearing attitudes and prejudice in children (for 
example, Harris, Gough, & Martin, 1950). 
Similarly, Mosher and Scodel (1960) found a 
correlation of .32 between the ethnocentrism 
scores of mothers and prejudice scores of their 
children but no relationship between authoritarian 
child-rearing practices of mothers and their 
children’s prejudice. They speculated that the latter 
finding was a function of the ages (11-16) 
of their subjects. They hypothesized that positive 
relationships would be found in college-age 
subjects because the cumulative frustration 
of such a home would eventually lead to the 
focusing of resentment onto outgroups. The 
present findings do not provide much encouragement 
for that proposition. 

TABLE 2 
AUTHORITARIANISM IN OFFSPRING AS A FUNCTION OF MOTHER-FATHER SIMILARITY 
IN AUTHORITARIANISM 

                                     F of mother 
                Low                  Medium               High 
F of father     L   M   H   Mean     L   M   H   Mean     L   M   H   Mean 

Male offspring (F of son) 
High            1   0   0   104.00   0   5   6   125.45   1   ?   ?   ? 
Medium          3   1   1   102.60   ?   1   2   121.50   [illegible] 
Low             2   4   1   113.71   3   1   0    97.25   1   2   0   106.00 

Female offspring (F of daughter) 
High            3   ?   ?   ?        [illegible]          3   ?   ?   ? 
Medium          3   1   2   114.17   2   1   1   110.00   1   3   5   ? 
Low             4   5   1   107.70   2   4   2   116.00   2   2   1   115.[?] 

An unexpected finding was that of sex differences. 
Both variables yielded more parent-child 
relationships for the male offspring than for 
the females. It is possible that this finding 
reflects genuine sex differences in the development 
of authoritarian ideology. For example, 
there is some evidence that suggests differences 
in the importance of hostility as a component of 
authoritarianism. Irvine (1957) reported a correlation 
of .48 between the F Scale and a hostility 
measure for male college students. Two 
other investigations reporting positive relationships 
between authoritarianism and hostility 
(Siegel, 1956; Singer & Feshbach, 1959) used 
only male subjects. The suggestion would be that 
males are more likely to be the target of frustrat- 
ing, non-love-oriented disciplinary practices and 
to direct the resulting unconscious hostility onto 
outgroups and those who violate accepted mores. 
It is always possible, of course, that the finding 
of sex differences is in part a function of the 
restriction of the sample to a college population. 
Plans are under way for a repetition of the 
investigation with a less restricted group in order 
to test this possibility. 
OPERANT CONDITIONING WITH SCHIZOPHRENICS COMPARING VERBAL 
REINFORCERS VERSUS PSYCHOANALYTIC INTERPRETATIONS: 
DIFFERENTIAL EXTINCTION EFFECTS1 

EDWIN O. TIMMONS2 
Gulfport Veterans Administration Hospital, Mississippi 

CHARLES D. NOBLIN,3 HENRY E. ADAMS, AND JOEL R. BUTLER 
Louisiana State University 


Verbal reinforcers (mild, affirmatory words) were pitted against 
psychoanalytically derived interpretations to compare their efficacy in raising the 
frequency of personal pronouns in a standard verbal conditioning situation. 
Ss were hospitalized male schizophrenics. Verbal reinforcers produced the 
steepest acquisition curve, which was followed by a pronounced drop in 
frequency of the selected class in extinction. Interpretations raised the frequency 
of the selected class less rapidly, but the number of responses in extinction 
was significantly larger than the number given in extinction by the 
reinforcement-group Ss. Half of each treatment group was extinguished by an aide back 
on Ss’ own ward and half by Es in the experimental room. The expected 
decrement in responding of the aide-extinguished group failed to appear. 


1This paper was presented to Division 12 at the 
Annual Convention of the American Psychological 
Association, New York City, 1961. 

2Now at Louisiana State University. 

3Now at the University of North Carolina, 
Greensboro. 


Interpretations of a psychoanalytic nature have 
been incorporated into many therapeutic approaches 
(Brammer & Shostrom, 1960). Arguments 
have raged as to the purpose and efficacy 
of such interpretations, but few would question 
their pronounced effect on verbal interaction in 
therapy. Investigations of the phenomenon have 
only recently begun to appear. One study 
(Kanfer, Phillips, Matarazzo, & Saslow, 1960) 
suggested that interpretations produce an immediate 
reduction in length of speech with a 
subsequent elevation after interpretations have 
been terminated. Others (Adams, Butler, & 
Noblin, 1962; Adams, Noblin, Butler, & Tim- 
mons, 1962) demonstrated that psychoanalytic- 
type interpretations serve as reinforcers in a 
standard verbal conditioning paradigm. 

Many studies have indicated that conditioned 
verbal responses show a marked decline when 
the stimulus compound is changed. For instance, 
decrement in response strength is usually marked 
when the experimenter or the subject leaves the 
experimental room, and some have suggested 
that verbal responses initiated and reinforced in 
therapy virtually disappear when the patient 
returns to his ward. 

The present study tested the comparative ef- 
fects of psychoanalytically derived interpretations 
versus those of verbal reinforcers in raising and 
maintaining the level of a selected class of verbal 
behavior. Further, performance in extinction for 
both treatment groups was compared in two 
settings: half of each group was extinguished by 
the experimenter, and half returned to the ward 
to be extinguished by a psychiatric aide.4 

METHOD 
Subjects 


The subjects were 36 hospitalized male schizo- 
phrenic patients of average or higher intelligence as 
measured by the Shipley-Hartford test. All subjects 
were on the intake ward of the hospital; patients 
with organic diagnoses were excluded. Assignment to 
conditions was accomplished by randomization as a 
control for unmeasured or extraneous variables. Some 
of the subjects were being seen in orientation-type 
counseling groups, but none were being seen in any 
form of intensive therapy. 


Procedure 


Twelve subjects were randomly assigned to each 
of three treatment groups: those who would receive 
mild, affirmatory words from the experimenter following 
correct responses (reinforcement group); 
those who would be read an interpretation following 
correct responses (interpretation group); and those 
whose verbalizations received no response from the 
experimenter (control group). 

The first two authors served as experimenters, each 
taking half of each of the three treatment groups. 
Each subject, seen individually, was seated in a 
chair in a private office with the experimenter sitting 
behind him to minimize extraneous reinforcements. 
The experimental task was a modified Taffel procedure 
requiring the subjects to select one of two 
pronouns printed at the top of a 5 × 8 inch card 
in order to complete the sentence fragment printed 
at the bottom. One pronoun on each card was a 
first-person pronoun (I or we), and the other was 
a third-person pronoun (he, she, or they). Half 
the first-person pronouns were on the left side of 
the card and half on the right side to counter 
position preference; randomization was accomplished 
within blocks of 12 cards. The cards were handed 
to the subject one at a time from over his shoulder, 
then taken back as soon as he responded. 

4The authors wish to express sincere thanks to 
C. J. Landrum, nursing assistant, who served as the 
third experimenter in this study. 
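
The counterbalancing of pronoun position can be sketched as a block-randomized schedule. This is an illustration only: it assumes that each block of 12 cards held exactly 6 left and 6 right placements, whereas the article says only that half the first-person pronouns were on each side and that randomization was done within blocks of 12. The function name and the seed are hypothetical.

```python
import random
from typing import List, Optional


def position_schedule(n_trials: int, block_size: int = 12,
                      seed: Optional[int] = None) -> List[str]:
    """Left/right position of the first-person pronoun, balanced within each block."""
    if n_trials % block_size != 0 or block_size % 2 != 0:
        raise ValueError("n_trials must be a multiple of an even block_size")
    rng = random.Random(seed)
    schedule: List[str] = []
    for _ in range(n_trials // block_size):
        # Assumed: an even split inside every block, shuffled independently.
        block = ["left"] * (block_size // 2) + ["right"] * (block_size // 2)
        rng.shuffle(block)
        schedule.extend(block)
    return schedule


# 24 operant + 60 treatment + 36 extinction trials, as described in the Procedure.
schedule = position_schedule(24 + 60 + 36, seed=1)
print(schedule[:12])
```

Balancing within blocks, rather than over the whole session, keeps position from being confounded with phase when responses are later plotted in blocks of 12.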

On the first 24 trials, the operant level for first-person 
pronouns was established for each subject. 
For the next 60 trials, sentences containing first-person 
pronouns received either verbal reinforcers 
(“unh huh,” “fine,” “OK,” etc.), or interpretations, 
depending on the treatment condition to which the 
subject was assigned. Controls received no verbal 
treatment. Previous research (Noblin, 1962; Turon[...], 
Noblin, & Timmons, 1962) has demonstrated that t[...] 
differences obtain when third-person pronouns are 
used as the correct response rather than first-person 
pronouns. 

Interpretations were modified from psychoanalytic 
texts and records and were designed specifically for 
each sentence fragment.5 Each interpretation was 
typed on the back of the stimulus card bearing the 
sentence to which it referred. For example: If a 
subject in the interpretation group chose “I” rather 
than “He” to complete “. . . mowed the grass as if 
it were people,” the experimenter took back the 
card and then read from the reverse side, “You have 
considerable difficulty expressing your hostility 
directly.” If a subject selected the first-person pronoun 
rather than “You” to complete “. . . grew much too 
fast,” he was told, “Your dependency needs have 
never been fulfilled.” If third-person pronouns were 
chosen, no response was given. 

The second experimental variation consisted of 
randomly splitting the reinforcement, interpretation, 
and control groups for the extinction phase. The subjects 
in one-half of each group were returned to their 
ward immediately after the acquisition phase; there 
the psychiatric aide, who sat behind each subject 
as the experimenter had, collected the 36 extinction 
responses. The remaining half of each experimental 
group was extinguished in the experimental room 
by the experimenter who had conditioned him. 

After extinction, every subject was asked the 
following questions to assess the degree to which he 
could verbalize the experimental concept: 


1. Now, what do you think this experiment was 
all about; what was the general idea? [If “Don’t 
know”] Take a guess. 

2. Did you notice my doing anything in 
particular? [Answer] Why was I doing that? 

3. Did you notice any change in what you were 
saying from the first of the session to the f[...]? 
[Answer] How did you change? [Answer] Why did 
you change? 


The answers to these questions were recorded on 
the back of each subject’s record sheet. 


5A complete list of the stimulus cards and 
interpretations is available at no charge from the authors. 

FIG. 1. Mean percentage of first-person pronouns. 


RESULTS 


Figure 1 represents the mean proportion of 
first-person pronouns for all subjects in each of 
the three conditions. Responses were recorded and 
are plotted in blocks of 12. The mean number 
of first-person pronouns selected during operant 
was almost identical for the three groups; a 
Kruskal-Wallis analysis showed the differences 
to be nonsignificant. 

The figure shows that subjects in the reinforcement 
group displayed the typical increase in 
frequency of the dependent variable during 
acquisition and the expected regular decline 
during extinction. However, the curve depicting 
the performance of the interpretation group 
shows an atypical pattern: a slow, regular increase 
during the acquisition phase, then an 
abrupt rise as extinction begins. The control-group 
subjects’ performance was somewhat 
erratic, but displayed no consistent trends. 

Comparisons of the mean level of performance 
of the three groups against each other during 
operant, treatment, and extinction are afforded 
by the Kruskal-Wallis analysis of variance for 
independent groups. As previously mentioned, the 
mean operant levels were not significantly different. 
When the mean levels of the three 
acquisition phases were compared, a difference 
significant beyond the .01 level was found. Subsequent 
analysis using Mann-Whitney nonparametric 
U tests for independent groups showed 
the reinforcement group significantly higher than 
the interpretation group (p < .05) and the interpretation 
group not quite significantly higher than 
the control group (p < .10); the reinforcement 
group was higher than the control group at a 
probability level beyond .01 during the acquisition 
phase. Comparison of the final blocks of 
extinction trials showed a significant overall effect 
(p < .01) which U tests proved to be due to 
the interpretation group being significantly higher 
than either the reinforcement or control groups 
(p < .01); the latter groups were not significantly 
different. 
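
The Mann-Whitney comparisons for independent groups rest on a rank statistic, commonly written U, that counts how often a score in one group exceeds a score in the other. The sketch below computes it by direct pair counting. The two score lists are hypothetical and are not the study's data; only the counting rule is standard.

```python
from typing import Sequence, Tuple


def mann_whitney_u(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Return (U_x, U_y): cross-group pairs won by each sample, ties counting one half."""
    u_x = 0.0
    for a in x:
        for b in y:
            if a > b:
                u_x += 1.0
            elif a == b:
                u_x += 0.5
    u_y = len(x) * len(y) - u_x
    return u_x, u_y


# Hypothetical percentages of first-person pronouns for two groups of six subjects.
reinforcement = [75, 68, 72, 80, 66, 70]
interpretation = [60, 64, 58, 71, 62, 65]

u_r, u_i = mann_whitney_u(reinforcement, interpretation)
print(min(u_r, u_i))  # the smaller U is the value referred to significance tables
```

The two counts always sum to the product of the group sizes, so a small value for one group means that group's scores lie mostly below the other's.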

As a major focus of this study was on dif- 
ferential extinction performance as well as acqui- 
sition performance, Wilcoxon’s matched-pair rank 
T tests were run to compare the relative level 
of performance of each group to itself during 
operant, acquisition, and extinction phases. Summarized 
briefly, the reinforcement group went 
up significantly from operant to acquisition 
(p < .01), then dropped significantly during extinction 
(p < .05). The interpretation group rose 
less markedly, though still significantly, from 
operant to acquisition (p < .05), then rose 
again significantly from acquisition to extinction 
(p < .05). There was no significant change for 
the control group. 

To serve as a check on the reality of the 
increase of the interpretation group in extinction, 
the final block of acquisition trials was compared 
to the mean of the extinction trials; the 
comparison proved significant (p < .05). 


TABLE 1 
MEAN PERCENTAGE FIRST-PERSON PRONOUNS GIVEN IN EACH OF THREE BLOCKS OF 
12 EXTINCTION TRIALS BY SUBJECTS IN EACH TREATMENT CONDITION 

               Reinforcement           Interpretation          Control 
Blocks of 
trials         Experimenters   Aide    Experimenters   Aide    Experimenters   Aide 

First          69              59      61              64      47              55 
Second         56              57      69              63      54              54 
Third          49              54      65              64      51              53 
M              58.0            56.7    65.0            63.7    50.7            54.0 

Note. Half of the subjects were extinguished by the experimenters, half by the psychiatric aide. 


Differential Extinction Effects 


The expected difference between rates of ex- 
tinction in a changed versus constant stimulus 
compound completely failed to emerge. Mann-Whitney 
U tests were run to compare each of the 
three blocks of extinction trials collected by the 
psychiatric aide on the ward to the three blocks 
collected by the two experimenters in the experi- 
mental room. Tests were made in this manner on 
all three treatment groups, and no comparison 
even approached significance. Both the aide and 
the experimenters obtained consistently flat ex- 
tinction gradients from the interpretation-group 
subjects and a steeply dropping gradient from the 
reinforcement-group subjects. This comparison 
can be made directly in Table 1 which presents 
the percentage of first-person pronouns given in 
extinction by the reinforcement, interpretation, 
and control groups. Extinction performance is 
divided into thirds for each group so that the 
slope of the curves can be visualized. 
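
The slopes described above can be read off Table 1 with a few lines of arithmetic. The sketch below takes the block percentages as they appear in the table, recomputes each column mean, and reports the change from the first to the third block of extinction trials. The variable and function names are illustrative only.

```python
from typing import Dict, List, Tuple

# Percentages of first-person pronouns in the three extinction blocks (Table 1).
extinction_pct: Dict[Tuple[str, str], List[int]] = {
    ("reinforcement", "experimenters"): [69, 56, 49],
    ("reinforcement", "aide"): [59, 57, 54],
    ("interpretation", "experimenters"): [61, 69, 65],
    ("interpretation", "aide"): [64, 63, 64],
    ("control", "experimenters"): [47, 54, 51],
    ("control", "aide"): [55, 54, 53],
}


def summarize(blocks: List[int]) -> Tuple[float, int]:
    """Column mean (one decimal) and change from the first to the last block."""
    mean = round(sum(blocks) / len(blocks), 1)
    change = blocks[-1] - blocks[0]
    return mean, change


summary = {cell: summarize(blocks) for cell, blocks in extinction_pct.items()}
for (group, extinguisher), (mean, change) in summary.items():
    print(f"{group:15s} {extinguisher:14s} mean = {mean:5.1f}  change = {change:+d}")
```

The recomputed means agree with the M row of the table, which gives some assurance that the block entries have been read correctly.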

Two subjects verbalized the experimental con- 
cept and were replaced in the experiment; both 
were in the reinforcement group. Most subjects 
reported that they saw the experiment as a 
personality test; a few thought that it somehow 
assessed intelligence or knowledge of grammar. 


Discussion 


A previous study using interpretations versus 
mild, affirmatory words as reinforcers for schizo- 
phrenics (Adams, Noblin, Butler, & Timmons, 
1962) yielded acquisition and extinction patterns 
almost identical to those reported here. Neither 
set of data showed any evidence of extinction 
for the interpretation-group subjects over 36 non- 
reinforced trials, while the equivalent group re- 
ceiving verbal reinforcers showed a pronounced 
extinction effect. Butler (1963), using a free- 


responding therapy-like situation, found no ex- 
tinction in subjects reinforced with interpreta- 
tions during two 1-hour extinction periods. In a 
future experiment the extinction phase will be 
continued as long as necessary to determine 
when, or if, the response rate returns to the 
operant level. 

It is becoming clear that reinforcement words 
and interpretations do not operate on the same 
principles; the reasons for this are not known. 
From questioning subjects after the experiment, 
it has been found that they generally considered 
the experimenter’s mild, affirmatory words as a 
“comforting” or “encouraging” event; interpretations 
were termed “disturbing,” “anxiety provoking,” 
or “too close for comfort.” Perhaps 
interpretations have both positive and negative 
aspects to the subjects: positive in that the 
authority figure is responding in some way to his 
efforts, and negative in that the interpretations 
are rather pointed. A time-honored child-rearing 
principle states that even unpleasant attention 
is preferable to no attention at all. 

Mowrer (1960) has shown that cessation of a 
noxious stimulus (here, the partially noxious 
interpretations) is reinforcing. In this light, the 
consistently obtained spurt in response level 
might possibly be explained. Future experimentation 
will attempt to assess anxiety levels of both 
groups through physiological and other measures 
in order to shed some light on this puzzling 
phenomenon. 

The failure to find the predicted differential 
extinction rate between the aide-extinguished and 
experimenter-extinguished subjects is difficult to 
explain. The subjects in the aide-extinguished 
group responded as though few, if any, important 
cues had been changed, even though both the 
location and the experimenter were different 
for them. 
THE AMERICAN NEGRO: 
A PROBLEM IN EXPECTANCIES1 


HERBERT M. LEFCOURT 


University of Waterloo, Ontario 
GORDON W. LADWIG 
Veterans Administration Hospital, Chillicothe, Ohio 


This study dealt with differences between Negro and white inmates in their 
expectancies that events are internally or externally controlled. 60 white 
and 60 Negro inmates from correctional institutions were compared on 3 scales 
pertinent to the internal-external control dimensions, and on 3 performance 
variables from Rotter’s Level of Aspiration Board task. On all measures Negroes 
revealed greater expectancy of control being external. The results have implications 
for interpreting the frequently observed differences between white and 
Negro intelligence test performance and achievement striving. 


Recent reviews of research pertaining to Negro- 
white differences by Shuey (1958) and Dreger 
and Miller (1960) have noted the reliable finding 
that Negroes perform less adequately than whites 
on intelligence measures. This kind of data has 
resulted in such polemic arguments as that be- 
tween Chein and Garrett (1962) which debates 
hereditarian versus environmentalist interpretations 
and conclusions. Recently, a body of research 
findings has been developing which emphasizes 
the situational and personality variables 
operative in Negroes which produce the apparent 
intellectual inferiority noted in intelligence 
testing. 


1This report is based partially on a doctoral dissertation 
submitted by the senior author to the 
Graduate School, Ohio State University, August 1963. 
The author wishes to express his appreciation 
to [...] Scodel for his guidance in the course of the 
study. [...] Research of the Air Research and Development [...] 

Roen (1960) found Negroes’ intelligence scores 
correlated with certain personality measures. He 
concluded that Negroes 
incorporate intellectually defeating personality traits 
that play a significant role in their ability to score 
on measures of intelligence [p. 150]. 


In a series of studies by Katz (Katz & Benjamin, 
1960; Katz & Cohen, 1962; Katz & Greenbaum, 
1963) Negro students were found to be anxious 
and unproductive in problem-solving situations 
especially when confronted with white partners. 
The present research represents a further ex- 
ploration of Negro-white differences on person- 
ality variables related to achievement behavior 
and its correlate, intelligence test performance. 
Rotter, Liverant, and Seeman (1962) have de- 
scribed an expectancy construct referred to as 
internal-external (I-E) control of reinforcements. 
Internal control refers to a generalized expectancy 
that reinforcements occur as a consequence 
of one’s own actions and are thereby under per- 
sonal control. External control refers to the be- 
lief that reinforcements are unrelated to one’s 
own behaviors and therefore beyond personal 
control. 

This variable may prove important in attempt- 
ing to understand the problem of Negroes’ intelli- 
gence test difficulty. Crandall, Katkovsky, and 
Preston (1962) have found that internal-control 
children show more intellectual interest and bet- 
ter intelligence test performance than external- 
control children. This suggests that a person per- 
ceiving his reinforcements as being externally 
controlled, is less likely to try and succeed. Rose 
(1956), in his condensation of Myrdal’s The 
American Dilemma, noted that 


the ambition of Negro youth is cramped not only by 
. . . Segregation and discrimination but also by the 
low expectation from both white and Negro society. 
.. . And if he is not extraordinary he will not ex- 
pect it of himself and will not really put his shoulder 
to the wheel [p. 218]. 


The Negro may be characterized as a person 
who has a low expectancy that he can control his 
reinforcements. The first hypothesis in this study, 
then, is that Negroes will score higher on external 
control than a comparable group of whites, in 
tests designed to measure the I-E dimension. 
Second, in achievement tasks that demand self- 
evaluation Negroes will perform in a manner that 
can be described as reflecting a greater expect- 
ancy of external control. 


METHOD 


To test these hypotheses 60 Negro and 60 white 
inmates from two correctional institutions were used 
as subjects. There were no significant differences in 
social class, intelligence, age, or reason for incarcera- 
tion between the two groups. For the most part the 
subjects were of lower class origin. Their intelligence 
levels were within the average range with beta IQs 
between 90 and 110 (Kellogg & Morton, 1935). The 
mean age was 21.6 years and the typical crime lead- 
ing to incarceration was car theft. 

Each subject was given the Rotter Level of Aspira- 
tion Board (LOA) using Rotter’s (1942) standard 
instructions. In addition, the subject was instructed 
that he would earn a pack of cigarettes for each 50 
points he had at the conclusion of the task. This was 
done to enlist cooperation and to encourage “re- 
alistic” goal setting. 

Three indices of LOA performance which theoreti- 
cally are related to the I-E dimension were employed: 
number of shifts, number of unusual shifts, and pat- 
terns. The number of shifts (changes in the subject’s 
prediction of his next score) in the LOA relates to 
general stability and self-confidence. High frequencies 
of shifts characterize individuals who do not use 
previous experience to establish consistent estimates 
of their performance. Unusual shifts (up after failure, 
down after success) likewise suggest failure of 
the subject to establish reliable estimates of his skill. 
Unusual shifts tend to indicate dependence on luck 
or magical, externally controlled factors. Of the LOA 
patterns described by Rotter (1945) Patterns 1 and 3 
indicate a stable, ambitious, confident approach while 
the other patterns generally indicate a failure-avoidant, 
defensive approach. In both patterns there are 
an average number of shifts and generally an absence 
of unusual shifts. In Pattern 1, D scores usually 
range from 0.0 to +3.0, while in Pattern 3, the range 
is from +3 to +6.0. Patterns 1 and 3 demonstrate 
the task involvement and realistic self-appraisal 
characteristic of internal controls. Previous research has 
demonstrated the relationship of these latter two 
indices (unusual shifts and patterns) with the I-E 
dimension (Simmons, 1959). 

After the LOA task the subjects were given the 
I-E scale of Rotter et al. (1962) and Dean’s (1961) 
Powerlessness and Normlessness scales. The former 
was developed in the framework of Social Learning 
Theory (Rotter, 1954) to assess the I-E control variable. 
The test consists of 23 forced-choice items with 
6 filler items. Illustrative items are as follows: 


I more strongly believe that: 

6. a. Without the right breaks one cannot be an 
effective leader. 
b. Capable people who fail to become leaders 
have not taken advantage of their opportunities. 

9. a. I have often found that what is going to 
happen will happen. 
b. Trusting to fate has never turned out as 
well for me as making a decision to take a 
definite course of action. 

17. a. As far as world affairs are concerned, most 
of us are the victims of forces we can neither 
understand nor control. 
b. By taking an active part in political and social 
affairs the people can control world 
events. 

23. a. Sometimes I can’t understand how teachers 
arrive at the grades they give. 
b. There is a direct connection between how 
hard I study and the grades I get. 



Dean’s scales were constructed to measure two of 
the five variables that Seeman (1959) has defined as 
components of alienation. Powerlessness, like external 
control, refers to an individual’s expectancy that he 
cannot control his fate. Normlessness refers to the 
high expectancy that socially unapproved behaviors 
are required in order to achieve given goals. The 
Powerlessness measure consists of nine Likert scale 
items. Illustrative items are as follows: 


There is little or nothing I can do towards 
preventing a major “shooting” war. 
We’re so regimented today that there’s not much 
more room for choice even in personal matters. 
Typical of the six Likert scale items in the 
Normlessness measure were: 


Everything is relative, and there just aren’t any 
definite rules to live by. 

The only thing one can be sure of today is that 
he can be sure of nothing. 


The predictions are that Negroes will score higher 
on external control and claim greater powerlessness 
and normlessness than whites. In the LOA task it is 
predicted that Negroes will shift more, make more 
unusual shifts, and will produce fewer Patterns 1 and 
3 than whites. 


RESULTS 


As indicated in Table 1, all predicted differ-
ences in Level of Aspiration Board performance
were obtained. Negroes shifted more frequently
and made more unusual shifts than whites. In a
comparison of the overall patterns of perform-
ance, significantly fewer Negroes showed Pat-
terns 1 and 3, the patterns indicative of internal
control.


TABLE 1
LEVEL OF ASPIRATION BOARD RESULTS

                                   Negroes   Whites     t

Number of shifts
   M                                 9.08     7.00
   SD                                5.02     3.57     2.60*
Proportion of subjects making
   one or more unusual shifts         .72      .48     2.68**
Proportion of subjects making
   two or more unusual shifts      [entries illegible in source]
Proportion of subjects showing
   Patterns 1 and 3                   .25      .53     3.14**

Note.—All tests are two-tailed based on samples of 60 in
each group. [Significance-level key illegible in source.]
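The proportion comparisons in Table 1 can be checked with a short calculation. The sketch below assumes that the printed entries are proportions (.72 versus .48, and .25 versus .53), that each group has 60 subjects, and that the reported statistic is the ordinary pooled two-sample test for a difference between proportions. The report does not name the formula, so the function `two_proportion_z` is an illustration and not the authors' documented procedure.

```python
import math


def two_proportion_z(p1: float, p2: float, n1: int, n2: int) -> float:
    """Pooled two-sample z statistic for a difference between two proportions."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / standard_error


# Proportions read from Table 1; n = 60 per group (stated in the table note).
z_unusual = two_proportion_z(0.72, 0.48, 60, 60)   # one or more unusual shifts
z_patterns = two_proportion_z(0.53, 0.25, 60, 60)  # Patterns 1 and 3, whites minus Negroes

print(f"one or more unusual shifts: {z_unusual:.2f}")
print(f"Patterns 1 and 3:           {z_patterns:.2f}")
```

Under these assumptions the two statistics agree with the tabled values of 2.68 and 3.14, which supports reading the garbled entries as the proportions given above.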

In the comparison of number of shifts Negroes
were more variable than whites (t between stand-
ard deviations = 2.57, p < .02). However, since
the Ns were equal a parametric test of differences
between the means was justified (Boneau, 1960).

Subjects making no unusual shifts were com-
pared with those making one or more unusual
shifts. A second comparison of those making zero
to one, versus those making two or more unusual
shifts, was also made to minimize the possibility
that subjects were included in the “unusual-
shift” group who made an occasional unusual
shift for justifiable reasons, i.e., when a previous
failure was just short of the stated goal.

TABLE 2
COMPARISONS ON SCALE MEASURES

Measure            Negroes   Whites     t

I-E
   M                 8.97     7.87
   SD              [entries illegible in source]
Powerlessness
   M                17.30    14.63
   SD              [entries illegible in source]
Normlessness
   M                12.60     9.37
   SD              [entries illegible in source]

Note.—All tests are two-tailed based on samples of 60 in
each group.
  *p = .05.
 **p = .01.
***p = .001.


Table 2 indicates that Negroes scored signifi-
cantly higher on the three attitude measures re-
lated to the I-E control dimension. Since the
largest difference between Negroes and whites was 
obtained on the Normlessness scale it might be 
plausible to suggest that Negroes are even more 
dubious about the avenues open to them than 
about their own adequacy to achieve. 

A third comparison was undertaken between 
the Negro and white inmate samples and norms 
reported by Dean which were based on a strati- 
fied sample of 384 male subjects drawn from the 
Columbus, Ohio, area. It was expected that re- 
formatory inmates, in general, would score higher 
on powerlessness and normlessness than a non- 
institutionalized population, and that Negroes, in 
particular, would be more markedly deviant from 
such norms. 

As Table 3 indicates, Negro inmates’ scores are
considerably higher in an external-control direc-
tion than Dean’s normative group. A similar com-
parison with white inmates showed no significant
difference from the norm group in powerlessness
(t = .71, ns) although on normlessness white in-
mates also scored higher than Dean’s norm group
(t = 2.48, p < .05).


TABLE 3
COMPARISON OF NEGRO INMATES WITH DEAN’S
NORMS FOR THE POWERLESSNESS AND
NORMLESSNESS SCALES

                   Dean’s      Negro
                   sample      inmates      t
                  (N = 384)   (N = 60)

Powerlessness     [entries illegible in source]
Normlessness      [entries illegible in source]

*p = .001.

In brief, reformatory inmates seem to have 
higher expectancies than the general population 
that socially unapproved behaviors are required 
to achieve valued goals. Negro inmates appear 
even more pessimistic about socially acceptable 
means than white inmates. However, only Negro 
inmates appear radically different from the gen- 
eral population in regard to the powerlessness 
variable which is more directly related to the I-E 
control dimension than is normlessness. 


Discussion 


Literature concerning the American Negro has 
often focused on their feelings of low self-esteem 
and pessimism. This investigation has been di- 
rected toward a study of these problems making 
use of the I-E control variable. Negroes have 
been found to have low expectancies for internal 
control of reinforcements both in attitude and 
behavior measures. Although research is required 
to clarify the development of the expectancy of 
external control, it would seem likely that segre- 
gation and discrimination facilitate the growth of 
an external orientation. They deny positive rein- 
forcements to Negroes despite individual achieve- 
ments, thus providing the kind of experience 
necessary for the development and maintenance 
of generalized expectancies of external control.
As Rose has suggested, such expectancies should 
result in a minimum of effort to achieve and a 
lack of interest in achievement-related pursuits. 
Crandall’s findings, noted previously, tend to sup-
port this thesis. In view of the findings in this
study it may be hypothesized that Negroes’ 
poorer performance on intelligence tests reflects a 
withdrawal from middle-class achievement goals.
The externally oriented Negro may well see these
goals as being unobtainable through his own ef- 
forts. It is possible that in the current Negro 
mass movement for civil rights, there will be 
greater opportunity for Negroes to witness con- 
crete changes deriving from their social action. 
If so, the differences between Negroes and whites
obtained in this study may not be as extreme in
the not too distant future.


BODY-BOUNDARY SENSATIONS AND ACQUIESCENCE 1


SEYMOUR FISHER 
Upstate Medical Center, State University of New York, Syracuse 


The hypothesis was tested that the relatively more aware an individual is of 
sensations from the exterior sectors of his body (viz., skin and muscle) than 
from its interior regions (viz., heart and stomach) the more likely he is to be 
characterized by an autonomous, self-assertive orientation. 5-min. samples of 
reported exterior and interior body sensations were obtained from 54 female 
and 51 male college students. The Bass Social Acquiescence Scale was used to 
measure degree of self-assertiveness. The data significantly supported the 
hypothesis in both samples. Support was also provided for the more general 
proposition that the manner in which body sensory experiences are organized is
related to personality and trait patterns.


There is evidence that the more clearly an 
individual perceives his body differentiated from 
its environs the more likely he is to behave in a 
self-assertive manner. A series of studies has 
demonstrated that a measure of boundary defi- 
niteness or articulation derived from ink-blot 
responses (viz., the barrier score) is predictive 
of such autonomy-related variables as achieve- 
ment level, need for task completion, and sug- 
gestibility (Fisher & Cleveland, 1958). The 
person who has a clear-cut concept of the 
boundaries of his body is characterized by a 
sense of individuality and independence. In the 
course of analyzing the actual body experiences 
related to degree of boundary definiteness it was 
learned that they can be represented in terms 
of differences in the prominence of sensations 
from the boundary and interior regions of the 
body. Body sensation reports from persons clas- 
sified with relation to boundary attributes indi- 
cated that the person with definite boundaries 
is relatively more aware of sensations from the 
boundary regions of his body (skin and muscles) 
than from its interior (stomach and heart). The 
obverse held true for the individual with vague 
boundaries (Fisher & Fisher, 1964). In other 
words, degree of boundary articulation appears 
to be literally duplicated in the relative promi- 
nence of sensations from the body regions which 
constitute the exterior or bounding sheath of the 
body. There is a noteworthy congruence in this
relationship because at the same time that
boundary definiteness has been shown to be pre-
dictive of a need to set and attain one’s own
goals there has also been speculation that skin
and muscle are body areas particularly involved
in actively contacting and responding volitionally
to the world (Fisher & Cleveland, 1958).


1 This study was partially supported by Grant
5761 from the United States Public Health Service. 


The question arises whether it is possible 
to predict directly from a knowledge of an 
individual’s body experiences the degree to 
which he is oriented to volitional, self-directive 
behavior. It was the objective of the present 
study to test the hypothesis that the more
aware a person is of his exterior body layers
(viz., skin and muscle) as compared to his body
interior (viz., heart and stomach) the more
likely he is to behave autonomously and self-
assertively.


PROCEDURE


The method employed for measuring the subject’s 
relative awareness of skin and muscle as compared 
to heart and stomach is one already tested in pre- 
vious studies (Fisher & Cleveland, 1958; Fisher & 
Fisher, 1964). It involves giving the subject a 
sheet of paper on which are listed the names of four 
body sectors or organs in the following order: skin, 
stomach, muscle, heart. He is told that when a
signal is given he is to focus his attention upon
his body. Then, each time a “prominent sensation”
occurs in any of the four body areas listed on the
sheet he is to place a check next to the appropriate
designation. The experimenter signaled the beginning
of the self-observation period by saying “start”; 
and after a 5-minute period called “stop.” Scoring 
of the body-sensation reports is based on summing 
the number of skin and muscle sensations and sub- 
tracting the sum of stomach and heart perceptions. 
The larger the score the greater is considered the 
sensory predominance of body exterior over interior. 
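
The scoring rule just described reduces to a single subtraction. The sketch below is only an illustration: the function name `exterior_interior_score` and the example tallies are hypothetical and are not taken from the study's data.

```python
def exterior_interior_score(skin: int, stomach: int, muscle: int, heart: int) -> int:
    """Sum of exterior (skin, muscle) checks minus sum of interior (stomach, heart) checks."""
    return (skin + muscle) - (stomach + heart)


# Hypothetical tallies from one 5-minute self-observation period,
# listed in the order in which the areas appear on the sheet.
example_score = exterior_interior_score(skin=7, stomach=2, muscle=5, heart=4)
print(example_score)
```

A positive score indicates that exterior sensations predominated, and a negative score indicates that interior sensations predominated.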

The Bass Social Acquiescence Scale (Bass, 1956) 
was chosen to ascertain degree of independence and 
autonomy. This scale comprises 56 proverbs
covering a range of topics (e.g., Love is the greatest
of arts; Obedience is the mother of success; Wild
colts make good horses). The subject indicates his
reaction to each by checking one of three categories:
agreement, disagreement, can’t decide. His score is 
equal to the number of statements with which he 
expresses agreement. Bass has demonstrated that the
greater the number of statements accepted by an
individual the more he is typified by an acquiescent
set which involves conformance, being “nice,” and
pleasing others. Murstein (1961) has likewise shown
that the higher the Acquiescence score the greater
is the “inordinate need to win acceptance by
others through excessive conformity.”


TABLE 1

CHI-SQUARE ANALYSIS OF RELATIONSHIP OF BODY
EXTERIOR-INTERIOR SENSATION INDEX
TO ACQUIESCENCE IN MALE
AND FEMALE SAMPLES

                     Exterior-interior
                 Female              Male
Acquiescence    H    L    χ²       H    L    χ²

     H          9   18   3.7*      7   16   4.7**
     L         16   11            17   11

Note.—H = above median, L = at median or below.
 *p < .05, one-tailed test.
**p < .02.


Subjects 

There were two samples of subjects. One consisted
of 51 males and the other of 54 females. Both were
made up of college students. The median age in
the female group was 19 years and the median edu-
cational level, 13 years. The respective values for
the male group were 20 years and 14 years. All
subjects were recruited by offering the payment of
a fee. The experimental procedures were adminis-
tered in small groups ranging from two to four
persons.


RESULTS 


The median body exterior-interior sensation
score in the female group was +4 (range +26
to —33) and the median Acquiescence score was 
25 (range 7 to 40). In the male group the 
median exterior score was +6 (range +15 to 
—38) and the median Acquiescence score, 23 
(range 2 to 38). 

Because of the skewed nature of the body- 
perception indices chi-square was used to ana- 
lyze their relationships to the Acquiescence 
scores.

As can be seen in Table 1, the results for 
the female group are clearly in the predicted 
direction. A chi-square of 3.7 was obtained 
which falls at the <.05 level of significance in 
terms of a one-tailed test. Similarly, the same 
relationship appeared even more strongly in the 
male group where the chi-square value was 4.7 
which is significant at the <.02 level. In general,
the data indicate that both in male and female
subjects the greater the predominance of body-
boundary over interior sensations the lower the
Acquiescence score.
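
The two chi-square values can be approximated from the fourfold tables. The sketch below rests on several assumptions: the cell counts are read from Table 1 with the unreadable cells filled in from the sample sizes (54 females, 51 males), the columns are taken to be exterior-interior H and L in that order, and the statistic is the uncorrected Pearson chi-square. The report does not say whether a continuity correction was applied, so this is a plausibility check and not a reproduction of the author's computation.

```python
def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Uncorrected Pearson chi-square for the fourfold table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Rows: Acquiescence H, L.  Columns: exterior-interior H, L (assumed order).
female_chi2 = chi_square_2x2(9, 18, 16, 11)
male_chi2 = chi_square_2x2(7, 16, 17, 11)

print(f"female: {female_chi2:.2f}")
print(f"male:   {male_chi2:.2f}")
```

Under these assumptions the statistics come out near 3.65 and 4.65, close to the reported values of 3.7 and 4.7.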


DISCUSSION 


The findings demonstrate that the manner in 
which an individual perceives sensations from 
exterior versus interior sites of his body is pre- 
dictive of the extent to which he adopts an 
acquiescent set. If he experiences his body- 
boundary regions as relatively prominent, he 
is likely to have a self-determining attitude. This 
suggests that the organization characterizing 
body-sensory experiences is linked with per- 
sonality and trait patterns. 

At present, it can only be a matter of specula- 
tion as to how this link is established. However, 
a possible note of clarification is provided by 
the following statement (Fisher & Cleveland, 
1958): 


the activities of the exterior layers of the body 
are much more subject to voluntary control than 
are those of the body interior. The individual can 
embellish, cover up, or change the appearance of 
his skin and he exerts controlling force over his 
striated musculature. But his body interior is, by 
and large, subject to the influence of involuntary 
autonomic centers. Consequently, the individual is 
probably left with the feeling that his body exterior 
is something which he commands and can actively 
use in his own defense. This is a feeling which he 
lacks about his body interior [p. 33]. 


With this perspective, one may suggest that 
he who is motivated to active and voluntary 
expression invests interest in those body areas 
which are associated with, and likely to consti- 
tute a channel for, such expression. A simple 
analogy for this process would be the athlete 
who lavishes interest on that part of his body 
central to his prowess. Parenthetically, one ques- 
tion that may be raised is whether the relatively 
high investment of attention in the body interior 
by the acquiescent person is merely an absence 
of interest in exterior body sectors or represents 
a “retreat” to that which has no connotations 
of action.

The general problem remains of determining 
the extent to which the patterning of exterior- 
interior and perhaps other sensation dimensions 
can be used to predict significant attitudes and
modes of conduct.


40 younger and 40 older institutionalized mentally retarded Ss were matched
for MA level. They differed by about 20 yr. of institutional residence. Each
S was individually presented the task of arranging a series of 9 cards designed
to portray an emotional continuum from happiest to unhappiest. Younger Ss
performed significantly better than older Ss (p < .01). Mental age was not
related to ability to perform the task. Poorer performance was significantly
related to length of institutionalization (r = .36). The results appear to support
clinical and experimental findings in regard to the adverse effect of institu-
tionalization.


In the comparison of the performance of
mentally retarded versus normals, several investi-
gators have pointed to the adverse effect of
institutionalization on the motivation and be-
havior of the retardates (Sarason & Gladwin,
1958; Stevenson & Zigler, 1957; Zigler, 1962).
Stemming from several sources there has been
an accumulation of findings which generally have
indicated that institutionalization, as presently
carried out, exerts an adverse effect on human
beings. The purpose of the present investigation
was to compare the performance of younger
versus older mental retardates on a task which
involved the arranging of a series of faces de-
signed to portray an emotional continuum. It
was hypothesized that younger mentally retarded
subjects would be superior to older retardates
of the same mental age, due to the effect of
the longer institutionalization of the older group.
In other words, the younger mentally retarded
would be more sensitive to gradations of
emotional expression as portrayed in the stimulus
materials.


METHOD 
Subjects 


Subjects were selected from the populations of 
two state schools, one housing younger retardates 
and one exclusively for older retardates. Only male 
subjects were used. They were part of a larger study
dealing with divergent thinking and originality in 
the mentally retarded. Data for the present study 
were obtained before the subjects participated in the 
originality study. 

In order to assure comparability of intellectual 
level, the Vocabulary subtest of the Stanford-Binet 
(Form L, 1937) was administered individually to 
143 younger and older mental retardates. Transfor- 
mation of raw Binet Vocabulary scores into their 
MA equivalents was done according to the method 
of Cureton (1954). From this pool of tested sub- 
jects, 40 younger and 40 older were matched on 
the basis of Vocabulary MA level. The Binet 
Vocabulary correlates .78 with the total Binet IQ. 
The advantage of using a recently administered 
vocabulary test as the instrument of intellectual 
assessment is that it provides an index of intel- 
ligence level of equal recency for all subjects. This 
is especially important for older subjects, many of 
whom have not been tested for a good many years. 
The mean chronological and mental ages as well as 
years of institutionalization are given in Table 1. 
It may be seen that there was a difference of roughly
24 years in chronological age between the older and
younger subjects, with no significant difference in MA
level. The older retardates had been institutionalized
some 20 years more than the younger.


TABLE 1

MEAN CA, MA, AND YEARS OF INSTITUTIONAL
RESIDENCE OF SUBJECTS

                  Older            Younger
               M       SD        M       SD
CA           39.8     9.2      15.8     1.6
MA            7.9     1.9       7.9     1.8
Residence    22.2    10.62      2.6     2.4
N              40                40


Stimuli 


The development of the stimuli employed has been 
previously described and pictorially presented else- 
where (Iscoe & Veldman, 1963). Briefly they con- 
sisted of nine separate “moon-faced” drawings de- 
signed to portray an emotional continuum ranging 
from very happy (or happiest) to very unhappy 
(or frowning). Each face was mounted on a separate 
card measuring 5 × 4 inches. In the middle of each
card was a “moon face” about 2.75 inches in 
diameter drawn in black and white. There were four 
degrees of “happiness” ranging from a big grin to 
a barely perceptible smile, a neutral face, and four 
degrees of “unhappiness” ranging from a slight frown 
to an extremely unhappy looking face.3 Extensive 
pretesting on normal children had revealed that the
majority of 8-year-olds and virtually all 9-year-olds
could arrange the nine faces in a continuum with
few or no errors.


Administration 




All testing was done by the same experimenter, 
each subject being tested individually. When the 
subject had been seated and made comfortable, the 
experimenter took the deck of nine cards, shuffled 
them, and placed them in front of the subject in 
random order. His instructions were: 


You see these faces here in front of you. Some 
of them are happy and some of them are un- 
happy. I want you to arrange these nine faces 
in a line starting from here and going to here 
[the experimenter pointed from left to right]. 
I want you to arrange them from the happiest
to the unhappiest. Do you understand?


The experimenter took special precautions to make
sure that each subject understood the task require-
ments. He answered all questions and no time limits
were imposed. The subjects seemed highly motivated
to perform. If a subject arranged the continuum
from unhappy through neutral and then on to most
happy he was not penalized in the scoring. After
each subject had finished arranging the cards and
indicated that he was finished, the experimenter let
the subject out of the room and then recorded the
arrangement of the faces before obtaining the next
subject.


3 A limited number of sets of the nine pictures
are available upon request to the authors.


Scoring 


Two methods of scoring were employed. Stemming
from previous research (Iscoe & Veldman, 1963) the
arrangement of the nine faces was first classified into
three possible categories: no errors (perfect arrange-
ment), one face misplaced, and two or more faces
misplaced. The second procedure involved scoring
the amount of displacement of each misplaced 
card from its correct position, providing thereby a 
measure of deviation. 


RESULTS 


The first scoring method permitted the utiliza-
tion of a 2 × k chi-square. The results are pre-
sented in Table 2. The chi-square indicates
significant differences in the ability of younger 
versus older retardates in arranging the stimulus 
materials. Indeed, if the one error category 
were collapsed and combined with the perfect 
performance category, then roughly 25% of 
the older as compared to 75% of the younger 
retardates fit this category. 

In terms of the second scoring procedure 
(displacement scores) a mean of 1.97 for the 
younger versus a mean of 13.5 for the older 
mental retardates yielded a t of 6.58 (p < .001).
Here, too, highly significant differences were
obtained between the two samples. 
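
The 2 × k chi-square and the collapsed percentages can be recomputed from the frequencies in Table 2. The sketch below assumes the uncorrected Pearson statistic and takes the garbled row total for the older group as 40, the stated sample size. The function name `pearson_chi_square` is illustrative.

```python
def pearson_chi_square(table: list[list[int]]) -> float:
    """Pearson chi-square for an r x k table of observed frequencies."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Rows: older, younger.  Columns: 0 errors, 1 error, 2 or more errors.
errors = [[4, 6, 30], [27, 4, 9]]

chi2 = pearson_chi_square(errors)
df = (len(errors) - 1) * (len(errors[0]) - 1)

# Share of each group with at most one misplaced face (first two columns collapsed).
at_most_one = [(row[0] + row[1]) / sum(row) for row in errors]

print(f"chi-square = {chi2:.2f}, df = {df}")
print(f"older: {at_most_one[0]:.1%}, younger: {at_most_one[1]:.1%}")
```

The statistic comes out near 28.77 with 2 degrees of freedom, in line with the value printed under Table 2. The collapsed shares are 25% for the older group and 77.5% for the younger, which the text rounds to roughly 25% and 75%.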


Discussion 


Although the marked superiority demonstrated
by the younger retardates would seem to con-
firm the original hypothesis, consideration should
first be given to the possibility of factors other
than length of institutionalization influencing the
results.

Chronological age suggests itself as a factor.
Is it possible that older normals would perform
at the level of the older retardates? From a
previous study (Iscoe & Veldman, 1963) data
were available from 20 hospital attendants
(M age = 35 years). A comparison of their per-
formance indicated that they were vastly
superior to the older retardates, making few if
any errors.


TABLE 2
FREQUENCY OF ERRORS IN ARRANGEMENT

                     Errors
             0     1    2 or more    Total
Older        4     6       30          40
Younger     27     4        9          40
Total       31    10       39          80

Note.—χ² = 28.7, df = 2, p < .001.

The possibility of significant differences in 
intellectual level existing in our subjects, even 
though they were of equivalent MA, must be
dealt with. This is of course one of the problems
that arises when comparing normals and retarded 
utilizing MA as a basis. It looms also in the 
comparison of older and younger retarded. From 
an operational point of view there is of course 
little argument, since both the younger and 
older retarded subjects were of equivalent MAs. 
Additionally, from Table 1, if a mean IQ is 
computed on the basis of MA and age 16 is 
employed as the CA for the older group (cutoff 
CA for the Stanford-Binet, Form L, 1937) the 
result is virtually equivalent IQs for the younger 
and older retardates. As a further check, a 
median test was run dividing the subjects above 
and below the median MA level and above and 
below the median displacement score for the 
faces. No significant relationship was found. A 
correlation between MA and displacement scores 
likewise was nonsignificant. It seems safe to say 
therefore that differences in intellectual status 
cannot account for the results obtained. 
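
The IQ comparison described above can be illustrated with the classical ratio IQ, 100 × MA / CA, with chronological age capped at 16 years as the text specifies for the older group. The sketch below assumes this formula, which the report does not print, and uses the group means from Table 1. The function name `ratio_iq` is hypothetical.

```python
def ratio_iq(mental_age: float, chronological_age: float, ca_ceiling: float = 16.0) -> float:
    """Ratio IQ = 100 * MA / CA, with CA capped at the test's cutoff age."""
    return 100.0 * mental_age / min(chronological_age, ca_ceiling)


# Group means from Table 1 (MA = 7.9 in both groups).
younger_iq = ratio_iq(7.9, 15.8)  # CA below the cutoff, used as given
older_iq = ratio_iq(7.9, 39.8)    # CA above the cutoff, replaced by 16

print(f"younger: {younger_iq:.1f}, older: {older_iq:.1f}")
```

On these figures the two mean IQs differ by less than one point, which is the sense in which the text calls them virtually equivalent.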

In terms of possible motivational differences, 
both the younger and older subjects seemed 
well motivated and eager to cooperate. They 
greeted the experimenter enthusiastically and 
certainly were in no hurry to leave once they 
had completed the assigned task. 

It would seem, therefore, that the differences
between the two sample populations are best
explained in the light of findings regarding the
effects of prolonged institutionalization on human
beings. A median test comparing months of
institutionalization and displacement scores was
highly significant (χ² = 7.2, df = 1, p < .01), as
was the frequency of errors in arrangement
when compared with years of institutionalization
(χ² = 8.95, df = 2, p < .02). In addition, a cor-
relation run between years of institutionalization
and deviation scores yielded an r of .36
(p = .01). When the younger and older groups
were analyzed separately, however, the rela-
tionship was not significant between length of
institutionalization for each group and the two
dependent measures, i.e., frequency of errors and
displacement scores. Nevertheless, consideration 
should be given to differences in the institutional 
environments and to the possibility that the 
negative effects of institutionalization are not 
necessarily linear. Beyond a certain critical 
point, as yet undetermined, the length of 
institutionalization may cease to exert much of 
an effect on an individual. More precise investi- 
gation is needed to tease out complex relation- 
ships existing between the type of institutional 
environment, the length of time spent in it, and 
the chronological age of the person. 

It seems reasonable to assume that the younger 
subjects, living in an environment with a rela- 
tively greater amount of stimulation than the 
older subjects, had maintained (up to this point, 
at least) the sensitivity to discriminate the 
subtleties of human expression. The active hospi-
tal program, including schooling, may have
exerted some effect on the maintenance of this 
ability. The older retardates should not be con- 
strued as living entirely in an isolated, deprived 
situation. Indeed, their living conditions, in the 
opinion of the authors, were probably superior 
to the environmental situations provided for 
retarded of this age in many institutions. How- 
ever, 20 years in an institution may well do 
away with the need for or the ability to carry 
out the finer discriminations of human reactions, 
especially in the areas of emotionality and the 
perception of emotional gradations. 


REFERENCES 


CURETON, E. E. Mental age equivalent for the re-
vised Stanford-Binet vocabulary test. Journal of
Consulting Psychology, 1954, 18, 381.

ISCOE, I., & VELDMAN, D. Perception of an emo-
tional continuum by schizophrenics, normal adults
and children. Journal of Clinical Psychology, 1963,
19, 272-276.

SARASON, S. B., & GLADWIN, T. Psychological and
cultural problems in mental subnormality: A re-
view of research. Genetic Psychology Monographs,
1958, 57, 7-269.

STEVENSON, H. W., & ZIGLER, E. Discrimination
learning and rigidity in normal and feebleminded
individuals. Journal of Personality, 1957, 25, 699-
711.

ZIGLER, E. Rigidity in the feebleminded. In E. P.
Trapp & P. Himelstein (Eds.), Readings on the
exceptional child. New York: Appleton-Century-
Crofts, 1962. Pp. 141-162.


(Received July 19, 1963) 


Journal of Personality and Social Psychology 
1965, Vol. 1, No. 4, 386-389 


ACHIEVEMENT MOTIVATION AND INTERGENERATIONAL 
OCCUPATIONAL MOBILITY 1


LAWRENCE W. LITTIG
Howard University

AND

CONSTANTINE A. YERACARIS
University of Buffalo


High and low need Achievement (n Ach) correlates of occupational mobility
were assessed on a sample of 177 men and 179 women. n Ach was measured
by means of TAT and occupational mobility was determined by comparing
the status of S’s occupation (men) or S’s husband’s occupation (women)
with S’s father’s occupation in terms of a blue collar-white collar distinction.
Upward occupational mobility was found to be related to high n Ach and
downward occupational mobility to low n Ach. Ss’ n Ach scores were not
related significantly to father’s occupation but were related significantly to
S’s occupation (men) or S’s husband’s occupation (women), suggesting the
possibility that n Ach scores are dependent upon S’s social class status at the
time n Ach is assessed.


In a recent paper, Crockett (1962) has re-
ported a positive relationship between strength
of achievement motivation (n Ach) and occupa-
tional mobility among men. High n Ach sons of
fathers in both low prestige and lower middle
prestige occupational categories were found to
be upwardly mobile. There was, however, no
relationship between high n Ach and occupational
mobility among men whose fathers’ occupational
categories were classified as upper middle or high
and no relationships between low n Ach and
downward occupational mobility for all occupa-
tional categories. These findings, based on an
analysis of national survey data, were interpreted
as supportive of a model of motive-related
achievement behavior developed by Atkinson
(1957).

Atkinson’s basic theory has received extensive
coverage elsewhere and the argument for pre-
dicting a relationship between n Ach and oc-
cupational mobility has been well stated by
Crockett who maintains that the occupational
prestige hierarchy may be viewed as roughly
corresponding to degrees of probability of an
individual attaining a position in the hierarchy:
The higher the prestige of an occupation, the
lower the probability of attaining it. Generally,
persons with strong n Ach will be motivated to
choose to strive for and attain relatively more
difficult occupations than people with weak
n Ach.

The present report is based on these argu-
ments, developed by Crockett, for applying
Atkinson’s theory to occupational mobility. But,
unlike Crockett’s study in which mobility was
assessed within social class strata, this study
attempts to relate n Ach to mobility across the
broad social class line between blue-collar and
white-collar occupations.

The hypotheses examined are:

1. A significantly higher proportion of high
n Ach than low n Ach subjects will be up-
wardly mobile.

2. A significantly higher proportion of low
n Ach than high n Ach subjects will be down-
wardly mobile.

Hypothesis 1 is tested by comparing the
proportions of high and low n Ach subjects
from blue-collar backgrounds who move to white-
collar occupations with the corresponding pro-
portions of those who remain in blue-collar
occupations. Similarly, to test Hypothesis 2,
the proportions of high and low n Ach subjects
from white-collar backgrounds who presently
occupy white- and blue-collar occupational cate-
gories are compared.


1 Support for this research was provided by the
Ford Foundation.


METHOD 
Subjects 


The findings to be reported represent a [...]
analysis of data obtained during an investigation
of the effects of educational television on the at-
titudes of a community’s members toward educa-
tion. A probability cluster multistage sample of
400 subjects, 190 male heads-of-household and 210
married women, residing in an upstate New York
community was obtained. Of these, 177 men and
179 women provided data meeting the criteria of
this study.


Procedure 


Subjects were interviewed in their own homes by
trained interviewers who administered a
pretested schedule to assess attitudes toward educa-
tion. Prior to responding to these questions, the
subjects were requested to tell stories to a six- 
picture thematic apperception series (TAT). The 
six pictures were previously used in a nationwide 
survey by Veroff, Atkinson, Feld, and Gurin (1960). 
The stories obtained from the subjects were coded 
for n Ach, by means of the content analysis tech- 
nique described by McClelland, Atkinson, Clark, and 
Lowell (1953), by four expert coders who were 
trained using the training manual for n Ach pre- 
pared by Smith and Feld (1958). Each coder had 
achieved at least 90% reliability with the expert 
coding given in the manual. Distributions of n Ach 
scores were prepared within interviewer-coder pairs
to reduce any systematic interviewer or coder bias 
and subjects were classified as being high on n Ach 
if their n Ach scores were above their respective 
groups’ medians and low on n Ach if their scores 
were below their respective groups’ medians. Of the 
total N of 400 there were 90 men and 90 women 
assigned to the high n Ach group, and 87 men and 
89 women assigned to the low n Ach group. Sub- 
jects at the median and subjects who had not pro- 
duced scorable TAT protocols were not included. 
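
The median-split rule described above can be sketched in code. This is only an illustration under stated assumptions: the function name `median_split`, the group labels, and the scores are hypothetical and are not data from the study. The sketch assumes that each interviewer-coder pair forms its own distribution and that subjects exactly at the median are dropped.

```python
from statistics import median


def median_split(scores_by_group):
    """Label each subject 'high' or 'low' relative to the median of its own group.

    Subjects whose score equals the group median receive no label,
    mirroring the exclusion of subjects at the median.
    """
    labels = {}
    for group, scores in scores_by_group.items():
        cutoff = median(scores.values())
        for subject, score in scores.items():
            if score > cutoff:
                labels[subject] = "high"
            elif score < cutoff:
                labels[subject] = "low"
    return labels


# Hypothetical scores for two interviewer-coder pairs (not data from the study).
example = {
    "pair_A": {"s1": 2, "s2": 5, "s3": 9},
    "pair_B": {"s4": -1, "s5": 0, "s6": 4, "s7": 6},
}
labels = median_split(example)
```

Splitting within pairs rather than over the pooled sample keeps a lenient or strict coder from pushing all of his protocols into one category.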
To assess occupational mobility the occupations 
of subjects and of their fathers (or of the subject’s 
husband and her father in the case of female sub- 
jects) were assigned blue- or white-collar status. 
Occupations in the following broad categories were 
designated white collar: professional, managerial, 
proprietary, farm owner, clerical, and sales. Occupa-
tions in the general categories of craftsman, opera-
tive, household worker, service worker, and farm
laborer were classified as blue collar. Occupational 
mobility was then deemed to be upward if the 
subject was in a white-collar occupation and his 
father was in a blue-collar occupation (or if the
subject was a female whose husband had a white-
collar occupation and whose father had a blue-
collar occupation), and downward if the subject
(or the subject’s husband in the case of women) was
in a blue-collar occupational category and the 
subject’s father was in a white-collar category. 
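
The classification rule above amounts to a small lookup followed by a comparison of origin and destination. The sketch below is an illustration only: the names `collar` and `mobility` and the label "stable" for subjects who did not cross the collar line are assumptions introduced here. The occupational categories themselves are those listed in the text.

```python
WHITE_COLLAR = {"professional", "managerial", "proprietary",
                "farm owner", "clerical", "sales"}
BLUE_COLLAR = {"craftsman", "operative", "household worker",
               "service worker", "farm laborer"}


def collar(category):
    """Map a broad occupational category to 'white' or 'blue'."""
    if category in WHITE_COLLAR:
        return "white"
    if category in BLUE_COLLAR:
        return "blue"
    raise ValueError(f"unclassified occupation: {category!r}")


def mobility(father_category, current_category):
    """Compare the father's status with the subject's (or her husband's) status."""
    origin = collar(father_category)
    destination = collar(current_category)
    if origin == "blue" and destination == "white":
        return "upward"
    if origin == "white" and destination == "blue":
        return "downward"
    return "stable"  # hypothetical label for no change in collar status
```

For a female subject, `current_category` would be her husband's occupation, as the text specifies.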


RESULTS 


Table 1 summarizes the relationships between 
motivation and mobility for men and women and 
chi-square analyses relevant to the hypotheses. 

Hypothesis 1 predicted a positive relationship 
between n Ach and mobility. High n Ach is 
positively related to upward mobility at a 
statistically significant level for women (χ² =
4.22, df = 1, p < .025), and the relationship
approaches statistical significance for men (χ² =
1.75, df = 1, p < .10) when subjects of blue-
collar origin who achieved white-collar status
and subjects of blue-collar origin who remained 
in the blue-collar occupational stratum are 
compared. 

Fifty-six percent of high n Ach women are 
upwardly mobile whereas only 35% of low n Ach 
women are upwardly mobile. The split between 
women remaining in the blue-collar category is 
44% high n Ach versus 65% low n Ach. For 
men the proportions are similar although the 
relationship is not as pronounced. 

Hypothesis 2, which predicted a relationship 
between low n Ach and downward mobility, is 
confirmed for women and receives some support 
from men. In Table 1 it may be seen that, for 
both men and women in this sample, a higher 
proportion of low n Ach subjects are downwardly 
mobile than high n Ach subjects and a higher 
proportion of high n Ach than low n Ach sub- 
jects remain within the white-collar status. The 
results are statistically significant for women 
(χ² = 3.62, df = 1, p < .05) and approach sig-
nificance for men (χ² = 2.19, df = 1, p < .10).


TABLE 1
HIGH AND LOW n ACH AND OCCUPATIONAL MOBILITY

                           Men                        Women
                     High         Low           High         Low
Mobility            N    %      N    %        N    %      N    %
Blue → blue        31   56     41   68       24   44     30   65
Blue → white       24   44     19   32       30   56     16   35
Total              55  100     60  100       54  100     46  100
    Men: χ² = 1.75, df = 1, p < .10, one-tailed test
    Women: χ² = 4.22, df = 1, p < .025, one-tailed test
White → white      28   80     17   63       26   72     [illegible]
White → blue        7   20     10   37       10   28     [illegible]
Total              35  100     27  100       36  100     [illegible]
    Men: χ² = 2.19, df = 1, p < .10, one-tailed test
    Women: χ² = 3.62, df = 1, p < .05, one-tailed test

Note. N = 177 men and 179 women.


TABLE 2
DISTRIBUTION OF HIGH AND LOW n ACH SUBJECTS BY FATHER’S OCCUPATION

                           Men                        Women
                     High         Low           High         Low
Occupation          N    %      N    %        N    %      N
White              35   39     27   31       36   40     43
Blue               55   61     60   69       54   60     46
Total              90  100     87  100       90  100     89
    Men: χ² = 1.23, df = 1, p > .20, two-tailed test
    Women: χ² = 1.26, df = 1, p > .20, two-tailed test


Of further interest are the simple relationships 
between the subject’s occupation and his n Ach 
score (or the subject’s husband’s occupation and 
her n Ach score) on the one hand and between the
subject’s n Ach score and the occupation of the
subject’s father on the other. A slightly greater proportion of
high n Ach men than low n Ach men have 
fathers with white-collar occupations (39% 
versus 31%) but the reverse is true for women 
(Table 2). In neither case is the relationship 
statistically significant. There is, however, a 
statistically significant relationship between the 
subject’s n Ach score and his (or her husband’s) 
occupation. High n Ach is associated with white- 
collar occupations and low n Ach with blue- 
collar occupations (Table 3). n Ach is related 
to the subject’s occupationally based social class 
but not to that of the subject’s father. 
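
The chi-square for the men's own occupations can be checked from the cell counts printed in Table 3. The sketch below assumes the uncorrected Pearson statistic for a 2 × 2 table; the report does not say whether a continuity correction was applied, and the function name `chi_square_2x2` is introduced here for illustration.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi-square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Men's occupation by n Ach (Table 3):
# white collar: 52 high, 36 low; blue collar: 38 high, 51 low.
men_occupation = chi_square_2x2(52, 36, 38, 51)
print(round(men_occupation, 2))  # prints 4.76
```

With df = 1, the result agrees with the tabled value of 4.76 for men.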


DISCUSSION 


The present data generally support Crockett’s 
conclusion that high n Ach is related to upward 


TABLE 3 


DISTRIBUTION OF HIGH AND LOW n ACH SUBJECTS BY OCCUPATION (MEN)
AND HUSBAND’S OCCUPATION (WOMEN)


Crockett’s findings in several ways. 

First, Crockett’s analysis was limited to 
while the findings reported here indicate that 
relationship between n Ach and occupatio! 
related mobility holds for married women wh 
their mobility is linked to their husband's 
occupation. 

Second, the finding of a relationship bet 
low n Ach and downward occupational mobilit 
new. 

Third, all findings relate to mobility betwee 
rather than, as was the case with Crocket 
data, within social class. It was not possib 
because of the small sample, to assess mobi 
within social class to verify Crockett’s findi 
Crockett’s failure to find interclass mobility 
national sample necessitates more cautious 
of the present findings which were derived fro 
data obtained in a small but highly industriali 
northeastern United States community. It is po 
sible that this is just the kind of communi 


                           Men                        Women
                     High         Low           High         Low
Occupation          N    %      N    %        N    %      N
White              52   58     36   41       56   62     [illegible]
Blue               38   42     51   59       34   38     [illegible]
Total              90  100     87  100       90  100     89


    Men: χ² = 4.76, df = 1, p < .05, two-tailed test
environment which allows the intergenerational
social movement which was found. 
Fourth, the absence of a statistically significant 
relationship between father’s occupation and 
n Ach has bearing upon the interpretive problem, 
discussed by Veroff et al. (1960), of deciding 
whether a relationship between n Ach and oc- 
cupational mobility is brought about by the 
effect of n Ach on mobility or the effect of 
mobility on n Ach. It suggests the possibility 
that the relationships between n Ach and oc- 
cupational mobility may result from a change in 
n Ach accompanying, or resulting from, a change 
in occupationally related status rather than high 
or low n Ach being an antecedent of mobility. 
The relationship is between n Ach and the oc- 
cupational class in which the subject is im- 
mersed at the time of assessment and not the 
occupational class from which he originated. 
This suggests that subjects who have moved 
across class lines in the occupational hierarchy 
may have had their n Ach affected by moving 
if it is assumed, on the basis of the previous 
finding of a correlation between adolescent 
boys’ n Ach scores and the social class of their 
families (Rosen, 1956), that subjects’ n Ach
scores would have, at an earlier time, shown a 
relationship to the occupationally based social 
class of their families. That is, if like Rosen’s 
adolescent subjects, the downwardly mobile, low 
n Ach subjects of this study would have pro- 
duced high n Ach scores as adolescents within 
their middle class (i.e. white collar) families 
and the upwardly mobile, high n Ach subjects 
would have had low n Ach scores as adolescents 
in blue-collar families, then a change in n Ach
scores accompanied changes in occupationally 
based status. If their n Ach scores had not 
changed, and Rosen’s finding is generalized to 
this group, then a higher proportion of high 
n Ach than low n Ach subjects would have 
come from families in which the father had a 
white-collar occupation, and a higher propor- 
tion of low n Ach than high n Ach subjects 
would have appeared among subjects from blue-
collar family backgrounds.
N ACHIEVEMENT AND ENTREPRENEURSHIP:


A LONGITUDINAL STUDY 


DAVID C. McCLELLAND 
Harvard University 


An analysis of the occupational position of 55 Wesleyan graduates some 14
yr. after graduation indicated that significantly more of those originally
scoring high in need Achievement (n Ach) than of those low in n Ach were
found in entrepreneurial occupations. A cross-validation study of students of
the classes of 1954 and 1955 confirmed the finding that males with high n Ach
gravitated toward business occupations of an entrepreneurial nature.


In The Achieving Society (McClelland, 1961)
data are reported showing relationships between
high need Achievement (n Ach) scores and
“entrepreneurial” behavior on laboratory tasks,
preference for business occupations when they
represent a moderately high level of aspiration
for a boy, and occupational status as a business
executive versus specialist or professional in the
United States, Italy, and Poland. The most
reasonable interpretation of these facts seemed 
to be that high n Ach predisposes a young man 
to seek out an entrepreneurial position in which 
he can, normally, attain more of the achievement 
satisfactions he seeks than in other types of 
positions. Ordinarily business should provide 
more scope for entrepreneurial behavior; in fact 
it requires more of it in sales positions than 
the professions normally require, although of 
course almost any occupation can be pursued 
in an entrepreneurial way. 

As reasonable as this interpretation may seem, 
direct confirmation of it requires a longitudinal 
study. For the higher n Ach common among 
men in entrepreneurial occupations may be the 
result of having to function in these positions 
rather than the cause of seeking them out. The 
critical question is whether men who end up in 
entrepreneurial occupations had higher n Ach 
years earlier than men who end up in other 
occupations of equal prestige. Data have recently 
become available which bear on this point. 


METHOD 


In July 1961, Wesleyan University completed an 
effort to bring up-to-date its Alumni Directory 
which lists the current occupations and other bio- 
graphical data for all former students still alive and 
responding to a request for such information. The
Directory therefore provided the data needed to 
see what had happened to students tested for n Ach 
as early as December 1947—the same students on 
whom much of the research reported in The 
Achievement Motive (McClelland, Atkinson, Clark, 
& Lowell, 1953) was conducted. Most of these 
students graduated in the class of 1950. Many were
veterans of World War II so that their average age 
at the time of testing in their sophomore year was 
21.6 years, about 2 years above normal for such a 
class. Fourteen years later in 1961, they were about 
35 years of age on the average—the youngest being 
31 and the oldest 46. Thus they could be considered 
reasonably settled as to occupational choice. To give 
some idea of their intelligence level, their average 
SAT score (Verbal and Mathematical subtests 
combined) was 528 (N =25). 

Of the 58 on whom n Ach scores were available 
from this group, 55 provided sufficient data in the 
Alumni Directory for their occupational status to 
be coded. The entries in the Directory were typed 
on cards and the cards were sorted first into busi- 
ness occupations versus all other occupations 
(largely the professions—law, medicine, divinity, 
etc—in this sample of men, but including some civil 
service jobs). Next the cards in the business pile
were sorted into entrepreneurial or nonentrepre-
neurial categories, using the following criteria:

Definition. An entrepreneurial occupation, follow- 
ing the definition of Meyer, Walker, and Litwin 
(1961) in their study of managers versus sp
is one in which the individual has 


more responsibility for initiating decisio, 
than merely having to make decisions wh 
sented with problems. 

more individual responsibility for di 
their effects—that is, without review or as 
of a supervisor or a committee. 

[more] objective feedback of accurate di 
cating the success of his decisions, such as in 
volume, profitability, reduction in errors, 
plaints, etc., rather than just a general subj 
evaluation of his success by the group 
superiors. 

[a] job [which] entails more risk or d 
in that there is more chance of a serious 
decision being observed [p. 571]. 
Generally speaking the following occupations 

coded as meeting or failing to meet these c 


Entrepreneurial: 


Sales (except clerical sales) 

Real estate and insurance sales

Operates own business (including family bus 
if a key executive) 

Management consulting, fund raising, etc. 

Officer of a large company, assistant to the 
dent of a large company, etc, 

For example, money management at lower 
is classified as “nonentrepreneurial” (for € 
establishing consumer credit), but Vice-P! 
of a large New York commercial bank in 
of credit is classified as “entrepreneurial.” — 


Nonentrepreneurial: 


Credit managers 
Traffic managers 
Personnel managers 
Office managers
Appraisers
Data processors
Sorting was done independently by two ju 
without knowledge of n Ach scores. Agreement 
91%. Cases of disagreement were decided by 
author, though several of them were simply | 
sified as not providing clear enough inforn 
to make a decision. 

The results of this follow-up study were si 
83% of the entrepreneurs had been high in I
14 years earlier versus only 21% of the nonentrep 
neurs (see Table 1). It was therefore decided 
cross-validation of the findings was imperative. 
fortunately further testing for n Ach was not d 


1955 were administered the test at entrance. 
samples have two serious drawbacks for 
validation purposes: F 


1. The primary objective of the testing o 
occasions was to try out new pictures, new 
categories, and a new time limit for writing (27y 
4 minutes) in order to get a more “valid” p 
TABLE 1


NUMBERS OF ENTREPRENEURS, NONENTREPRENEURS, AND PROFESSIONALS IN 1961 WHO WERE
ABOVE AND BELOW THE MEDIAN IN n ACHIEVEMENT AS COLLEGE UNDERGRADUATES
IN 1947 AND 1950-51


                                        n Achievement
                                  1947            1950-51       % above median score,
                              Above  Below     Above  Below     combined samples
Business entrepreneursᵃ         10      2        18     12        67%   χ² = 8.70*
Business nonentrepreneursᵇ       3     11        12     17        35%
Professionals                   16     13        50     49        52%


ᵃ Sales, real estate and insurance, fund raising, founder of own business, management consulting, etc.
ᵇ Credit, traffic, personnel and office managers, appraisers, data processors, treasurers, etc.


* p < .01.


of academic performance (which was never found). 
Thus the n Ach scores available were based on very 
different and variable procedures. In particular it
was found that the shorter time interval for writing 
stories had some real drawbacks, such as introducing 
a correlation between n Ach and length of protocol 
(cf. Ricciuti & Sadacca, 1955). 

2. The time between college graduation and
follow up was really too short to be sure the 
individual had found his life occupation. Many had 
gone into the Armed Forces for 2 or 3 years after 
graduation so that they had been working only 
for 3 or 4 years when the follow up was made. 
They were of course also much younger being about 
28 on the average. Nevertheless it seemed worth 
checking the earlier result on these larger samples, 
since they provided the only other data available 
to check the hypothesis at this date. 


Consequently the same coding procedure was 
carried out on these samples as on the first one. 
Unfortunately proportionally fewer of these men 
turned up in business. So it was necessary to combine 
the later samples to provide larger frequencies in 
the analysis.


RESULTS AND DISCUSSION 


Table 1 presents the main findings of the 
study showing the numbers of men in the three 
occupational categories who had been above and 
below the median in n Ach (computed separately 
for the three samples) as undergraduates 10-14 
years earlier. The findings are quite striking for
the sample that provides the best test of the
hypothesis in terms of being based on “standard”
n Ach scores and on “stable” occupational
choices. Eighty-three percent of the entrepre-
neurs in business had been high in n Ach as
college sophomores whereas 79% of the non-
entrepreneurs in business had been low in n Ach.*


* It may only be a happy accident for the theory,
but the one person in this group who struck out on
his own to found a business far from his home- 


The cross-validation data on the younger men 
are not nearly so striking but they are definitely 
in the same direction: 60% of the entrepreneurs 
had been high in n Ach as college freshmen 
versus 41% of the nonentrepreneurs. The chi- 
square for all cases (8.70) is highly significant 
(p<.01). There is considerable support in these 
results for the hypothesis that at least in the 
United States and among white college students, 
males with high n Ach tend to gravitate toward 
business occupations of an entrepreneurial nature 
where they can better satisfy their achievement 
aspirations, according to theoretical expectations. 
It appears that n Ach does lead to entering 
entrepreneurial occupations, as the theory of 
economic development outlined in The Achieving 
Society requires. It appears also that n Ach 
scores must be valid in the sense of predicting 
life outcomes over periods of 10 years, despite 
frequent reports in the literature of test-retest 
unreliability over periods of a week or so 
(Atkinson, 1958; McClelland et al., 1953). These 
findings are more in line with those reported by 
Moss and Kagan (1961) who found a correlation 
of +.34 between n Ach scores in adolescent and 
adult male protocols over a 10-year lapse in time. 

It is worth saying a word about what happens 
to those who enter the professions with high 
and low n Ach. While the occupational informa- 
tion was not sufficient to classify individuals as 
more or less entrepreneurial lawyers, teachers, etc., 
several striking cases suggest that n Ach does 
modify the style in which a person carries out 
his professional duties. For example, a college 
English teacher need not be an entrepreneur to 
be successful in the same way that a sales repre- 
sentative needs to be. The teacher may succeed 


town is Karl, the student analyzed extensively in 
my book on personality, who had the highest n Ach 


score, 
simply by being a scholar who knows more about
Shakespearean plays than anyone else and who 
year after year inducts students in more or less 
the same way into the mysteries of his specialized 
knowledge. Yet in our sample there is a very 
different kind of English Professor—a man with 
a very high n Ach score as a college sophomore. 
He teaches drama, produces plays at the univer- 
sity where he is on the faculty, and has founded 
his own stock company which tours the state 
during the summer months putting on plays with 
student actors. Obviously he is part business 
entrepreneur, since he must take initiative, as- 
sume risks, plan plays that will draw audiences, 
and meet costs, etc. Yet in Table 1 he is clas- 
sified as a “professional” because he is a teacher. 
If the occupational information had been better,
it would probably have been possible to do a
better job of determining who had turned out
to be an “entrepreneur,” even among the profes-
sionals. But generally speaking the opportunity
and necessity for this kind of behavior would
appear to be greater in the world of business.

A further follow up by questionnaire on these 
and other students tested in the late 40s and 
early 50s is planned but in the meantime these 
data do support the hypothesis that n Ach is 
EFFECT OF VARIATIONS IN WARNING-INTERVAL DURATION
ON REACTION TIME IN NORMAL AND MENTALLY
DEFECTIVE SUBJECTS 1


DONALD J. DICKERSON 2 
George Peabody College for Teachers 


Ellis (1963) has cited evidence which indicates 
that normal and mentally defective humans differ 
in behaviors dependent upon short-term memory 
and has presented a theoretical model to “ac- 


1 This research was supported in part by NIMH 
Grant No. M-4433 made to George Peabody College
for Teachers, Norman R. Ellis, principal investigator. 

2 The author is a predoctoral fellow in the Pea- 
body mental deficiency program. 
a fairly stable personality characteristic whic 
given certain characteristics of the social sy: 
predisposes young men to enter entreprene! 
occupations or to function in traditional oce 
tions in entrepreneurial ways. 
were compared with respect to re-
over 4 blocks of trials with mean 


central to the theory, stimulus trace (s_t) and in-
tegrity of the central nervous system (n_i). s_t is
defined antecedently by an environmental stimu-
lus and, consequently, by a behavioral event. n_i is
defined by intelligence quotient or other indices
of behavioral inadequacy. The theory states that
s_t is of a shorter duration and a lesser magnitude
in the organism with lower n_i. A deficiency of
available reverberatory circuits is suggested as a
possible neurological substrate for n_i. Ellis hy-
pothesized that the apparent short-term memory
deficit in mentally defective humans is the result
of an impoverished s_t.

Terrell and Ellis (1964) utilized the reaction 
time (RT) experiment in a test of the s_t hy-
pothesis. Two subgroups of normals and two sub-
groups of defectives were tested on the RT task.
For both groups a red warning light preceded by 
varying intervals of time a buzzer sound to which 
the subject was instructed to respond. However, 
for one subgroup the red light came on and re- 
mained on throughout the warning interval (on- 
on condition), and for the other subgroup the 
light came on for 1 second and then went off for 
the remainder of the warning interval (on-off 
condition). Warning intervals 2, 4, 8, and 12
seconds in length were used. An interaction be- 
tween group and condition was predicted from 
the s_t theory. Slower RTs were expected under
the on-off condition since the “set” to respond 
would be dependent upon a memory of the warn- 
ing stimulus. It was expected that the difference 
between the retarded subgroups would be rela- 
tively large when compared with the difference 
between the normal subgroups. An additional pre- 
diction was that the retarded subjects would show 
a greater increase in RT with an increase in warn- 
ing interval length beyond the optimum than 
would the normal subjects. This hypothesis would 
be tested by a Group × Warning Interval inter-
action. The results of the study confirmed the
first prediction. The RTs of the retarded subjects 
were much slower under the on-off condition, 
while the RTs of the normal subjects were not 
affected significantly by the conditions. However, 
the results of the study did not confirm the sec-
ond prediction, although the Group × Warning
Interval interaction was significant. An inspec-
tion of the data indicated that the trend was to-
ward faster RTs at the longer warning intervals
in the retarded subjects, while the RTs of the 
normal subjects did not change as a function of 
warning interval length.

The failure of Terrell and Ellis to find slower
RTs at the longer warning intervals may have
been due to a phenomenon reported by Klemmer
([...]) and by Karlin (1959). These experiment-
ers presented data which indicated that RT fol-
lowing a warning interval is influenced by
the length of the warning interval on the im-
mediately preceding trial. RT is slower when a
short warning interval follows a long warning in-
terval and faster when a long warning interval
follows a short warning interval. Terrell and Ellis
presented the different warning intervals in ran-
dom order. This procedure would result in short
warning intervals following long warning inter- 
vals and long warning intervals following short 
warning intervals on at least part of the trials. 
The effect produced may have been to decrease 
RT at the longer intervals and to increase RT at 
the shorter intervals. 

The present experiment was an attempt to as- 
sess the validity of the foregoing interpretation 
of the Terrell and Ellis findings. RT was tested 
in normal and mentally defective subjects over 
blocks of trials with warning intervals of differ- 
ent mean lengths. The trial-block procedure was 
used so that the effect reported by Klemmer and 
Karlin could be controlled. The prediction was 
made that RT would increase in both groups as 
the warning interval increased beyond the opti-
mum length, since the s_t of the warning signal
should fade with time. 


METHOD 

Subjects 

The subjects were 24 female college students from 
undergraduate psychology classes and 24 mildly re-
tarded female students from the Tennessee Prepara-
tory School, Nashville. The retarded subjects had in-
telligence quotients between 60 and 80 and chrono-
logical ages between 15.0 and 20.0 years. The normal
subjects had chronological ages between 18.0 and 22.0
years. Table 1 gives descriptive data for the two
groups. The normal and retarded groups were each 
divided into four subgroups. Analyses of variance 
demonstrated that the assignment to subgroups was 
not biased with respect to chronological age or in- 
telligence quotient. 


Apparatus 

The experiment was conducted in a 5 × 8 foot
cubicle in a mobile laboratory. The latter consists
of an 8 × 24 foot van mounted on a truck chassis.
The van is divided into two 5 × 8 foot chambers
separated by an 8 × 14 foot chamber. The equipment
consisted of a doorbell buzzer reaction stimulus, a 


TABLE 1 


MEAN, STANDARD DEVIATION, AND RANGE OF CHRONO- 
LOGICAL AGE AND INTELLIGENCE QUOTIENT FOR 
THE MENTALLY DEFECTIVE GROUP AND OF 
CHRONOLOGICAL AGE FOR THE NORMAL 
GROUP 


              Chronological ageᵃ        Intelligence quotient
Measure      Defectives    Normals          Defectives
M               17.71       20.00             71.25
SD               1.23        0.66              4.65
Range           15-20       19-21             61-78

ᵃ In years.
TABLE 2


MEANS AND STANDARD DEVIATIONS OF RT SCORES FOR
THE TRIAL BLOCKS WITH DIFFERENT MEAN
WARNING INTERVALS FOR NORMAL AND
DEFECTIVE SUBJECTS


                        Mean warning intervalᵃ
Group and measure       2      6      12     18
Normal
    M                  20     19      21     21
    SD                  3      4       4      4
Defective
    M                  21     22      23     24
    SD                  3      3       3      4


Note. RT computed in 1/100 second.
ᵃ Seconds.


response key, a red warning light (7.5 watts), two 
recycling synchronous motor timers, and a Standard 
Electric timer. One recycling timer controlled the 
duration of the warning stimulus and the other con- 
trolled the warning interval length. The apparatus 
with the exception of the response key and the 
Standard Electric timer was mounted in an aluminum 
“radio” chassis. The response was releasing the de- 
pressed key. RT was measured to the nearest .01 
second by the Standard Electric timer.


Procedure 


The experimental chamber was kept at a low level 
of illumination during the experiment in order to 
maximize attention to the warning stimulus. 

The subject was seated in a chair at a table con- 
taining the apparatus with the response key placed 
directly in front of him. The warning light was 
mounted in the side of the aluminum chassis that 
the subject was facing. The experimenter was seated
on the opposite side of the table and was partially 
shielded from the subject by the apparatus. The sub- 
ject was instructed that the red light was a signal 
to “get ready” to lift his finger when the buzzer 
sounded. On each trial the warning light came on for 
.5 second and then went off for the remainder of the
warning interval. The subject received four practice 
trials during which the instructions were clarified, if 
necessary. The warning intervals during the practice 
trials were 2, 6, 12, and 18 seconds in length. If the 
subject responded prior to the onset of the buzzer, 
he was cautioned not to respond to the onset or 
offset of the red light, but to the buzzer only. The 
defective subjects were told that if they “did well” 
they would receive $50 when they were finished. 
The normal subjects were not offered any special in- 
centive, as the task was viewed as intrinsically re- 
warding for this group. 

Each subject received 48 trials, presented in four 
blocks of 12 trials. Each trial block was defined by
a mean warning interval of 2, 6, 12, or 18 seconds.
Within each trial block there were three different 
warning intervals: the mean for that trial block, the 
mean plus 1 second, and the mean minus 1 second. 
Each warning interval was used four times in a trial
block. Each of the first three trials of a block had
different warning intervals, but warning intervals
were randomly assigned for the last nine trials.

The interval between trials was approximately 7 
seconds. During this period the experimenter recorded 
the RT for the preceding trial and “set” the warning 
interval for the next trial. The beginning of a trial 
was signaled by the experimenter instructing the sub- 
ject to depress the response key. A rest period of 
30-seconds duration was interpolated between trial 
blocks. 

The order in which the trial blocks with different 
mean warning intervals were presented varied for the 
different subgroups of retarded and normal subjects.
Four different orders were used. The different orders
had the trial blocks with different mean warning in-
tervals in the sequences 2-6-12-18, 6-12-18-2, 12-18-
2-6, and 18-2-6-12.
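
The four block orders form a cyclic Latin square, and each block holds twelve trials built from three warning intervals. The sketch below illustrates that structure under stated assumptions: the function names `block_orders` and `block_trials` are hypothetical, and shuffling the first three trials is an assumption, since the text says only that those trials had different warning intervals.

```python
import random

MEAN_INTERVALS = [2, 6, 12, 18]  # mean warning intervals, in seconds


def block_orders(means):
    """Cyclic Latin square: row k is the list rotated left by k positions."""
    return [means[k:] + means[:k] for k in range(len(means))]


def block_trials(mean, rng):
    """Twelve warning intervals for one block.

    Each of mean - 1, mean, and mean + 1 occurs four times. The first
    three trials use each interval once; the remaining nine trials
    (three of each interval) are placed in random order.
    """
    intervals = [mean - 1, mean, mean + 1]
    first_three = intervals[:]
    rng.shuffle(first_three)
    last_nine = intervals * 3
    rng.shuffle(last_nine)
    return first_three + last_nine


orders = block_orders(MEAN_INTERVALS)
rng = random.Random(0)  # fixed seed so the illustration is repeatable
schedule = [block_trials(mean, rng) for mean in orders[0]]
```

Each subgroup would follow one row of `orders`, so that every mean warning interval occupies every ordinal position once across the four subgroups.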


RESULTS 


The measure used in the analysis of the data 
was the mean RT for the last nine trials of each 
trial block. The first three trials of each trial 
block were not included in the determination of 
the scores, since the RTs on these trials may 
have been affected by the length of the warning 
intervals in the preceding trial block. Table 2 
gives the means and standard deviations of the 
scores for the different mean warning intervals
for the normal and defective groups.

The experimental design was a Latin square
replicated at two levels, normals and defectives
(Lindquist, 1953). Within each Latin square
mean warning interval duration was one variable
and position in the order of trial blocks was the
second variable. Table 3 summarizes an analysis
of variance of the data.

None of the interaction effects of the analysis 


TABLE 3
SUMMARY OF ANALYSIS OF VARIANCE OF RT DATA

Source                              df       MS        F
Between subjects                    47
    Group (G) (retarded-normal)      1     181.74     6.24*
    WI × P (between)                 3      35.98     1.23
    WI × P × G (between)             3      78.78     2.70
    Error (b)                       40      29.14
Within subjects                    144
    Warning interval (WI)            3      57.97    14.53
    Order position (P)               3       1.14     0.28
    WI × P (within)                  6       4.32     1.08
    WI × G                           3       5.39     1.35
    P × G                            3       5.38     1.35
    WI × P × G (within)              6       1.64     0.41
    Error (w)                      120       3.99
Total                              191

* p < .025.
ab SO, 
reached significance (.05 level). However, both
the group and warning interval main effects were
significant, while the position in the order of trial
blocks main effect was not. An inspection of
Table 2 indicates that the significant group main
effect was due to the normal subjects having con-
sistently faster RTs than the retarded subjects.
The data of Table 2 also suggest that the signifi-
cant warning interval main effect was due to an
increase in RT as the warning interval increased
in length.
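
The two significant F ratios can be recovered from the entries of Table 3. The sketch below assumes that the tabled values are mean squares, which is an inference from the fact that each printed F equals the effect entry divided by the corresponding error entry. The between-subjects error term is used for the group effect and the within-subjects error term for the warning interval effect.

```python
ERROR_BETWEEN = 29.14  # Error (b), 40 df
ERROR_WITHIN = 3.99    # Error (w), 120 df


def f_ratio(ms_effect, ms_error):
    """F ratio as the effect mean square over the appropriate error mean square."""
    return ms_effect / ms_error


group_f = f_ratio(181.74, ERROR_BETWEEN)    # Group (G), 1 df
interval_f = f_ratio(57.97, ERROR_WITHIN)   # Warning interval (WI), 3 df
print(round(group_f, 2), round(interval_f, 2))  # prints 6.24 14.53
```

Both values match the F column of Table 3.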

Figure 1 gives the mean RT for each of the 
three different warning intervals within each trial 
block for both retarded and normal subjects. 
Within the trial block with a mean warning in- 
terval of 2 seconds RT is fastest when the warn- 
ing interval is 3 seconds long and slowest when 
the warning interval is 1 second long. Within this 
trial block the difference in RT as a function of 
warning interval appears to be quite large.
However, as the mean warning interval length
increases, the difference between the warning
intervals within the trial blocks becomes
considerably less.




DISCUSSION 


It is apparent from Table 2 and from the
significant warning interval main effect that RT
tends to become increasingly slow as the warning
interval is lengthened beyond some optimum
period. This finding suggests that the results
reported by Terrell and Ellis (1964) were
affected by the ordering of the different warning
intervals in the experiment. The random order
of the intervals apparently caused short warning
intervals to follow long warning intervals
and long intervals to follow short intervals
on a part of the trials. Since both Klemmer
and Karlin (1959) have reported that
the former condition results in slow RTs and
the latter results in fast RTs, the decrease in
RT as the warning interval was lengthened, as
reported by Terrell and Ellis, is “accounted for.”
However, in the Terrell and Ellis study only
the retarded subjects evidenced the decrease in
RT as the warning interval was lengthened.
The RTs of the normal subjects were not
affected by the duration of the warning interval.
This suggests, in the light of the present results,
that the RT performance of mental defectives
is affected more by the length of the warning
interval on the preceding trial than is the RT
performance of normals. This leads to one of
two interpretations in relation to the s_t theory:
Either the mentally defective subjects of Terrell
and Ellis had more intense “memories” of the
FIG. 1. Graph of the mean RTs for the different
warning intervals within the different trial blocks.
(Each series of three connected points represents a
trial block. The points are the mean RTs for the
different intervals within a trial block.)


warning interval on the immediately preceding 
trial than did the normal subjects, or else the 
defective subjects had less intense “memories” 
of the warning intervals prior to the warning 
interval of the immediately preceding trial. The 
former interpretation is not consonant with the 
s_t notion, while the latter is in line with the
theory. 

In considering the foregoing discussion it 
should be recalled that the major finding reported 
by Terrell and Ellis, that the on-on condition 
produced faster RTs than the on-off condition in 
defective subjects and not in normal subjects, 
was predicted preexperimentally from the s_t
theory and, therefore, constitutes supporting evi- 
dence for the validity of the notion. However, 
alternate explanations of the finding may be 
available. 

The effect demonstrated in Figure 1 also may 
be attributed to the phenomenon reported by 
Klemmer and Karlin. As the different warning 
intervals within each trial block were ordered 
randomly, RT was slowest at the shortest warn- 
ing interval and fastest at the longest warning 
interval within the different trial blocks. When 
the mean warning interval was short, the relative 
discrepancy between the lengths of the different 
warning intervals within a trial block was greater 
than when the mean warning interval was long. 
This explains why the difference in RT between 
the warning intervals within the trial blocks
disappeared as the mean warning interval increased
in length.

The finding that normal subjects have faster 
RTs than retarded subjects has been reported 
by several prior experimenters (e.g., see Ellis 
& Sloan, 1957; Terrell & Ellis, 1964) and, 
therefore, provides no new information. 
EFFECTS OF SUCCESS AND FAILURE ON LEVEL OF
ASPIRATION AS RELATED TO ACHIEVEMENT
MOTIVES ¹


ROBERT W. MOULTON 
Department of Education, University of California, Berkeley 


Atkinson’s risk-taking model predicts that individuals high in fear of failure 
and low in need for achievement may react in an atypical manner to success 
or failure experiences; i.e., they may raise their level of aspiration following
failure and lower it after success. An experimental situation was designed to
test these predictions and confirmation of this aspect of the model was ob- 
tained. Effects of instructions intended to reduce the degree to which these 
Ss subjectively underestimate probability of success were also examined. Results 
tentatively suggest that such instructions increase the proportion of low as 
contrasted to high levels of aspiration in this group. 


In studies of level of aspiration behavior, 
it has been observed that the most typical 
reaction to a success experience is a moderate 
rise in level of aspiration and that the usual 
reaction to failure is a moderate drop in level 
of aspiration. However, a minority of sub- 
jects, on occasion, react to success or failure 
in an apparently paradoxical manner. That 
is, they respond to failure by raising level of 
aspiration and to success by lowering it. A 
model of risk-taking recently developed by
Atkinson (1957), which is a modified form of 
the resultant valence conception (Lewin, 
Dembo, Festinger, & Sears, 1944), clearly 
implies that reactions to success and failure, 
whether they are typical or atypical, are 
predictable from a knowledge of individual
differences in the relative strength of motives
to achieve success (Ms) and to avoid failure
(Maf). The study being reported is an
attempt to provide a test of these implications.

In Atkinson’s model, the attractiveness of
a particular task is a resultant of approach
and avoidance “tendencies.” The strength of
the approach tendency is assumed to equal:
Motive × Expectancy × Incentive. Motive
refers to a general disposition to strive for
success (Ms) when confronted with an
achievement situation. Expectancy is defined
as the subjective probability of success (Ps) for
the task in question, and the incentive value
of the task is assumed to equal 1 - Ps. The

¹ This study was supported by a research grant
from the Department of Education, University of
California, Berkeley.


strength of the avoidance tendency is also
expressed as: Motive × Expectancy × Incentive.
In this case, motive refers to a disposition
to avoid failure (Maf). If expectancy is
again expressed as Ps, incentive is defined as
the minus value of Ps. With this series of
assumptions it can be shown that maximum
motivation, either to approach or to avoid, is
aroused by tasks which have a Ps of .50
(Atkinson, 1957).² When individuals whose
motive to approach success exceeds their
motive to avoid failure (approach oriented) are
presented with a set of achievement tasks
which vary in subjective probability of
success, they will tend to choose the task which
has a subjective probability closest to .50. On
the other hand, individuals for whom the
avoidance motive is preponderant (avoidance
oriented) will seek to avoid achievement tasks
because all such tasks tend to arouse
avoidance tendencies in these individuals. However,
if forced to make a choice from among a set
of achievement tasks, avoidance-oriented
individuals will tend to select tasks for which
Ps is as far from .50 as possible, thus
selecting tasks from either the easy or the difficult
range. This aspect of the model has been
confirmed by previous studies (Atkinson,
Bastian, Earl, & Litwin, 1960; Atkinson &
Litwin, 1960; McClelland, 1958; Mahone,
1960).
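
The claim that motivation peaks at a Ps of .50 can be illustrated numerically. The following is a minimal sketch, assuming the usual reading of the model in which the expectancy of failure is Pf = 1 - Ps (consistent with the statement that Ps + Pf equals 1.00) and the incentive value of failure is -Ps. The function names and the motive strengths used in the example are hypothetical.

```python
def approach_tendency(ms, ps):
    """Motive x Expectancy x Incentive, with incentive of success = 1 - Ps."""
    return ms * ps * (1 - ps)


def avoidance_tendency(maf, ps):
    """Motive x Expectancy x Incentive for failure.

    Assumes expectancy of failure Pf = 1 - Ps and incentive of failure = -Ps,
    so the product is negative (an inhibitory tendency).
    """
    pf = 1 - ps
    return maf * pf * (-ps)


def resultant_tendency(ms, maf, ps):
    """Sum of approach and avoidance tendencies for one task."""
    return approach_tendency(ms, ps) + avoidance_tendency(maf, ps)


def preferred_tasks(ms, maf, ps_values):
    """Return the Ps values with the highest resultant tendency."""
    scores = {ps: resultant_tendency(ms, maf, ps) for ps in ps_values}
    best = max(scores.values())
    return [ps for ps, score in scores.items() if score == best]


if __name__ == "__main__":
    tasks = [0.25, 0.50, 0.75]
    print("approach oriented (Ms=2, Maf=1):", preferred_tasks(2, 1, tasks))
    print("avoidance oriented (Ms=1, Maf=2):", preferred_tasks(1, 2, tasks))
```

With these assumptions the resultant reduces to (Ms - Maf) × Ps × (1 - Ps), so an approach-oriented individual is most attracted to the task at .50, while for an avoidance-oriented individual the task at .50 is the most aversive and the easy and difficult tasks are equally preferred.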
Success or failure on a task may change its 


² The incentive value of failure can also be defined
as 1 - Pf, where Pf is the subjective probability of
failure. Ps + Pf is assumed to be equal to 1.00.
attractiveness because subjective probability
of success for the task is thereby altered. 
Thus, if the individual were to succeed on a 
task he initially perceived to be very difficult, 
Ps should increase and move toward .50. If
he were to fail a task initially perceived to be 
easy, Ps for the task should decrease and
move closer to .50. In both instances, the task 
should become relatively more attractive for 
approach-oriented individuals and should 
evoke a stronger avoidance tendency in 
avoidance-oriented individuals. On the other 
hand, success on a task initially perceived 
to be easy should widen the gap between Ps 
for the task and the .50 value. Failure on a 
task initially perceived to be difficult will 
have the same effect. In these latter instances, 
the task should become less attractive to the 
approach-oriented subject and should now 
evoke a relatively weaker avoidance tendency 
in the avoidance-oriented subject. It can be 
shown with this series of assumptions that 
the model predicts differential persistence on 
“easy” and “difficult” tasks in the face of 
failure or success for approach-oriented versus 
avoidance-oriented individuals. Feather (1961, 
1963a) has been able to confirm several of 
these implications. The study being reported 
was designed to provide a test of the model 
as it applies to the effects of success and 
failure on selection of tasks of varying levels 
of difficulty. 

In the study being reported, the subject
was presented with three tasks presumably
varying widely in level of difficulty: one was
easy (Ps = .75), one was difficult (Ps = .25),
and one was of intermediate difficulty (Ps
= .50). He was asked to perform the task of
intermediate difficulty. If he succeeded, he
would then be expected to perceive these
tasks as less difficult for him than he had
originally thought. That is, Ps for both
remaining tasks should increase and,
consequently, Ps of the difficult task should now
be closer to .50 than Ps of the easy task.
Analogously, if the subject were to fail the
task of intermediate difficulty, the resultant
Ps for the easy task should be closer to .50
than Ps for the difficult task. Following either
success or failure on the task of intermediate
difficulty, the subject was asked to choose
which one of the two remaining tasks he
would like to perform. This choice should be
predictable from a knowledge of the relative
strengths of Ms and Maf. Consider first the
case of the approach-oriented (Ms > Maf)
individual. Since he tends to prefer tasks with
a Ps close to .50, he should select the difficult
task after success and the easy task after
failure. These choices would be equivalent to
raising of level of aspiration after success
and lowering of level of aspiration after
failure (the typical shift). What kinds of
choices are to be expected after success and
failure for the avoidance-oriented (Maf > Ms)
subject? If, as assumed, he attempts to avoid
tasks with a Ps of .50, he may select the
difficult task following failure and the easy
task following success, thus raising level of
aspiration following failure and lowering it
after success (the atypical shift). The
prediction was that avoidance-oriented subjects in
the present study would show the atypical
shift in level of aspiration more frequently
than would the approach-oriented subjects.
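
The reasoning behind the typical and atypical shifts can be traced step by step. The sketch below is only an illustration: it assumes that success raises, and failure lowers, Ps for both remaining tasks by the same amount (0.10 here, a hypothetical figure that the report does not give), and that approach-oriented subjects pick the task nearest .50 while avoidance-oriented subjects pick the task farthest from it.

```python
INITIAL_PS = {"easy": 0.75, "difficult": 0.25}


def predicted_choice(orientation, outcome, shift=0.10):
    """Predict the task chosen after success or failure on the intermediate task.

    orientation: "approach" (Ms > Maf) or "avoidance" (Maf > Ms)
    outcome:     "success" or "failure"
    shift:       hypothetical change in Ps produced by the outcome
    """
    delta = shift if outcome == "success" else -shift
    revised = {task: ps + delta for task, ps in INITIAL_PS.items()}
    distance = {task: abs(ps - 0.50) for task, ps in revised.items()}
    if orientation == "approach":
        return min(distance, key=distance.get)   # task nearest .50
    return max(distance, key=distance.get)       # task farthest from .50


def shift_type(outcome, choice):
    """Typical: harder task after success, easier task after failure."""
    typical = (outcome == "success" and choice == "difficult") or (
        outcome == "failure" and choice == "easy"
    )
    return "typical" if typical else "atypical"


if __name__ == "__main__":
    for orientation in ("approach", "avoidance"):
        for outcome in ("success", "failure"):
            choice = predicted_choice(orientation, outcome)
            print(orientation, outcome, "->", choice, f"({shift_type(outcome, choice)} shift)")
```

Any positive shift smaller than .25 gives the same pattern, so the particular value of 0.10 does not drive the prediction.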

A further hypothesis was tested involving
the group of subjects who do not show a clear
preponderance of Ms or Maf (ambivalent
group). If motives which presumably
determine the nature of the choice after success
or failure are of roughly equal strength, then
these individuals might be expected to show
evidence of conflict with respect to the choice.
A check on this possibility was made by 
looking for evidence of indecision about the 
choice of task following success or failure 
among these subjects. 

Within the design of the study being
reported it was possible to gather data on level
of aspiration prior to success or failure on
the tasks in question. Thus, it was feasible
to obtain data which might contribute to
further examination of several other issues
which have arisen in research aimed at
validating the Atkinson model. Atkinson et al.
(1960) report a tendency for the avoidance-
oriented subjects to avoid intermediate risks
by selecting the difficult alternatives with a
higher frequency than the easy alternatives.
Evidence is provided that the task used in
that study was, inadvertently, so difficult that
the easiest possible alternatives had a Ps not
far removed from .50. Thus, in effect,
easy alternatives were not available, and
avoidance-oriented subjects utilized the only
remaining defensive alternatives: the difficult
range.³ It would be expected that if the
range of possible aspirations is symmetrical, 
so that the subject could move equally as far 
from a Ps of .50 by choosing an easy task
as he could by choosing a difficult one, a 
roughly equal proportion of avoidance- 
oriented subjects would choose the easy and 
the difficult tasks. An attempt was made to 
provide a test of this assumption in the study 
being reported. 

The issue of initial task preferences is 
complicated by yet another factor. Atkinson 
(1957) has argued that subjective probability 
of success for the avoidance-oriented individ- 
ual tends to be a systematic underestimate of 
“objective” probability of success. Feather 
(1963b) offered supportive experimental evi- 
dence for this assumption. The implication is 
that, even given a range of tasks which are 
“objectively” symmetrical with respect to 
probability of success, one could predict a 
preference for the difficult range on the part 
of the avoidance-oriented subjects. This pre- 
diction follows because the tendency of the 
avoidance-oriented subjects to bias Ps down- 
ward would inevitably reduce the range of 
easy tasks and expand the range of difficult 
tasks. In the present study, an effort was
made to minimize this factor by attempting
to convince the subjects that Ps for each of
three experimental tasks (symmetrically
distributed with Ps = .25, Ps = .50, Ps = .75)
could be clearly and accurately estimated in
advance. To the extent that this technique is
effective in reducing bias, the easy and
difficult alternatives should be equally desirable
to the avoidance-oriented subjects as ways of
avoiding the intermediate task on the initial
choice, and thus a roughly equal proportion of
these choices (easy versus difficult) should be
expected within the avoidance-oriented group.

³ Another possible explanation suggested by Atkinson,
Bastian, Earl, and Litwin (1960) for the relative
preference for the difficult range shown by the
avoidance-oriented subjects is that such preferences
are more “socially desirable” than choices in the
easy range. The issue of “social desirability” as it
relates to this problem as well as the manner in
which it may affect the “atypical shift” will be dealt
with in the Discussion section of this paper.

The initial choices of members of the
ambivalent group were also examined, though no
predictions were possible from the risk-taking 
model under investigation. However, the be- 
havior of this group should provide some 
preliminary ideas about the kinds of choices 
that are likely to occur when other than 
achievement-related motives, as defined in the 
present study, are presumably determining 
the choice. 

At this point the overall design of the 
study being reported will be reviewed briefly 
in order that the rationale for, and implica- 
tions of, several aspects of the procedure can 
be presented. Of particular importance is the 
manner in which success and failure induc- 
tions were assigned. Subjects were presented 
with three tasks and informed that the tasks 
were easy, difficult, and of intermediate dif- 
ficulty and were asked to indicate on which 
of the tasks they would prefer to work.
Irrespective of his initial choice, each subject
was then required to work on the task of
intermediate difficulty.⁴

When the subjects had completed their 
work on the task of intermediate difficulty, 
those who had initially chosen the easy task 
were told that they had failed; those who 
chose the difficult task were told that they 
had succeeded. Half of those subjects who 
chose the intermediate task were told that 
they had succeeded and half, that they had 
failed; with the first of these subjects told 
that he had succeeded, the second that he 
had failed, and so on. The particular assign- 
ments of success and failure inductions were 
adopted in order to be sure that the atypical 
change in level of aspiration always involved 
a shift from the task originally chosen. Thus, 
the atypical shift could not be explained 
simply as a perseveration on the initial choice. 
This procedure may have reduced the number 
of instances in which shifts in level of aspira- 
tion appeared atypical. This assumption is 
based on the reasoning that for subjects who 
initially chose the easy or difficult task and 
were subsequently required to work on the 

⁴ The decision to ask each subject to work on the
intermediate task first, regardless of his initial choice,
was made on the basis of pretest data which
suggested that Ps could be more easily manipulated in
a manner clear to the subject if he worked on this
task first. Subjects seemed to accept easily the
explanation offered for this procedure.
task of intermediate difficulty there may have
been generated a task tension (that is, the 
Zeigarnik effect) which would not be dis- 
charged by working on the task of intermedi- 
ate difficulty. It could thus be argued that 
any differences obtained between approach 
and avoidance groups with respect to the fre- 
quency of atypical shifts might be explained 
in terms of variations in strength of tenden- 
cies to persist in the direction of the initial 
choice. This problem can be dealt with by 
examining the pattern of typical and atypical 
shifts among subjects who initially chose the 
task of intermediate difficulty. In these in- 
stances there can be no argument that choice 
of task following success or failure is influ- 
enced by a tendency to repeat the initial 
choice. If clear differences in frequency of 
atypical shifts between approach and avoid- 
ance groups occur among those subjects who 
initially chose the task of intermediate dif- 
ficulty, then it would seem quite plausible to 
interpret the differences in terms of the 
Atkinson model. 


PROCEDURE 


Subjects were 93 male students drawn from the 
junior and senior classes of a suburban high school. 
They were required to participate in the study. As 
in previous studies, the need Achievement (n Ach) 
measure developed by McClelland, Atkinson, Clark, 
and Lowell (1953) and the high-school form 
(Mandler & Cowen, 1958) of the Mandler-Sarason
(1952) Test Anxiety Questionnaire (TAQ)
provided measures of motive to achieve success (Ms)
and motive to avoid failure (Maf). A picture form
of the n Ach measure was administered under 
standard neutral conditions (McClelland et al., 1953) 
with pictures presented in the following order (using 
numbers assigned by Atkinson, 1958, p. 832): 
2, 8, 1, 7. The n Ach protocols were scored by a 
coder who had acquired the requisite level of skill 
through work with the practice materials presented 
in Atkinson (1958). These scores correlated .88 with 
those of a second expert coder who also scored the 
protocols.

Subjects were ranked on both measures with the 
highest scores given a rank of 1, and so on. Differ- 
ences between ranks were computed by subtracting, 
algebraically, the subject’s rank score on n Ach from 
his rank score on TAQ. For a given subject, if the 
result of this operation was a negative number, he 
stood relatively higher in the TAQ distribution than 
in the n Ach distribution. If the result was positive, 
he ranked relatively higher in n Ach. The
distribution of rank-difference scores was divided into thirds.
The 31 subjects with the highest positive rank-
difference scores constituted the approach-oriented
group (Ms > Maf). The 31 subjects with the highest
negative rank-difference scores constituted the
avoidance-oriented group (Maf > Ms). For the
remaining 31 subjects, it was assumed that Ms and
Maf were roughly equivalent in strength; these
subjects constituted the ambivalent group. Results of
these measures were not known by the experimenter
during the remainder of the study.
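
The rank-difference classification described above can be sketched as follows. This is an illustration under stated assumptions, not the scoring routine actually used: the report does not say how tied scores were ranked, so ties here are broken by order of appearance, and the function names and example scores are hypothetical.

```python
def rank_descending(scores):
    """Rank scores so that the highest score receives rank 1.

    Ties are broken by position in the list (an assumption; the
    original report does not describe its handling of ties).
    """
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0] * len(scores)
    for rank, index in enumerate(order, start=1):
        ranks[index] = rank
    return ranks


def classify_subjects(n_ach_scores, taq_scores):
    """Split subjects into thirds by TAQ rank minus n Ach rank.

    A positive difference means the subject stands relatively higher
    on n Ach; a negative difference, relatively higher on TAQ.
    """
    n = len(n_ach_scores)
    if n != len(taq_scores) or n < 3:
        raise ValueError("need paired scores for at least three subjects")
    n_ach_ranks = rank_descending(n_ach_scores)
    taq_ranks = rank_descending(taq_scores)
    differences = [taq - ach for ach, taq in zip(n_ach_ranks, taq_ranks)]
    order = sorted(range(n), key=lambda i: differences[i], reverse=True)
    third = n // 3
    groups = ["ambivalent"] * n
    for index in order[:third]:
        groups[index] = "approach"      # highest positive differences
    for index in order[-third:]:
        groups[index] = "avoidance"     # highest negative differences
    return groups


if __name__ == "__main__":
    n_ach = [10, 9, 8, 3, 2, 1]   # made-up scores for six subjects
    taq = [1, 2, 8, 9, 3, 10]
    print(classify_subjects(n_ach, taq))
```

With 93 subjects the same procedure yields three groups of 31, as in the study.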

Following the group administration of the
motivation measures, subjects were seen individually and
given the following general instructions:


I am studying certain aspects of ability in high 
school students. Here in this file [the experimenter 
pointed to a box containing a large number of file 
folders] I have a number of tasks. As you know, 
if you have less ability, a particular task will be 
harder for you, and if you have more ability, it 
will be easier. We already have some information
about your abilities from the school files, but we 
need a little more in order to tell how hard these 
tasks will be for you. I want to ask you a few 
questions and then give you a short test which
will help me to assess your ability. 


The experimenter then asked the subject about his 
general grade-point average and about his average
in math and in science. The experimenter then
administered a short test comparable to the Simi- 
larities subtest of the WISC and, after “scoring” 
this test, stated: 


Now with the other information and your score
on this test you just took I can judge your ability
to do these tasks very accurately. All of these
tasks are of the same type; the only difference is
that they differ in how difficult they are. Let’s see
now [the experimenter selected three file folders
and placed them in front of the subject], for
students of your ability, there are three chances
in four that you will do well on this easy task.
You therefore have a 75% chance on this one.
This second task is of medium difficulty; you
have one chance in two of succeeding, that is,
you have a 50% chance of succeeding. This third task
is difficult; there is one chance in four for students
of your ability to succeed, that is, you have a 25%
chance of success. Remember [pointing] this is
the easy one, this is the one of medium difficulty,
and here is the hard one. Which one of these
three would you like to work on; the one with
three chances of four of succeeding, the one with
one chance out of two, or this one with one chance
out of four of succeeding? Which one would you
prefer to do?

After the subject made his choice, the experimenter
said:

You chose the [easy, medium, difficult] task.
Well, we have found that we can learn most about
students’ abilities if we start everyone on the task
of medium difficulty, the one on which you have
a 50-50 chance of success. All right? Here is the
idea. The letters of these words have been mixed
up. Your task is to rearrange the letters to form
the name of a fruit, vegetable, color, or type of
cloth. Rearrange as many words as you can as fast
as you can. Do not spend too long on any one 
word. Do you understand? You may start now. 


The experimenter made a show of timing the work
and, after telling the subject to stop, he “scored”
the tests and announced the results. All subjects who
had chosen the easy task and alternate subjects
who had chosen the intermediate task were told at
this point (failure instructions):


Well, you didn’t do as well as we predicted you 
would do. I guess that you did not use the best 
strategy. You're not as good at these tasks as we 
expected from the previous information. 


All subjects who had chosen the difficult task 
and alternate subjects who had chosen the intermediate
task were told at this point (success instructions):


You did very nicely. You must have used the
best strategy. You're better at these tasks than 
we expected from the previous information. 


Following success or failure instructions, the
subjects who had originally chosen the easy or difficult
task were told: 


Originally you chose the difficult [easy] task, 
you have worked on the medium difficulty task,
and you did [did not do] well. Now I am going to
ask you to work on one more of the remaining
two tasks, either the easy or difficult task.
Although you originally chose the difficult [easy]
task, feel free to change your mind if you wish. 
This time you will definitely work on the one you 
choose. So, on which one of the two would you 
like to work? 


Subjects who had originally chosen the task of
intermediate difficulty were told at this point: 


Originally you chose the medium difficulty task 
and you worked on that one and did [did not do] 
well. Now I’m going to ask you to work on one 
more of the remaining two tasks, either the easy
or difficult task. Feel free to choose either one. On 
which one of the two would you like to work? 


After the choice was made, each subject was asked
to indicate what he thought were the new
probabilities of success for the remaining tasks. He was given
a diagram numbered from 0.00 to 1.00 and
asked to indicate the points on the diagram which
represented the new Ps level.⁵ He was then asked
the following:


When you chose the easy [difficult] task to do,
did you have any trouble making the
choice? Were you at all tempted to choose the
difficult [easy] task?


⁵ As expected, success always resulted in an increase
in Ps for both remaining tasks, and failure
resulted in a decrease in Ps for both remaining tasks.
The subject then performed another scrambled
word task; it was “scored”; he was told he had 
done reasonably well on the last task and asked if 
he had any questions about the study. He was also 
given specific instructions not to discuss the study 
with other students, and our inquiries indicated that 
this injunction was obeyed. 


RESULTS 


The major hypothesis of the study involved 
the prediction that atypical shifts in level of 
aspiration following success or failure occur 
more frequently among subjects whose moti- 
vation to avoid failure exceeds motivation to 
approach success than among subjects who 
show the opposite motive pattern. From 
results presented in Table 1, it can be seen 
that 15 subjects made the atypical shift and 
that this shift occurred more frequently 
among subjects whose resultant motivation 
was avoidance oriented. The difference be- 
tween the approach and avoidance group was 
clearly significant (p < .01). It is also evident 
that the atypical shift is relatively rare 
among subjects for whom there is not a clear 
preponderance of approach or avoidance moti- 
vation. In fact, the ambivalent group differed 
significantly from the avoidance group in this 
respect (p < .05). 
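
The group comparisons reported for Table 1 can be recomputed from the cell frequencies. The report does not name the formula it used; the sketch below assumes a 2 × 2 chi-square with Yates's correction for continuity, which happens to reproduce the values printed with Table 1 (8.37 and 4.52). The function name is illustrative.

```python
def chi_square_2x2(a, b, c, d, yates=True):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]].

    With yates=True the continuity correction (subtracting N/2 from
    |ad - bc|) is applied; this choice is an assumption that matches
    the values printed with Table 1.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2, 0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


if __name__ == "__main__":
    # Rows are groups; columns are atypical and typical shifts (Table 1).
    print("avoidance vs. approach:  ", round(chi_square_2x2(11, 20, 1, 30), 2))
    print("avoidance vs. ambivalent:", round(chi_square_2x2(11, 20, 3, 28), 2))
```

Both statistics have one degree of freedom, so values above 3.84 and 6.63 correspond to the .05 and .01 levels respectively.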

TABLE 1
TYPE OF SHIFT IN LEVEL OF ASPIRATION AS
RELATED TO RESULTANT MOTIVATION

                            Type of shift
Resultant motivation     Atypical   Typical     N
Avoidance oriented          11         20       31
Ambivalent                   3         28       31
Approach oriented            1         30       31
Total                       15         78       93

Avoidance oriented versus approach oriented, χ² = 8.37**
Avoidance oriented versus ambivalent, χ² = 4.52*

* p < .05.
** p < .01.


Table 2 presents data relevant to the
occurrence of the atypical shift as related to the
task initially chosen. It can be seen that 6 of
14 subjects in the avoidance-oriented group
who initially chose the intermediate task
made the atypical shift following success or
failure while none of the corresponding 23
subjects in the approach-oriented group made
the atypical shift. The difference is clearly
significant (χ² = 8.82, p < .01). Note also
(Table 2) that among subjects in the
avoidance-oriented group the frequency of
atypical shifts following an initial choice of
the intermediate difficulty task was 43% and
among those who chose the easy or difficult
task the frequency of atypical shifts was 29%.


TABLE 2
INITIAL TASK PREFERENCES AND SUBSEQUENT TYPICAL
VERSUS ATYPICAL SHIFTS AS RELATED
TO RESULTANT MOTIVATION

                     Difficulty level of task initially chosen
                     Intermediate     Easy     Difficult     N
Approach oriented       23 (0)        1 (1)      7 (0)       31
Avoidance oriented      14 (6)        9 (3)      8 (2)       31
Ambivalent              17 (2)        0 (0)     14 (1)       31
Total                   54           10         29           93

Approach oriented versus avoidance oriented;
choice of intermediate difficulty task versus
choice of either the easy or difficult
task, χ² = 5.43*

Note. Numbers in parentheses refer to the frequency of
atypical shifts following a particular initial choice.

* p < .02.


Each of the motive measures considered
independently was significantly related in the
expected direction to the occurrence of the
atypical shift (see Table 3). Thus, 13 of 15
subjects who made the atypical shift were
above the median on TAQ (p < .01), and 12
of 15 below the median on n Ach (p < .05).


TABLE 3
TYPE OF SHIFT IN LEVEL OF ASPIRATION AS RELATED
TO TEST ANXIETY AND n ACHIEVEMENT

                     Type of shift
                  Atypical   Typical     N
Test anxiety
  High               13         34       47
  Low                 2         44       46
  Total              15         78       93
                  χ² = 7.70**
n Achievement
  High                3         44       47
  Low                12         34       46
  Total              15         78       93
                  χ² = 5.30*

* p < .05.
** p < .01.


It was predicted that subjects in the
ambivalent group would show evidence of conflict
with respect to the postsuccess or postfailure
choices. These results are presented in Table
4. Of the 24 subjects who reported conflicted
choices, a majority and a statistically
significant proportion fell in the ambivalent group
(p < .05).


TABLE 4
CONFLICTED SHIFTS IN LEVEL OF ASPIRATION
AS RELATED TO RESULTANT MOTIVATION

                          Shifts in level of aspiration
Resultant motivation     Conflicted   Not conflicted     N
Avoidance oriented            7             24           31
Ambivalent                   13             18           31
Approach oriented             4             27           31
Total                        24             69           93

Ambivalent group versus approach and avoidance
groups combined, χ² = 5.12*

* p < .05.


Data on the initial choices made before
the success or failure experiences are also
presented in Table 2. Members of the
avoidance-oriented group showed a relative
preference for the easy and difficult task in
contrast to the approach-oriented subjects
who tended to select the task of intermediate
difficulty. This difference was generated
primarily by the fact that the avoidance-
oriented subjects selected the easy task with
greater frequency than did the approach-
oriented group. Subjects in the ambivalent
group also avoided the easy task, but they
exceeded both of the other groups with
respect to the frequency with which they chose
the difficult task in preference to the other
two (see Table 2).


DISCUSSION


As predicted by Atkinson’s model, the
atypical shift in level of aspiration occurred
more frequently among the avoidance-
oriented than among the approach-oriented
subjects. The fact that this relationship
retains significance among just those subjects
who initially chose the task of intermediate 
difficulty would seem to make it clear that 
the results cannot be interpreted in terms of 
differences between the two motive groups 
in strength of tendency to persist with re- 
spect to their initially chosen task; an issue 
clearly raised by the nature of the experi- 
mental design. The atypical shift was also 
relatively infrequent among members of the 
ambivalent group although, as predicted, 
these subjects did show evidence of greater 
frequency of conflict or indecision with re- 
spect to the direction of change in level of 
aspiration. This group of findings represents 
the first clear confirmation of the effects of 
success and failure on shifts in level of 
aspiration implicit in the model. 

The relatively low frequency of atypical 
shifts (36%) among subjects in the 
avoidance-oriented group, for whom this be- 
havior was specifically predicted, requires 
consideration. As indicated earlier the nature 
of the design was such that subjects who 
initially chose the easy or difficult task were 
required to change from the task initially 
chosen if they were to make the atypical shift 
following success or failure. It is possible 
that this requirement may have tended to 
lower the proportion of atypical shifts among 
these subjects. This assumption is tentatively 
supported by the fact that within the 
avoidance-oriented group there was a somewhat
greater (though not statistically significant)
proportion of atypical shifts following
an initial choice of the intermediate task than 
following an initial choice of either easy or 
difficult. However, even for those subjects in
the avoidance-oriented group who initially
chose the intermediate task, a minority (43%)
show the atypical shift. One possible explana- 
tion of why even these subjects failed to show 
a high proportion of atypical shifts was sug- 
gested by the spontaneous remarks made by 
some subjects who, when asked about the
choice they had made, expressed the idea 
that it would not have been “sensible” or 
would have been “weird” to pick an easy task 
after success or a difficult task after failure, 
even though they had been inclined to do so. 
Thus, the atypical shift may have low “social
desirability” and, even among the avoidance- 
oriented subjects who initially chose the task 
of intermediate difficulty, those with strong 
needs to act in a “socially desirable” manner 
will not show this behavior. 

It is of interest to note in this connection 
that subjects in the ambivalent group, for 
whom Ms and Maf are presumed to be of
roughly equal strength, were shown to make 
the atypical shift less frequently than did 
avoidance-oriented subjects. In the ambi- 
valent group, it could be argued that motives 
other than Ms or Maf must have determined
the nature of responses to success and failure, 
and it seems plausible to assume that when 
achievement-related motives were not the de- 
termining factors, the choices were influenced 
by the greater social desirability of the typical 
shift. An implication of this argument is that
if the apparent “social desirability” of the 
atypical shift could be increased, the ambi- 
valent group might show a significantly 
greater proportion of atypical shifts than the 
approach-oriented subjects whose behavior in 
these situations is presumably clearly con- 
trolled by “intrinsic” achievement-related 
motivations. In further studies, attempts to 
control the social-desirability factor by inde- 
pendent measurement of individual differences 
in need to behave in a socially desirable 
manner or through experimental attempts to 
counteract the low social desirability of the 
atypical shift should be useful in deciding 
these questions. 

The exploratory data gathered on presuc- 
cess or prefailure choices provide encouraging 
results. Subjects in the avoidance-oriented 
group chose the easy or difficult task with 
greater frequency than the approach-oriented 
group, as expected. Of greater interest was 
the fact that in contrast to previous work, 
subjects in the avoidance-oriented group chose 
the easy task as often as they chose the dif- 
ficult one. Recall that several aspects of the 
procedure were aimed at producing just this 
effect. Ps for the easy and difficult tasks was 
set equidistant from .50 and an attempt was 
made to minimize the tendency of avoidance- 
oriented subjects to bias Ps downward. These 
results are, of course, only tentative support 
for the assumptions tested because there is 
no assurance at this point that these findings
are clearly attributable to the fact that bias 
was reduced by the operations involved. A 
more definitive test would involve a study 
in which the instructions intended to reduce 
the degree of bias would be retained for one 
experimental group and omitted for another 
experimental group. In such a study, it would 
be predicted that, among the avoidance- 
oriented subjects, a greater proportion would 
choose the difficult task when instructions 
intended to counteract bias were omitted than 
when these instructions were included. 

Finally, the initial choices of members of 
the ambivalent group in the present study 
tentatively suggest that when Ms and Maf are
of roughly equal strength, preferences for
tasks of relatively low Ps are predominant.
The suggestion that the social desirability of 
such choices may strongly affect the behavior 
of this group would seem to warrant further 
investigation. 

Although the successful conditioning of a variety of verbal responses has been
widely reported, the association of such effects with independent but related 
situations is less well established. In this study favorable attitudes toward 
“medical science” (and by implication toward the examiner) were successfully 
enhanced by verbal reinforcement and were found to be significantly associated, 
as predicted, with performance in a task requiring muscular effort (a dynamo- 
meter). Contrary to prediction, however, unfavorable attitudes could not be 
conditioned despite similar reinforcement. In nonreinforced control Ss the 
attitude scores did not change significantly, nor was there an association
between attitude scores and dynamometer performance.


The subject’s attitude toward the examiner 
has repeatedly been demonstrated to be a
major variable in situations involving behav- 
ior influence. Studies have illustrated that 
verbal conditioning can lead to significant 
changes in response to opinion, attitude, and 
personality scales (Centers, 1963; Ekman, 
1958; Hildum & Brown, 1956; Nuthmann, 
1957; Singer, 1961; Verplanck, 1955). 

In a previous study, Krasner, Ullmann, and 
Fisher (1964) demonstrated that attitudes 
toward “medical scientists” and, by implica- 
tion, toward the experimenter, could be influ- 
enced by the verbal reinforcement of ques- 
tionnaire responses. Further, in that study, as 
predicted, the change in attitude was associ- 
ated with changes on an independent task 
requiring muscular effort, dynamometer pull. 
Krasner et al. used as their attitude measure 
a 54-item Medical Science Questionnaire 
(MSQ).⁵ Two groups of subjects were used.


1 Portions of this paper were presented to the 
Annual Meeting, American Psychological Association, 
Los Angeles, September 1964. The research was sup- 
ported by Research Grant M-6191 from the National 
Institute of Mental Health, United States Public 
Health Service, through Stanford University. 

2 Now a United States Public Health Service Visiting
Scholar, Educational Testing Service, Princeton,
1964-65.

3 Now at the Medical Research Council’s Clinical
Psychiatry Research Unit, Graylingwell Hospital,
Chichester, Sussex, England.

4 Now at the University of Illinois.

5 The MSQ was developed from the rationale that
in a medical research experiment, the behavior of
the subject would be affected by his attitude toward
the role of the examiner. In a medically oriented


Both were given a dynamometer-pull task, 
followed by the first 24 items of the MSQ, 
administered orally without verbal reinforcement.
The experimental group was then given
the next 30 items of the MSQ with examiner 
reinforcement (nodding, smiling, and/or say- 
ing “mm-hmm”) of responses indicating 
favorable attitudes toward “medical science.” 
The dynamometer-pull task was then repeated.
The same procedure was followed
with the control group, but no reinforcement 
was given. The reinforced group increased 
their favorable responses by 11.6% which was 
significantly different from the control group 
in which the frequency of such responses 
decreased by 1.6%. 

Similar differences were found in the per- 
formance on the dynamometer-pull task. 
While the initial performance of both groups 
was comparable, the reinforced group in- 
creased their pressure on the dynamometer 
following the conditioning trials, in contrast 
to the control group which declined in 
performance. 

The results of this study were, however, 
open to the alternative interpretation that 
the increased effort on the dynamometer was 
a “nonspecific” response to social reinforce- 


experiment, the examiner role involves aspects of
both the practitioner and the scientist. Out of 212
items initially selected from the popular literature
dealing with favorable or unfavorable comments
about medical practice and about scientific research,
the 54 items which were finally selected, after presentation
to several subject groups, were those whose
favorability to medical science was most highly
associated with total favorability scores.

ment, rather than a function of the demon-
strable change in attitude. This alternative 
hypothesis was evaluated in the study to be 
reported, by comparing the dynamometer per- 
formance of subjects treated like the rein- 
forced subjects in the previous study, with 
that of subjects given equivalent degrees of 
reinforcement, but on a questionnaire whose 
content was unrelated to the attitude under 
consideration. Thus, one group of subjects 
received the MSQ and was reinforced for 
favorable responses as in the earlier study 
(Group 1), and a further group of 20 sub- 
jects was given a 30-item Medical Science 
Information Survey (MSIS) and reinforced 
irrespective of their response on 75% of the 
items (Group 2). The statements making up 
the MSIS were either favorable or unfavor- 
able comments about the adequacy with 
which such media as TV, radio, and news- 
papers handle issues of health. This was con- 
sidered to be an acceptable control pro- 
cedure in that the questionnaire had face 
validity in the experimental situation, but 
it was not likely to reflect direct attitudes 
about the examiner as was the MSQ. 

The present study was also designed to 
amplify the earlier findings by extending 
reinforcement to MSQ responses indicating
“unfavorable” attitudes (Group 3). Finally,
to assess the possibility of “spontaneous”
changes in responses from the first 24 to the
remaining 30 items of the MSQ, a nonreinforced
control group was added (Group 4).
These 20 subjects were given the MSQ and 
asked to complete the questionnaire during 
the next few minutes. This form of adminis- 
tration was thought to improve upon that 
previously used, in that the failure to give 
any reinforcement during an oral administra- 
tion is likely to be interpreted as negatively 
reinforcing by the subject. 

In all four groups of subjects, the depend- 
ent variable was the change in dynamometer 
performance between trials given prior to, 
and then following, the administration of the 
relevant questionnaire. 

The predictions tested were as follows: that 
the subjects in Group 1 reinforced for favor- 
able responses would significantly increase 
their MSQ scores when compared to the con- 
trol subjects in Group 4; that when compared 
to both control and MSIS (Group 2) subjects,
the reinforced Group 1 subjects would
significantly increase their dynamometer 
grip; that the Group 3 subjects reinforced 
for unfavorable responses to the MSQ items 
would decrease their MSQ score and decline 
in dynamometer performance when compared 
to the control subjects. 


METHOD 
Subjects 


The sample consisted of 80 women all of whom 
were mothers of children attending two cooperative 
nursery schools. These subjects were a selected 
sample in terms of both social and educational 
status. Many were married to members of the local 
academic and scientific community, and 75% of the 
subjects had attended college for an average of just 
over 3.5 years. The sample had a mean age of 31.59 
with an SD of 4.88. 


Procedure 


Because this study involved attitudes toward the 
examiner, the selection and orientation of the sub- 
jects was an important aspect of the experimental 
procedure. After an initial contact, a letter was 
sent to the school authorities asking for volunteers 
to participate in a study that was designed “to 
assess the relationship between personality factors 
and psychomuscular coordination as measured on a
dynamometer,” and that was one of “a series sup- 
ported by the United States Public Health Service 
through Stanford University.” At this time, the 
experimenter, an Englishman, was introduced as a
“physiological psychologist” and a member of the 
scientific staff of the British Medical Research 
Council, who had been granted leave of absence to 
participate in the study. In other words, during this 
initial contact the experimenters tried to evoke for 
the participants a maximum of prestige, scientific 
research aura, and face validity. As an additional
incentive, the school received a sum of $2.00 for 
each subject seen in the experiment. 

The training of the examiner followed the procedure
used by Krasner et al. (1964), which differed
in some respects from that used in earlier verbal
conditioning studies. This training recognized and
placed great reliance on the natural reinforcing
qualities of the examiner. Previously, the emphasis
had been on the physical characteristics of the reinforcement,
which was carefully circumscribed so
as to isolate the stimulus, making the verbalization,
often “mm-hmm,” analogous to a food pellet. In
contrast, our training of the examiner consisted of
discussions of the social reinforcement process, a
clear description of the response to be influenced, a
general description of the behavior cues which one
individual uses to influence another, and
rehearsal of the examiner-subject roles. It was left
to the examiner to use his own behavior appropriately
to influence the subjects. This behavior consisted
of the usual cuing involving verbalizations
such as “mm-hmm,” “good” and gestures such as 
smiling, head nodding, and probably other subtle 
gestural cues. 

The 80 subjects were allotted at random to one 
of the four experimental groups. They were seen 
on the school premises at a mutually convenient 
time. All subjects were treated alike prior to the 
administration of the attitude or MSIS scale. Fol- 
lowing a brief introduction to the sequences of 
tests which she could anticipate, each subject was 
interviewed in a standardized manner and informa- 
tion was obtained about the subject’s age, place 
of birth, her educational history, and number and 
age of her children and of her siblings (if any), 
and her husband’s occupation. Each subject was also 
asked whether she had any previous experience in 
medical or psychological research, and whether she 
had been hospitalized as an adult other than for childbirth.
Following the interview the subjects were
asked to complete two questionnaires, the Marlowe- 
Crowne Social Desirability (MC-SD) scale (Crowne 
& Marlowe, 1960) and the short form of the 
Maudsley Personality Inventory (MPI; Eysenck, 
1958). Although previous studies have been equivocal
in their findings when dealing with personality cor- 
relates, both “social desirability” and “extraversion- 
introversion” scales have evidenced some promise of 
predicting conditionability or response to verbal 
conditioning. 
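
The random allotment of the 80 subjects to four groups can be illustrated with a short sketch. The report does not describe the mechanism actually used, so everything below is an assumption: the function name `allot_to_groups`, the seed, and the choice of equal groups of 20 (suggested by the group sizes given for Groups 2 and 4) are all hypothetical.

```python
import random


def allot_to_groups(subject_ids, n_groups=4, seed=None):
    """Shuffle the subjects and deal them out in turn to equal-sized groups."""
    rng = random.Random(seed)
    shuffled = list(subject_ids)
    rng.shuffle(shuffled)
    # Every n_groups-th subject of the shuffled list goes to the same group.
    return {g + 1: shuffled[g::n_groups] for g in range(n_groups)}


# Hypothetical subject numbers 1..80; the seed only makes the run repeatable.
groups = allot_to_groups(range(1, 81), seed=1964)
for number, members in groups.items():
    print(number, len(members))
```

Dealing from a shuffled list guarantees equal group sizes, which independent coin-flip assignment would not.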

At this point, the subjects were given six trials on 
the dynamometer. The dynamometer was mounted 
in a stand adjusted to a convenient height so that 
the subject could easily grip the handle with her 
preferred hand. The subjects were instructed as
follows: 


This instrument is a dynamometer. It measures
precisely and accurately your strength of grip.
What I want you to do is as follows. When I say
“ready” I want you to grip the handle with your
favored hand. Then when I say “squeeze,” I want
you to squeeze the handle as hard as you can. Try
to keep your grip at that pressure until I say
“relax.” When I say “relax” let go of the handle
and rest your hand on the table. I will repeat
the procedure six times with a rest between each
trial. Remember, each time I say “squeeze,”
squeeze as hard as you can and try to keep your
grip at that pressure until I say “relax.” I will
tell you what to do—all right, let’s begin.⁶


The subjects were then informed that these measures
were to be repeated after a 15-minute interval,
during which time they should rest. The subjects
in Groups 1 and 3 were then given the following
instructions:

While you are resting I am going to read some
statements concerned with people’s feelings about
health and medical research. After I have read
out each statement I want you to say whether
you agree or disagree with the opinion it expressed.
If you cannot decide between these alternatives
just reply “cannot say.” Each time I want your
immediate reaction to the statement, and your
reply is to be simply either “agree,” “disagree,” or
“cannot say.”


6 An additional task, that of reporting the visual
aftereffects of the Archimedes spiral, was also given
to the subjects in this study. The results will be
reported in a subsequent paper.


Similar instructions were given to the subjects in 
Group 2, with slight modification appropriate to 
the questionnaire. 

The subjects were then read the first 24 items of 
the MSQ and their replies noted without comment 
from the experimenter. Both the experimenter’s be- 
havior and the content of the questionnaire were 
then varied for the three experimental groups. 

Group 1. Throughout the remaining 30 items of 
the MSQ these subjects were reinforced if their 
responses to the item (“agree” or “disagree”) indi- 
cated a favorable attitude. The reinforcement con- 
sisted of the experimenter expressing interest in, 
or agreement with, the subject’s responses. Interest 
and approval were conveyed by such means as 
smiling, nodding, saying “mm-hmm,” “yes,” “good,” 
and “that’s fine.” 

Group 2. For this group the latter 30 MSQ items
were replaced by the MSIS previously described,
and the subjects’ responses to 75% of the items were
reinforced irrespective of whether the responses
happened to be “agree,” “disagree,” or “cannot
say.”

Group 3. These subjects were similarly reinforced 
for responses to the same 30 MSQ items which indicated
an unfavorable attitude, that is, the converse
of those reinforced in Group 1. 

Group 4 (control). These subjects were handed 
the entire MSQ with written instructions similar 
to those given to Groups 1 and 3. 

After completing the relevant questionnaire, the 
subjects in all four groups were retested on the 
dynamometer and then briefly questioned about their 
reactions to the various tests. 


RESULTS 


Description of Sample 

College attendance and other social charac- 
teristics which distinguish the present sample 
from the general population have already 
been commented upon. In addition, it should
be noted that the sample was somewhat more 
neurotic and extraverted (as assessed by the 
MPI) than another United States sample of 
“middle-class housewives” (Wells, Egeth, & 
Wray, 1961), and that their initial attitudes 
towards medical research and medical practice 
were preponderantly favorable across all four 
groups; 69% of all responses to the first 24 
items of the MSQ were in the favorable direction.

TABLE 1
AGE, ATTITUDE, AND PERSONALITY CHARACTERISTICS OF THE FOUR EXPERIMENTAL GROUPS

                              Controls        Unfavorable       Favorable          MSIS
                             M    Variance    M    Variance    M    Variance    M    Variance     F
MSQ operant level          17.95   12.58    17.25   15.36    14.75   19.88    16.40   17.83     2.32
Social desirability scale  16.10   33.25    13.55   36.26    14.25   37.46    15.80   38.27     0.82
Age                        31.10   23.04            25.42    31.00   21.37    33.80   22.17     1.96
Neuroticism                 3.00    1.79             3.42     3.65    2.87     2.95    4.26     0.86
Extraversion                4.10    1.78             3.31     3.90    4.31     3.22    2.42     0.96


Note.—All F ratios nonsignificant at .05 level.


However, together with mean age, these 
personality and attitudinal characteristics 
were distributed at random between the
four experimental groups within the present 
sample. There was no significant difference 
between the respective means as assessed by 
analysis of variance (see Table 1). 
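
The F ratios in Table 1 can be recovered from the tabled means and variances alone. The sketch below is an illustration, not the authors' computation. It assumes 20 subjects per group and that the tabled variances are unbiased (n − 1) estimates, so that the within-groups mean square is simply the average of the four variances. The function name `f_from_summary` is hypothetical.

```python
def f_from_summary(means, variances, n_per_group):
    """One-way ANOVA F ratio from group means and variances (equal group sizes)."""
    k = len(means)
    grand_mean = sum(means) / k
    ms_between = n_per_group * sum((m - grand_mean) ** 2 for m in means) / (k - 1)
    ms_within = sum(variances) / k  # valid only when all groups have the same n
    return ms_between / ms_within


# Order of groups as in Table 1: Controls, Unfavorable, Favorable, MSIS.
f_msq = f_from_summary([17.95, 17.25, 14.75, 16.40],
                       [12.58, 15.36, 19.88, 17.83], 20)
f_sd = f_from_summary([16.10, 13.55, 14.25, 15.80],
                      [33.25, 36.26, 37.46, 38.27], 20)

print(round(f_msq, 2), round(f_sd, 2))  # 2.32 0.82
```

Under these assumptions the two complete rows of the table reproduce the printed F ratios of 2.32 and 0.82.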


Conditioning of Attitude 


Reinforcement of favorable responses. As 
predicted, when responses (stated either as 
agree or disagree) indicating a favorable atti- 
tude toward medicine and medical science 
were reinforced (Group 1) they became sig- 
nificantly more frequent (see Tables 2 and 3). 
Thus, while they constituted 61.46% of all 
responses during the operant period, 70.96% 
of the responses during the reinforced period 
indicated a favorable attitude (t = 2.97,
p < .01). This increase occurred irrespective
of the subjects’ operant level and differed
significantly (F = 11.90, p < .01) from the
trend in the control group (Group 4) where 
such responses declined in frequency. 


TABLE 2

PERCENTAGE OF MSQ RESPONSES BY EXPERIMENTAL
GROUP AND CATEGORY OF RESPONSE

                           Percentage of responses falling in each category
                             Favorable             ?            Unfavorable
Group                      0-24   25-54       0-24   25-54      0-24   25-54
Reinforced favorable       61.5   71.0        13.1   14.0       25.4   15.0
Controls                   74.8   68.7         9.6   13.8       15.6   17.5
Reinforced unfavorable     71.9   69.7        10.0   13.0       18.1   17.3
MSIS                       68.3               10.0              21.7


Although subjects with high (that is, above 
median) Marlowe-Crowne SD scores tended 
to show greater increases in favorable re- 
sponse than did subjects with low scores, and 
also showed a higher operant “favorability” 
score, this variable was not significantly asso- 
ciated with the degree of attitude change: 
the Groups × Trials interaction F ratio was
1.90 and nonsignificant. 

Neither neuroticism nor extraversion was
significantly related to conditioning effects. 
However, throughout the total MSQ question- 
naire, extraverts were found to express sig- 
nificantly more favorable attitudes than the 
introverts (F = 4.51, p < .05). 

Reinforcement of unfavorable responses. 
Contrary to prediction the introduction of 
reinforcement had no significant effect on the 
frequency of MSQ responses unfavorable to 
medicine. Such responses were equally fre- 
quent in the reinforced group (Group 3), and 
the control group (Group 4), and in the 
operant and reinforced periods (see Table 4). 




Dynamometer Performance 


Table 5 shows the dynamometer performance
of subjects within the four experimental
groups when assessed prior to, and then following,
the administration of the relevant
questionnaire. These figures represent the sum
of the maximum grip recorded (in kilograms)
on each of the six trials.

Analysis of variance indicated a highly
significant difference in mean performance
“between occasions.” Since the interaction
term was not significant, it may be concluded
that all subjects performed less well during
the trials given following the questionnaire
period irrespective of the intervening experimental
treatment. Thus, the predictions regarding
the subjects’ dynamometer performance
were not confirmed.


TABLE 3

PERCENTAGE OF FAVORABLE RESPONSES
IN GROUP 1 (FAVORABLE REINFORCED)
AND GROUP 4 (NONREINFORCED)

                       Trials
                   0-24     25-54
Reinforced         61.46    70.96ᵃ
Nonreinforced      74.79    68.64ᵃ

ᵃ Differences in increases between the two groups significant,
F = 11.90, p < .01.


TABLE 4

PERCENTAGE OF UNFAVORABLE RESPONSES
IN GROUP 3 (UNFAVORABLE REINFORCED)
AND GROUP 4 (NONREINFORCED)

                       Trials
                   0-24     25-54
Reinforced         18.14    17.32ᵃ
Nonreinforced      15.65    17.48ᵃ

ᵃ Differences in increases between the two groups nonsignificant,
F = .47.


TABLE 5

MEAN DYNAMOMETER PERFORMANCE (KILOGRAMS)
BY GROUP BEFORE AND AFTER THE
QUESTIONNAIRE PERIOD

                                     Pre       Postᵃ
Reinforced favorable (Group 1)      220.70    206.30
Reinforced unfavorable (Group 3)    222.70    211.45
Controls (Group 4)                  211.70    202.45
MSIS (Group 2)                      216.00    203.80

ᵃ The groups did not differ significantly in performance
change from pre to post.


Relationship between Attitude Change and
Dynamometer Performance in Group 1


The data for Group 1 subjects were examined
to determine whether their change
in attitude influenced their performance on
the dynamometer task following conditioning.
The subjects’ change in attitude was assessed
by the following ratio, in which F is
“favorable.”

\[
\text{Percentage of attitude change} =
\frac{\text{Proportion “F” responses on Trials 25-54} \;-\; \text{Proportion “F” responses on Trials 0-24}}
     {\text{Proportion “F” responses on Trials 0-24}}
\]

A similar computation was made for the
dynamometer readings with the difference
between the pre- and postconditioning performance
being expressed as a percentage
increase or decrease from the preconditioning
performance.

The rank-order correlation between these
two variables was 0.49, indicating that within
this group the magnitude of the decline in
dynamometer grip was significantly correlated
(p < .05, one-tailed) with the degree of attitude
change. For these reinforced subjects,
those who increased their attitude scores the
most either increased or minimally decreased
their grip on the dynamometer following
conditioning. In contrast, the comparable correlation
within the control group was -0.12
and nonsignificant.
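
The change scores and the rank-order correlation described in this section can be sketched as follows. This is an illustration only. It assumes that "rank-order correlation" refers to Spearman's coefficient, computed here as the product-moment correlation of the ranks, and that both change scores are expressed as a percentage of the initial value. The function names are hypothetical, and the per-subject figures are invented purely to exercise the code; they are not data from the study.

```python
def percent_change(pre, post):
    """Change from pre to post as a percentage of the pre value."""
    return 100.0 * (post - pre) / pre


def ranks(values):
    """1-based ranks, with tied values sharing their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def rank_order_correlation(x, y):
    """Spearman's coefficient: Pearson correlation computed on the ranks."""
    rx, ry = ranks(x), ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


# Group 1 means from Tables 3 and 5, used only to show the arithmetic.
attitude = percent_change(61.46, 70.96)
grip = percent_change(220.70, 206.30)

# Hypothetical per-subject change scores, NOT data from the study.
attitude_changes = [5.0, 12.0, 20.0, 31.0, 8.0]
grip_changes = [-9.0, -6.0, -2.0, 1.5, -7.0]
rho = rank_order_correlation(attitude_changes, grip_changes)
print(round(attitude, 2), round(grip, 2), round(rho, 2))
```

With change scores defined this way, a positive coefficient means that subjects whose attitude scores rose most showed the smallest decline, or an increase, in grip.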


Discussion 


The results can be summarized as follows: 
We have replicated the previous study, using 
a different examiner, and again found that 
favorable attitudes toward the examiner, as 
measured by a questionnaire, can be modified 
by reinforcement. Further, this change in 
attitude was related to the subject’s performance
on a motor task requiring strong physical
effort. This change in performance appeared
to be related to the specific content
which was being reinforced, rather than re- 
inforcement per se. That is, a significant 
relationship between performance change and 
reinforcement was found in the group which 
was reinforced for favorable content (Group 
1) but not in the group receiving compa- 
rable reinforcement but for a nonspecific 
content (Group 2). Unfavorable attitudes
were unaffected by reinforcement (Group 3). 

This study has been one of a series investi- 
gating the parameters of placebolike situa- 
tions. The major implication thus far is that 
attitudes toward the examiner, a very basic 
ingredient in a placebo situation, can be modi- 
fied by simple reinforcement cues. Further, 
these attitudes are related to the performance 
of the required task. Briefly summarized, the 
examiner-influencer can affect the subject’s 
attitude toward him and this will be re- 
flected in the subject’s compliance in per- 
forming the task required of him. Further, 
in a situation in which there is considerable 
prestige attributed to the examiner, and in 
which the subject starts with favorable atti- 
tudes, suitable cuing is ineffective in changing 
these attitudes in a negative direction. It 
would be easy to visualize that if anyone did 
want negative attitudes, an unlikely situation, 
much more overt and annoying behavior 
would have to be used. 

These studies can also be seen as related 
to those which have been investigating the 
association between change of verbal behavior 
in one situation and changes in a behavior 
in an independent situation. Our results in 
this study are consistent with the findings 
we have obtained in previous studies in 
which rated behavior in group therapy 
(Ullmann, Krasner, & Collins, 1961) and 
recognition of emotional stimuli (Ullmann, 
Weiss, & Krasner, 1963) have been associated 
with specified changes in verbal behavior. 

The earlier finding (Krasner et al., 1964) 
of the significant relationship between high 
social desirability and high favorable operant 
score was repeated. In addition, in this study,
high extraversion score was also related to 
high operant level for “favorable” response. 
Thus, both of these relationships would point 
to “extraverts” and those high in social de- 
sirability as being more likely to report favor- 
able attitudes toward the role of the “medical- 
scientist” examiner. 

If the attitude of the patient toward the 
therapist is an important variable in the 
psychotherapeutic relationship, it would then 
seem reasonable that increasing the favorability
of this attitude will facilitate responsiveness
to the treatment procedures. The present
study is one of a series which is laying the
foundation for this procedure by demonstrating
experimentally that attitudes toward
the examiner may be manipulated, and such
manipulations have an effect on the subject’s
performance of a task requiring effort.


THE EFFECTS OF INTERACTION GOALS AND PERSONALISTIC
FEEDBACK ON THE PRESENTATION OF SELF* 


KENNETH J. GERGEN 
Harvard University 


During a 30-min. interview Ss were (a) either exposed to a type of social
feedback called reflective reinforcement or not exposed to such feedback, (b) 
presented this feedback in either a personal or impersonal manner, and (c) 
instructed either to be accurate about themselves or to make a good impression. 
As predicted, the positive character of the Ss’ self-description increased to a 
greater extent under feedback than under conditions of no feedback. However, 
the variation in personalism had minimal consequences. Ss instructed to make 
a good impression demonstrated a greater increase in positiveness than did 
accurate Ss. The generalization of the responses produced by these manipula- 
tions and the role of awareness were also explored. 


The present study was designed to investi- 
gate a few of the major conditions affecting 
the way in which one person presents or de- 
scribes himself to another. It seems clear that 
persons do present different self-attributes to 
meet the functional demands of various 
social settings (cf. Goffman, 1959). On the 
other hand it is also clear that persons are 
concerned with self-consistency, and are 
repeatedly faced with the dilemma: how to 
remain “true to self” and yet respond adaptively
to varying social demands. This conflict
between consistency and social adaptation 
can be considered a major theme of the 
present investigation. Thus the study is con- 
cerned with the effects of differing social con- 
ditions on changes in self-presentation, and 
with the consequences of having changed for 
subsequent self-description. 

Social reinforcement has long been considered
the most important determinant of
the self-concept. Such reinforcement, or feedback,
seems to be continuously influential in
moulding the self-concept in at least two
ways: (a) From the standpoint of operant
conditioning a person can be reinforced by
others for providing certain behavior at certain
times. Since such responses should then
occur with increasing frequency, the person
should come to feel that such responses are
truly representative of the self. (b) From
the viewpoint of classical learning a person
can come to associate his overt behavior with
descriptions communicated to him by others.
It is one version of this process which has 
been stressed by the “symbolic interactionists” 
(for example, G. H. Mead and C. H. Cooley). 
According to this view, a person’s self-defini- 
tion is primarily a reflection of the views of 
significant others about him. 

Based on the assumption that persons tend 
to describe themselves in accord with self- 
conceptions, the present experiment attempted 
to exploit both of the above components of 
social feedback to produce a shift in the self- 
image, and thus a shift in self-presentation. 
Subjects described themselves to an attractive 
interviewer who agreed with their positive 
statements about self and disagreed with 
negative self-statements. In this manner the 
interviewer reflected her opinion of the sub- 
ject to the subject. However, in that her 
opinion was also highly gratifying to the sub- 
ject, it served as a reinforcing agent for 
certain of the subject’s responses. This type 
of dual feedback can be termed reflective 
reinforcement. 

It is thus proposed that social feedback in 
the form of reflective reinforcement alters 
the way in which a person views himself in a 
given situation. These alterations will be re- 
flected in modified presentation of self. The 
second major intent of the present study was 
to show that the same feedback can have
quite different consequences depending on
whether it is perceived to be personal or
impersonal and on the goal of the subject in the
situation.

One aspect of interpersonal relations which 
intuitively seems important is the extent to 
which they are personalistic in character. 
Some relationships we feel to be conducive 
to the expression of true feelings; others 
seem to breed insincere expressions of feel-
ing. Although there is certainly a continuum
along which such relationships vary, the
former type of relationship can be termed 
“personal” and the latter “impersonal.” 
(Compare Jones & Thibaut’s, 1958, notion of 
“reciprocal contingency” and Jourard’s, 1964, 
concept of “authentic interaction.”) In terms 
of the above discussion, it would seem that 
impersonal feedback is both nonreflective and 
nonreinforcing. Impersonal feedback cannot 
be trusted to be truly representative of an- 
other's feelings, and thus cannot be considered 
valid as a reflection of another’s perceptions 
of us. Second, because impersonal feedback
is neither informative nor a meaningful ap- 
proval cue, it should have less reward value 
than personal feedback. It was thus hy- 
pothesized that impersonal feedback should be 
less effective in influencing the way a person 
thinks of himself in a given situation, and, in 
turn, should have less effect on self-presenta- 
tion. 

A further source of variation in an individ- 
ual’s response to social feedback is the actor’s 
primary goal in the situation. Perhaps the 
most relevant dimension along which such 
goals can vary is that inherent in the previous 
discussion of the self-presentation dilemma.
At one extreme of the dimension the salient 
goal of the actor is to insure consistency be- 
tween his presented attributes and some core 
concept of self. At the other extreme, the 
goal of social adaptation would be salient 
and self-consistency ignored. In the present 
study the attempt was made to tap into this 
continuum at fairly extreme points in order 
to sample the effects of different interaction 
goals on self-presentation and subsequent self- 
description. To this end, one set of subjects 
was asked to be honest and accurate; a
second set was instructed to try to create the
most positive impression possible on the inter-
viewer. This latter goal can be termed
ingratiation.

There is an empirical precedent for using 
the particular interaction goals of accuracy 
versus ingratiation. In a study by Jones, 
Gergen, and Davis (1962) subjects attempt- 
ing to make an interviewer respond favorably 
to them described themselves more positively 
than a group instructed to “be themselves.” 
Although such a finding seems rather obvious, 
there are also conditions under which the 
opposite is true. In a second study (Jones, 
Gergen, & Jones, 1963) low-status subjects 
showed the same tendency as subjects in the 
above study, but high-status subjects actually 
became more modest when attempting to 
ingratiate themselves with lows. 

In view of these findings, vis-à-vis the fact 
that subjects in the present experiment were 
also in a low-status position relative to the 
interviewer, it was anticipated that subjects 
attempting to ingratiate should rate them-
selves from the onset of the interview more
positively than those attempting to be ac-
curate. In addition, they should be more
sensitive to the approval cues given by the
interviewer. They should be less willing to
accept feedback as more or less descriptive
of them as a function of the interviewer’s
personalism. Rather, feedback should function
as a notification or signal of the optimal tactic
of self-presentation. It was thus predicted
that the rate of increase in the positiveness
of self-presentation should be greater when
subjects are attempting to ingratiate than
when attempting to be accurate. In the ingra-
tiation conditions self-presentation should not
vary as a function of the personalism of the 
interviewer. 


Generalization of Self-Presentation 


In this study generalization was measured
after the interview by having subjects fill out
a number of items similar to those used in the
interview. Based on the assumption that
reflective reinforcement is effective in altering
a person’s feelings about self, it was predicted
that subjects who are attempting to be ac-
curate and who receive personal feedback
should generalize from the reinforcement
experience to a greater extent than those
receiving impersonal feedback. However, in
the ingratiation conditions the assumptions
are quite different in character. Since the 
instructions specifically invite the subject to 
ignore his private assessment of self during 
the interview, there should be little tendency 
on the part of the subject to make his sub- 
sequent self-descriptions consistent with his 
behavior in the interview. It was thus antici- 
pated that the generalization effects in the 
ingratiation conditions should show a more 
complete return to the preexperimental level 
of positiveness than in the accuracy con- 
ditions. 


Awareness of Change 


It is also of some importance to know the 
degree to which one is aware of shifts in self- 
presentation. Recent research on learning 
without awareness has cast considerable doubt 
on the earlier findings that learning could be 
achieved without the subject’s being capable 
of verbalizing the fact. According to several 
recent sources (cf. Eriksen, 1962) successful 
learning on the tasks used to date seems to 
require some type of active hypothesis forma- 
tion. However, it does seem that if learning 
without awareness is to be found, it might be 
advantageous to use methods which do not 
promote a problem-solving set. Also, it would
seem that one might have a better chance of
obtaining learning without awareness if the
materials used tapped emotional rather than
strictly rational resources. The former should
be less easily verbalized and, at the same
time, more naturally responsive to situational
changes. In the present study it was felt that
both the informal nature of the experimental
situation and the instructions to be honest
would operate against the tendency to adopt a
problem-solving set. Also, as the materials
used in the experiment were primarily evalua-
tive in nature, rationality should not play
such a significant role. Subjects in the ingra-
tiation conditions also provide a convenient
comparison group, for in these conditions the
instructions should shift the subject’s focus
to one more closely akin to problem solving.


METHOD 


The basic design was a 2 × 2 factorial with four
cells reflecting combinations of interviewer personal-
ism and interaction goal of the subject. It was also
of importance to assess the effects of the stimulus
conditions in the absence of reinforcement. Thus,
additional subjects were interviewed under accuracy- 
personal or under accuracy-impersonal instructions 
with no systematic reinforcement provided. The 
experiment thus involved two overlapping designs, 
each containing four cells, with the two accuracy 
cells in common. Nine subjects appeared in each 
of the six cells. 
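
The overlap between the two designs can be checked by enumerating the cells. The sketch below is illustrative only: the tuple layout and variable names are assumptions introduced here, while the factor levels and the nine subjects per cell come from the text.

```python
from itertools import product

SUBJECTS_PER_CELL = 9  # stated in the text

# Each cell is (reinforcement, interaction goal, interviewer personalism).
# The labels are illustrative names, not the author's notation.
reinforced_design = set(
    product(["reinforced"], ["accuracy", "ingratiation"], ["personal", "impersonal"])
)
accuracy_design = set(
    product(["reinforced", "nonreinforced"], ["accuracy"], ["personal", "impersonal"])
)

shared_cells = reinforced_design & accuracy_design  # cells common to both designs
all_cells = reinforced_design | accuracy_design     # every distinct cell

print(len(reinforced_design), len(accuracy_design))  # 4 4
print(len(shared_cells), len(all_cells))             # 2 6
print(len(all_cells) * SUBJECTS_PER_CELL)            # 54
```

The two shared cells are the reinforced accuracy-personal and reinforced accuracy-impersonal groups, which is why two four-cell designs require only six cells in all.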


Self Valuation Triads Test 


The Self Valuation Triads test (SVT test), derived 
from a measure used by Dickoff (1961), consisted 
of 72 groups of three self-descriptive statements. The
statements covered a wide range of attributes and
within each triad one statement reflected a positive
trait, one a negative trait, and the third a trait
more neutral in tone.2 The positiveness of each
statement was established through a group of in-
dependent raters. For each triad the subject was
to distribute 10 points among the statements in
such a way that the most descriptive statement of
the three would receive the greater number of
points, etc. A subject’s SVT score was calculated
by summing the points given to positive statements.
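
The scoring rule just described can be sketched in a few lines. This is a minimal illustration, not the original scoring procedure: the function name, the dictionary keys, and the example allocations are all hypothetical; only the 10-point allocation per triad and the summing of points on positive statements come from the text.

```python
def svt_score(allocations):
    """Sum the points a subject gave to the positive statement of each triad.

    `allocations` is a list of dicts with the hypothetical keys
    "positive", "neutral", and "negative". Each triad must distribute
    exactly 10 points, as the test instructions require.
    """
    total = 0
    for number, triad in enumerate(allocations, start=1):
        points = (triad["positive"], triad["neutral"], triad["negative"])
        if any(p < 0 for p in points) or sum(points) != 10:
            raise ValueError(
                f"triad {number}: points must be non-negative and sum to 10"
            )
        total += triad["positive"]
    return total


# Two made-up triads, for illustration only.
example = [
    {"positive": 6, "neutral": 3, "negative": 1},
    {"positive": 4, "neutral": 4, "negative": 2},
]
print(svt_score(example))  # 6 + 4 = 10
```

Because every triad carries a fixed 10 points, a higher score can only arise from shifting points away from the neutral and negative statements.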


Subjects 


The subjects were freshman and sophomore fe- 
males enrolled in an introductory psychology course. 
Early in the course a group testing session (here- 
after called the “neutral” testing period) was held 
in which all students filled out the following meas- 
ures: the SVT test, a questionnaire concerning these 
evaluations, and the deCharms and Rosenbaum 
(1960) self-esteem measure. Of this initial group,
persons scoring in the upper and lower fifths on
the SVT measure were excluded from further con-
sideration. Of the remaining group, 56 were sub-
sequently selected and assigned to experimental con-
ditions on essentially a random basis. Attention was
given to the equality of initial SVT scores from
cell to cell. Two subjects had to be excluded from
the final sample because of stated suspicion of the
manipulation.


Experimental Procedure 


Inasmuch as experimental conditions differed 
from each other in more than one respect, the 
procedure for the cell combining accuracy and per-
sonal instructions will be outlined first.

Accuracy-personal condition. The experiment was
introduced to each subject as part of an interviewing
project concerned with obtaining personality in-
formation from large numbers of people. Students
were being used to aid in training the interviewers
prior to the actual carrying out of the survey. The
interviewers were said to be carefully chosen from
the junior and senior classes. Their central task was
to get to know the persons interviewed as well as
possible during the time available. In doing this,
however, the interviewer was to concentrate on being
as natural and spontaneous as possible; she was not
to adopt a stereotyped “front” but was to consider
each person as an individual.

2 A sample triad, composed of a positive, negative,
and neutral phrase, is as follows:
very accepting and approachable
resentful of others
aloof when busy

The remainder of the procedure was then briefly 
outlined for the subject. First, the interviewer would 
be introduced and in her presence the experimenter 
would ask a number of questions regarding the
subject’s background. Meanwhile the interviewer
would take notes. Since part of her training in- 
cluded practice in making rapid judgments, she 
would then leave for a short period to make some 
preliminary ratings of the subject. Upon returning 
she would begin the main portion of the interview. 
The subject was encouraged to be honest and natural 
throughout the procedure. 

The interviewer was then brought in and in-
troduced. She was an attractive, well-dressed,
junior-class female, instructed to behave in a stand- 
ard, pleasant manner at all times. After the question- 
ing the interviewer left the room. Upon her return 
5 minutes later the experimenter departed and the 
interviewer began by asking the subject a number 
of standard, informal questions about her interests, 
etc. The subject was then told that the remainder 
of the interview would center around the items on 
the SVT test.

Accuracy-impersonal condition. Upon the subject’s 
arrival she was taken to an observation booth from 
which she could see through a one-way mirror into 
the same room as used above. The experimenter’s 
explanation of the situation was the same as above 
until the interviewer's task was described. Here the 
subject was told that the interviewer had been 
extensively trained in interviewing techniques. She 
had been taught what to say, how to smile, how to 
make the interviewee feel comfortable, etc. Her task 
was only to practice the techniques she had learned.
In order that this might be done without distrac- 
tion, the subject was told that the interviewer would 
be looking into a one-way mirror and would not be 
able to see her. 

Although the interviewer in the personal con-
dition was exposed to information about the sub-
ject and was given time to think about this in- 
formation, in the impersonal condition she received 
no such information. In order to keep all other 
aspects of the situation constant, however, the 
experimenter asked the subject the same preliminary 
questions regarding her background. During this
questioning the interviewer moved into a position 
where she could be seen through the mirror by the 
subject. It was also clear to the subject that the 
interviewer could not hear her response. When the 
questioning was over the microphones were switched 
on. The subject was introduced only by her first 
name, and the interview proceeded as in the above 
condition. 

Ingratiation conditions. A third of the subjects in 
both of the above conditions were instructed to make 
a positive impression on the interviewer. In all cases 
these instructions followed the experimenter’s pre-
liminary questioning of the subject. In the personal
condition they were given while the interviewer was 
out of the room “rating” the subject. In the im- 
personal condition they were given just before the 
microphone was turned on. In each case the sub- 
ject was told that it was very important for the 
planners of the project to know how a person would 
rate herself if she were only attempting to make a 
good impression. The subject was told that in carry- 
ing out her task she could say whatever she pleased 
about herself. 

Nonreinforced conditions. The instructions for the 
nonreinforced accuracy-personal and the nonrein- 
forced accuracy-impersonal conditions were exactly 
the same as outlined above for their reinforced 
counterparts. The only difference in the procedure 
was that during the presentation of the triads, to 
be described below, the interviewer gave no rein- 
forcement and confined her conversation to the 
mechanics of the proceedings. 


Procedure for Providing Reflective Reinforcement


Sixty triads of the 72 on the original test had 
been assembled so that they could be handed one 
by one to the subject. (In the impersonal conditions, 
two sets of triads were assembled, one for the sub- 
ject and another for the interviewer.) The triads had 
been arranged so that, for scoring purposes, they 
formed six trial blocks with 12 triads in each block. 
All but the sixth block (generalization triads) were 
administered during the interview. The triads had 
been rearranged, however, so that the subject's 
previous score for any given block roughly equaled 
her score on each of the other blocks. In this manner 
the base-line score of each subject remained similar 
from one trial block to the next. Care was also taken 
that each trial block contained the same number of
triads of each order of positive, neutral, and negative 
statements. In addition, it was insured that no triad
appeared in a given trial block significantly more 
often in one experimental condition than in another. 

During the experimental procedure the triads were
handed one by one to the subject who would then
record in pencil by each statement the number of
points she decided to assign. She would then hand
the triad back to the interviewer. (In the impersonal
conditions, the subject would relay the point assign-
ments to the interviewer over the microphone.) The
interviewer would then reinforce her according to
the following schedule:


1. During the first 24 triads, each time the points
assigned to the positive statement equaled or ex-
ceeded the subject’s previously established average
for positive statements, the interviewer would pro-
vide positive reinforcement. (The interviewer was
preinformed of the subject’s previous average for
positive statements in each case.)

2. During the first 24 triads, each time the points
assigned to the positive statement were less than
the subject’s previous average for positive statements,
the interviewer would reinforce negatively.

3. During the last 36 triads, positive and negative
reinforcements were provided only when the subject
exceeded or went below her previous average by
2 or more points. Positive statements receiving
points which could neither be positively nor nega-
tively reinforced according to this criterion were
responded to with silence.
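
The three-part schedule can be written as a small decision rule. The sketch below is an illustration under stated assumptions, not the interviewer's actual protocol: the function and parameter names are invented here, and the margin for the last 36 triads is read as 2 points, which is an assumption because the figure is unclear in the source.

```python
def feedback(triad_number, positive_points, baseline_average, late_margin=2):
    """Return the interviewer's response to one triad.

    triad_number     -- position in the interview, 1 through 60
    positive_points  -- points the subject gave the positive statement
    baseline_average -- subject's previously established average for
                        positive statements
    late_margin      -- ASSUMED margin (2 points) applied after triad 24
    """
    if not 1 <= triad_number <= 60:
        raise ValueError("the interview covered triads 1 through 60")

    difference = positive_points - baseline_average

    if triad_number <= 24:
        # Early phase: every response is reinforced one way or the other.
        return "agree" if difference >= 0 else "disagree"

    # Late phase: only clear departures from the baseline are reinforced.
    if difference >= late_margin:
        return "agree"
    if difference <= -late_margin:
        return "disagree"
    return "silence"


print(feedback(5, 4, 4.0))   # agree (equals the baseline in the early phase)
print(feedback(30, 5, 4.0))  # silence (within the margin in the late phase)
print(feedback(30, 6, 4.0))  # agree
```

The shift after triad 24 makes approval harder to earn, so continued gains in positiveness cannot be attributed to reinforcement of responses near the baseline.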


As the reinforcement was also gauged to convey 
the interviewer’s impression of the subject to the 
subject, it took the form of agreement or disagree- 
ment with the subject’s self-evaluations. However, 
in order that the interviewer appear natural, she 
was allowed to use five types of positive and nega- 
tive reinforcement. The interviewer expressed positive 
reinforcement in the following ways: 


a. Yes, I would agree with . . .
b. Yes, I think you are . . .
c. Yes, you look like you would be . . .
d. Good
e. Very good


All of these were accompanied by either a smile or a 
nod of the head. The negative reinforcements were 
as follows: 


a. You don’t strike me as being the type who . . .
b. I sort of disagree with . . .
c. I would say you were much more . . .
d. I think you underestimate yourself . . .
e. Oh, really?


Disagreement was accompanied by either a frown or
a puzzled expression. In order to check on the
comparability of reinforcement type across condi-
tions, approximately half of the interviews in each
condition were recorded from a hidden location. 

In addition to the data obtained during the rein- 
forcement procedure, several other measures were 
administered by the experimenter after the interview 
was over. Included were the final 12 triads (generali-
zation triads), the deCharms-Rosenbaum self-esteem
test, a series of ratings to be made of the inter-
viewer, and a questionnaire designed to tap aware-
ness and the subjects’ reactions to other aspects
of the situation.


RESULTS 
Validation of Experimental Manipulations 


Before assessing the major results it is 
first necessary to establish whether or not the 
experimental manipulations were effective. 

Variation in perceived personalism. Per-
ceived personalism was measured after the
interview by having subjects rate on a 5-point
scale both how “natural and spontaneous”
the interviewer seemed to be and to what
extent her behavior seemed “bound by a set
of rules.” A 10-point scale of “perceived per-
sonalism” was derived by combining subjects’
ratings on both items. Under conditions of
reinforcement, the interviewer in the personal
conditions was seen as significantly (p < .01) 
more personal than in the impersonal condi- 
tions. Although the difference between means 
in the nonreinforced conditions was in the 
same direction, this difference did not reach 
significance. However, this failure is not 
crucial in understanding the results. 
Variation in interaction goal. As will be
recalled, a third of the subjects were given 
instructions to make the most positive im- 
pression possible on the interviewer and the 
remainder were encouraged to be natural and 
honest during the interview. On the final 
questionnaire all subjects were asked how 
they came to assign the number of points 
to the triads during the interview. Without a 
single exception, all subjects in the accuracy 
conditions indicated that they gave the inter- 
viewer an honest picture of themselves; sub- 
jects in the ingratiation conditions indicated 
that they had described themselves in such 
a way as to make a good impression. 
Variation in reflective reinforcement. In 
assessing the validity of this variation, it was 
of importance to know whether subjects in 
the reinforced conditions perceived the inter- 
viewer as feeling more positively about them 
than subjects in the nonreinforced conditions. 
The answer to this question could be found 
in the subjects’ predictions of the way the 
interviewer would rate them. After the inter-
view, each subject filled out a series of
antonyms on which she was to predict the 
way the interviewer would rate her after the 
interview. Most relevant was the personal 
attractiveness cluster, formed by such anto- 
nyms as “likable” versus “not particularly 
likable,” and “attractive personality” versus 
“unattractive personality,” etc. An analysis of
variance of the combined antonym ratings 
indicated that the reinforced-accuracy groups, 
as compared with the nonreinforced groups, 
felt that the interviewer would rate them as 
more personally attractive (p < .05). 
Stability of reinforcement type. As there 
were several ways in which positive and nega- 
tive reinforcements could be expressed, the 
question arises as to whether some reinforcing 
phrases might have been used more often in 
one condition than in another. However, an 
analysis of these phrases revealed only that
in every condition more of the shorter re-
inforcing phrases, for example, “good” or
“very good,” were used than the longer ones.
No other systematic differences among types 
of reinforcement used across the various 
conditions were discovered. 

In summary, with the exception of the 
failure to find significant differences in per- 
ceived personalism in the nonreinforced 
conditions, there is good evidence that each 
of the manipulations was perceived or carried
out as intended. 


TABLE 1

ANALYSIS OF VARIANCE OF POSITIVE CHANGE SCORES
IN THE REINFORCED- AND NONREINFORCED-
ACCURACY CONDITIONS

Source                            df         MS         F
Between subjects                 (35)
  Personal-impersonal (B)          1        4.05     <1.00
  Reinforced-nonreinforced (C)     1    2,936.27     14.46**
  B × C                            1       22.05     <1.00
  Error (b)                       31a     202.89
Within subjects                 (144)
  Trial blocks (A)                 4       90.02      2.25
  A × B                            4       32.83     <1.00
  A × C                            4       81.23      2.03
  A × B × C                        4        5.36     <1.00
  Error (w)                      124a      39.92

a Reduced because of data substituted for one subject.
** p < .001.


Changes in Self-Valuation


Effects of reflective reinforcement. It was
predicted that reflective reinforcement would
produce significant increases in the positive-
ness of self-ratings under conditions in which
the subject was told to be accurate. In order
to test this prediction, neutral and experi-
mental SVT scores across the five trial blocks
were converted to positive-change scores, a
higher score reflecting a greater change
toward being more positive about oneself. The
mean positive change across the five trial
blocks for the various conditions appears
graphically in Figure 1. In order to test this
first prediction a Type III (Lindquist, 1956)
analysis of variance was carried out in which
the change scores of the reinforced- and non-
reinforced-accuracy groups were compared.
The results of this analysis are summarized in
Table 1. As evidenced by the significant main
effect due to the variation in reinforcement,
it is clear that reflective reinforcement did,
indeed, produce a marked increase in the
positiveness of self-presentation. Orthogonal
comparisons further indicate that the positive
change in the reinforced accuracy-personal
condition was significantly (p < .01) greater
than that found in its nonreinforced coun-
terpart; a comparison of the reinforced
accuracy-impersonal group and its nonrein-
forced control also yielded a significant
(p < .025) difference.


TABLE 2


ANALYSIS OF VARIANCE OF POSITIVE CHANGE SCORES
IN THE REINFORCED CONDITIONS

Source                            df         MS         F
Between subjects                 (35)
  Personal-impersonal (B)          1          67     <1.00
  Accuracy-ingratiation (C)        1    4,234.05     19.3
  B × C                            1       56.67     <1.00
  Error (b)                       31a     218.66
Within subjects                 (144)
  Trial blocks (A)                 4      260.48      6.08
  A × B                            4       30.95     <1.00
  A × C                            4       41.28     <1.00
  A × B × C                        4       11.92     <1.00
  Error (w)                      124a      42.85

a Reduced because of data substituted for one subject.
* p < .025.
** p < .001.

Effects of variations in personalism and
interaction goals. Given that reflective rein-
forcement is effective in altering the posi-
tiveness of self-valuation, a second set of
predictions concerned the differential effec-
tiveness of reflective reinforcement as a
function of the personalism of the stimulus 
person and the subject’s interaction goal. The 
results of an analysis of variance in which 
these variations were compared, appear in 
Table 2. As evidenced, the variation of 
personalism did not contribute in any 
significant way. 

However, before concluding that variations 
in personalism had nothing to do with positive 
change, several other aspects of the data de- 
serve attention. First of all, Figure 1 suggests 
that there may have been differences in 
the cumulative effect of reinforcement in 
the accuracy-personal versus the accuracy- 
impersonal condition. In the former, self- 
valuation underwent a monotonic increase 
throughout the five trial blocks; in the latter 
the power of the reinforcement seemed to be 
lost by the end of the third trial block. 
Although the A × B × C interaction term,
which would indicate overall slope differences,
was not significant, an orthogonal polynomial
test of the components of slope (Ray, 1960)
yielded a significant (p < .005) linear trend
in the personal condition. The same test when
applied to the reinforced accuracy-impersonal
condition was not significant. When reflective
reinforcement is personal in nature, changes 
in self-rating thus seem to be continuous; 
when impersonal, reflective reinforcement 
seems only to have short-lived effects. 

Second, it was also apparent that some of 
the subjects who interacted with the personal 
stimulus person were not particularly con- 
vinced of her spontaneity; on the other hand, 
some subjects in the accuracy-impersonal
condition felt the stimulus person was quite
sincere. This fact prompted an investigation
of the relationship between perceived per-
sonalism and amount of positive increase
within each of these conditions. Correlations
were computed between perceived personal-
ism scores and the positive-change scores. In
the reinforced accuracy-personal condition
the correlation was high and just
short of reaching significance (r = .45); in
the reinforced accuracy-impersonal condition
the correlation was also high and significant
(r = .58, p < .05). Within the nonreinforced
accuracy conditions, however, these correla- 
tions did not begin to approach significance. 
When receiving reflective reinforcement, then, 
the more personal the agent providing such 
reinforcement is seen, the greater the tendency 
to react to this reinforcement. 

The effects on self-valuation of the varia- 
tion in interaction goals can now be exam- 
ined. It was predicted above that the 
ingratiation condition subjects would demon- 
strate positive increases which would exceed 
those produced by subjects under accuracy 
instructions. Further, it was predicted that 
this differential increase would manifest itself 
from the outset of the interview and would 
increase throughout the duration of the 
interview. The first of these predictions was 
fully confirmed; the positive change mani- 
fested in the ingratiation conditions was sig- 
nificantly (p < .001) greater than that found 
in the reinforced-accuracy conditions. An 
analysis of the first trial block also indicated 
that the ingratiation subjects manifested a 
greater initial increase in the positiveness of 
self-valuation than subjects in the accuracy 
conditions (p < .05). The lack of a signifi-
cant A × B × C interaction, however, indi-
cates that there was no tendency for the
ingratiation groups to increase the positive- 
ness of self-ratings at a faster rate than 
subjects in the accuracy groups. These results, 
however, seem to give full support to the 
importance of interaction goals as a factor in 
understanding self-presentation. 

Generalization. As Figure 1 has shown, the 
degree of generalization from the conditioning 
procedure to the subsequent testing period was 
substantial. A comparison of subjects’ scores 
on the generalization triads with scores for 
the same triads filled out in the neutral 
situation indicated all of the reinforced groups 
with the exception of the ingratiation- 
impersonal group showed a significant 
(p < .05) increase in the positiveness of
self-ratings. Neither of the nonreinforced 
groups, combined or considered separately, 
showed a significant increase in positiveness 
over their scores obtained in the neutral test- 
ing. Before drawing any conclusions from 
these results, however, several additional 
factors should be considered.

One of the hypotheses developed above
was that generalization resulting from per- 
sonal reinforcement would be greater than 
that produced by impersonal reinforcement 
(accuracy-instruction groups only). Although 
there was no significant difference between 
the two groups in the degree of decline in 
the positiveness of self-ratings from the fifth 
trial block to the generalization triads, a com- 
parison of “generalization gain” scores (i.e., 
generalization triad scores minus scores on 
the same triads under neutral conditions) 
lends support to this hypothesis. Whereas the 
reinforced accuracy-personal group differed 
significantly (p < .05) from its nonreinforced 
control in terms of amount of gain in self- 
valuation produced by the manipulations, the 
impersonal group did not differ from its non- 
reinforced control (F < 1.60). The difference 
between the differences, however, was not 
significant. 
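
The "generalization gain" comparison can be made concrete with a short sketch. Everything numeric below is made up for illustration and does not reproduce the study's data; the function name and group labels are likewise assumptions. Only the definition of the gain score (generalization triad score minus the score on the same triads under neutral conditions) comes from the text.

```python
from statistics import mean


def generalization_gains(generalization_scores, neutral_scores):
    """Per-subject gain: score on the generalization triads minus the
    score on the same triads in the neutral testing period."""
    if len(generalization_scores) != len(neutral_scores):
        raise ValueError("each subject needs one score from each session")
    return [g - n for g, n in zip(generalization_scores, neutral_scores)]


# Hypothetical scores for three subjects per group, for illustration only.
reinforced = generalization_gains([62, 58, 64], [55, 54, 60])
control = generalization_gains([56, 53, 58], [55, 54, 58])

print(reinforced)                      # [7, 4, 4]
print(control)                         # [1, -1, 0]
print(mean(reinforced) - mean(control))  # 5
```

Comparing each reinforced group with its own nonreinforced control, as the text does, separates the effect of the manipulation from any drift between the neutral session and the later testing.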

A second hypothesis concerned the amount 
of generalization in the reinforced accuracy 
groups as compared with the ingratiation 
groups. Here it was felt that calculated in- 
creases in positiveness would wash out more 
rapidly and completely than the more modest 
increases occurring in the reinforced-accuracy 
conditions. A glance at Figure 1 reveals that, 
indeed, loss in positiveness from the fifth
learning block to the generalization triads was 
much greater for the ingratiation groups than 
the accuracy groups. This difference proved 
to be significant at beyond the .001 level. 
However, of equal interest is a comparison 
of the generalization gain produced in these 
two conditions. A comparison of generaliza- 
tion gain scores indicates that there was no 
difference in the final level of positiveness 
reached in the accuracy or the ingratiation 
groups. It would then seem that calculated 
changes in self-ratings produce their own de- 
mands on personal consistency. This latter 
finding will receive further attention below. 

It could be argued that the subject was 
merely learning from the interviewer a new 
norm for the mechanics of filling out the 
items, and thus the generalization effects were 
only reflecting this new norm. In order to 
assess the validity of this argument a
comparison was made of subjects’ pre- and
postscores on the deCharms and Rosenbaum
self-esteem test. This test had been found to
correlate .58 with the self-valuation measure,
but was quite different in format. This
comparison revealed an increase in self-esteem
in all of the reinforced groups and the
combined pre- versus postdifference was
significant at beyond the .01 level. The smallest
increases were also found in the impersonal
conditions. An analysis of the nonreinforced
groups combined yielded no significant
difference. These findings would seem to increase
one’s confidence that, indeed, underlying
changes had been produced by the
experimental manipulations.

Awareness and changes in self-valuation.
Awareness of change in self-ratings was
tapped in several ways. First of all, subjects
were asked whether they felt they generally
filled out the items during the interview more
positively, more negatively, or approximately
the same as they had in the earlier testing
situation. Second, subjects were asked if the
interviewer said anything during the
administration of the triads, and, if so, what kind of
things she said. Finally, they were asked
whether they felt that anything the
interviewer said might have influenced the
number of points assigned to phrases. One of
these items failed to discriminate among
individuals: all subjects who had been
reinforced were fully aware that the interviewer
had been positive in her reactions toward
them.

Awareness was thus defined in the following
ways: for the accuracy conditions (reinforced
groups only) all those who felt they
were more positive during the interview than
earlier and who also felt the interviewer
influenced their self-ratings were deemed
“aware”; those who said that the
interviewer’s attitude could not have affected their
ratings were designated “unaware.” These
two measures were perfectly correlated. In
the ingratiation conditions all subjects said
they were more positive in the experimental
conditions than in the neutral. Thus
awareness in this instance was based on whether
the subject felt she had responded to the
feedback by creating a more positive image
or not. Those who felt they “stressed what
she seemed to approve” or “emphasized what
seemed to impress her,” for example, were
designated “aware.” Those who said that the
interviewer’s remarks had no effect on them
were deemed “unaware.” Subjects from the
personal and impersonal conditions appeared
in roughly equal numbers in the four
awareness groups.

FIG. 2. Mean positive change for aware and unaware
subjects in the reinforced conditions.

The positive-change means across the five 
trial blocks for the four groups are graphi- 
cally depicted in Figure 2. The findings are 
of considerable interest. First, in the only 
group which one could use to test whether 
learning could take place without awareness, 
i.e., the unaware-accuracy group, there
appears to have been a substantial increase in
positiveness. An analysis of variance supports
this observation, indicating that the difference
between neutral and experimental SVT scores
was significant beyond the .001 level. The
linear trend was also significant beyond the
.005 level. The amount of increase in
positiveness demonstrated in the aware-accuracy
group as compared with the unaware-accuracy
group was not significant (F < 1.00). Under
reinforced-accuracy conditions, then, unaware
subjects demonstrated changes in self-ratings
which, if anything, tended to be greater than 
those found in the aware subjects. 

Second, although unaware subjects in the 
accuracy conditions tended to increase the 
positiveness of self-ratings to a greater extent
than the aware group, in the ingratiation
conditions this tendency was reversed.
Although the total increase in positiveness in
the aware-ingratiation condition was not sig- 
nificantly greater than that manifested in the 
unaware-ingratiation condition, the picture as 
a whole is one of an interesting interaction: 
when subjects were told to be accurate, aware- 
ness of the fact that they might have been 
influenced by what the interviewer said 
seemed to operate against strong shifts in 
self-ratings. However, when told to make a 
positive impression, awareness of cue utiliza- 
tion operated in a fashion typical of tradi- 
tional learning without awareness studies: 
greater awareness led to greater increments
in learning. Although this interaction was not 
quite significant by the fourth trial block, 
in the fifth trial block it had reached 
significance (p < .05). 

One further finding of interest concerns the 
relationship between awareness and generali- 
zation. As can be seen in Figure 2, under 
accuracy conditions there seemed to be little 
relationship between the two. However, in 
the ingratiation conditions those who were 
aware of having used the reinforcement to 
their best advantage maintained a very high 
level of positiveness after the interview was 
over; those who said that the interviewer had 
no effect on their self-ratings dropped back 
quite substantially. A comparison of the 
neutral to postexperimental gain in positive- 
ness for the ingratiation-aware group with 
that found in the unaware group was signifi- 
cant at the .001 level. This finding will receive 
further attention in the following section. 


Discussion 


The present study fully supported the con- 
tention that social feedback in the form of 
reflective reinforcement is effective in in- 
creasing the positiveness of self-evaluations. 
The results indicated, however, that the de- 
gree of personalism of the agent providing 
feedback was only minimally related to the 
amount of increase. The variation in inter- 
action goals produced results by and large in 
line with predictions. In addition, it was 
found that the alterations in self-presentation 
generalized differentially to a subsequent 
testing period as a function of the conditions
producing such changes. Evidence also
indicated that self-presentation could be modified
without the subject’s being aware of such 
modifications, and that awareness was dif- 
ferentially related to increases in the posi- 
tiveness of self depending on the goal of the 
subject in the situation. 

Modifying the self-image. The prediction 
of an increase in the positiveness of self- 
valuation in the reinforced accuracy condi- 
tions was based on the assumption that 
reflective reinforcement would produce mo- 
mentary shifts in the subject’s feelings about 
self, and that this enhanced self-image would 
reflect itself in the subject’s public presenta- 
tion. All the evidence from this experiment 
was consistent with this supposition. The rele- 
vance of this finding to the hypothesis that 
the self-concept is a result of social learning 
should be noted. Wylie (1961) has com- 
mented that in spite of the importance of 
this hypothesis, her review of the literature 
uncovered only two questionable studies 
which made an explicit connection between 
the psychology of learning and the develop- 
ment of the self-concept. Although the sub- 
jects in this experiment were not necessarily 
affected in any profound way, the importance 
of the above findings would not seem to rest 
on the longevity of the produced effects. 
Given that the principle is valid, it would 
follow that with more intense learning experi- 
ences more deep-seated effects could be 
produced. 

It was further reasoned, however, that in
order to be maximally effective positive feed- 
back should be provided in a personal setting. 
However, the present study would appear to 
lend only minimal support to this specula- 
tion. In view of the gross differences in pro- 
cedure and setting between the personal and 
impersonal conditions, the question arises 
as to whether subjects in the reinforced 
accuracy-impersonal condition were manifest- 
ing behavior in the interview which resulted 
from a change in feelings toward self. From 
all indications the answer would seem to be 
both “yes” and “no.” For one thing, it is 
known from the ratings of perceived person- 
alism that there were subjects in this condi- 
tion who failed to see the behavior of the 
interviewer as impersonal, and these were
the subjects who demonstrated the greater
increases during the interview. 

On the other hand, it has been pointed 
out that positive feedback in the experiment 
could also be thought of as serving as a noti- 
fication to the subject of what the interviewer 
wanted or expected her to do in the situa- 
tion. In spite of the admonishment to be 
accurate during the interview subjects could 
well have felt that it was simply easier to 
give in to the pressure of social expectancy.
There is at least some evidence that pressures 
of social expectancy did have a greater ef- 
fect in the reinforced accuracy-impersonal 
condition than in the personal condition. First 
of all, subjects in the impersonal condition, 
in contrast to those in the personal condition, 
did not differ from the nonreinforced counter- 
parts in terms of positive increases produced 
during the generalization period. Second, all 
subjects were twice asked to rate their degree 
of sincerity in making their self-ratings, once 
after the neutral testing session and again 
after the experimental interview. For both 
nonreinforced control conditions and the re- 
inforced accuracy-personal condition there 
was a significant (p < .05) increase in the 
ratings of sincerity, i.e., subjects in all these 
conditions felt that they were more honest 
in the experimental procedure than during the 
neutral testing periods. However, for the re- 
inforced accuracy-impersonal subjects this 
increase was not found. One is led to
speculate that the experimental interview generally
caused subjects in the accuracy conditions to
feel they were being quite honest except in
the one condition, the reinforced impersonal,
in which subjects felt they were reacting more
to social pressures.

Ingratiation and generalization. As indi- 
cated above, the effects on self-presentation 
of the variation in interaction goals were 
much as anticipated. Subjects in the ingrati- 
ation conditions began rating themselves very 
positively from the initial stages of the
interview and continued to do so for the remainder
of the interview. However, there was no
evidence that subjects in these conditions
increased the positiveness of their self-ratings
at a more rapid rate than those in the 
reinforced-accuracy conditions. In view of the 
extremely positive self-ratings with which the
ingratiation subjects began the interview this
result is not surprising. An examination of the
distribution of SVT scores for the entire
population of undergraduates tested
revealed that the level of positiveness reached
by the ingratiation subjects in the first trial
block would have been exceeded by only the
[...] 8% of this population under normal
circumstances. It seems clear that the
boundaries of credibility were reached much too soon
by the ingratiation subjects for the
anticipated cumulative increase to manifest itself.

Somewhat more intriguing, however, were the
results obtained from the ingratiation
subjects on the measure of generalization. In
terms of positive increase from the neutral
testing to the generalization period, subjects
in the ingratiation conditions were quite
similar to those in the reinforced-accuracy
conditions. It seemed, in other words, that
some change in covert feelings of self-regard
had been produced by the intentional
modification of overt self-ratings. Additional
information concerning this phenomenon was
found in the data on awareness. The
ingratiation subjects who were aware of having used
the reinforcement provided by the interviewer
as cues for modifying their responses
accounted for almost all of the increase in
positiveness of self-ratings.

In attempting to account for these results
it should be noted that somewhat similar
effects have been discussed in the literature
on role playing and attitude change. For
example, Kelman (1953) and Janis and King
(1954) have demonstrated that significant
attitude change can be induced by having
subjects participate in support of a position
towards which they are initially opposed. It
is also of importance that these studies have
found that the more initiative taken, or the
greater the improvisation shown in this
participation, the greater the resultant change in
attitude. These findings have been explained
in two ways. First of all they have been
explained in terms of implicit verbal responses
which differentially accompany high and low
initiative groups. Those who demonstrated
more initiative, it was reasoned, thought of
many more arguments supporting the side
initially opposed and were less hampered by
conflicting thoughts arising from the overt
pressures to participate.

Applying this reasoning to the present re- 
sults, it seems safe to assume that the aware- 
ingratiation subjects expended more effort in 
consideration of information which would 
make a good impression. They seemed to be 
attuned to the contingencies of the situation 
and were able to improvise accordingly. On 
the other hand, the unaware-ingratiation 
group seemed to respond to the instructions 
as they would to a flat ultimatum to rate 
themselves positively. This seems evidenced 
in the more or less unchanging pattern of 
responses displayed from the beginning of the
interview to its end. It might thus be said 
that the aware-ingratiation group, in their 
more vigorous involvement, brought to mind 
many more positive features of their own 
personalities. Once out of the interview, they
manifested a higher level of positiveness in 
congruence with their immediately preceding 
view of the self. 

A second interpretation of the effects of 
role playing on attitude change has been of- 
fered by Brehm (1960). Brehm has suggested 
that subjects in the high initiative groups 
were committing themselves to a position 
which was dissonant with their initial atti- 
tude, and attitudes were altered in a direction 
consistent with the commitment in order to 
reduce dissonance. Although such an
explanation would be consistent with the present
data, there seems to be no good reason for 
contending that the ingratiation subjects were 
in a position which would arouse dissonance. 
Subjects in these conditions were offered no 
choice as to whether to be ingratiating or not, 
and the experimenter’s instructions made it 
clear that falsification of self-ratings was ap- 
propriate behavior during the interviewing 
situations. There would thus seem to be 
little reason to suppose these subjects felt 
personally committed to a dissonant position. 
These latter findings would seem to have 
important implications for much everyday 
behavior. Most of a person’s daily activity 
can be looked at as role behavior. The above
results seem to imply that playing out these
roles in various social situations may have
important reflexive effects on the way a
person views himself. Further, it would seem
that it is precisely the person who is
sensitized to the cues provided by others who is
most deeply affected. However, it remains a 
question as to the amount of change in a 
person’s feelings about self which can be 
produced in such instances in the absence of 
social feedback. In addition, it would be inter- 
esting to know whether such changes are 
likely to occur when the role being played is 
self-deprecating rather than self-enhancing.

Past work has indicated that flustered or confused speech can be classed into
several distinct speech disturbance categories. Such disturbances, occurring
frequently in everyday conversation, have no conventional semantic function.
In the present study, 25 experimental and 20 control male Ss were used.
Anxiety was manipulated in an interview setting. Under anxiety, the frequency
of all speech disturbances, except the familiar “ah,” showed a sizable increase.
The frequency of ah’s increased strikingly in a change from normal to a
telephonelike conversation. Such change did not affect the other disturbances.
Measurement of palmar sweat revealed modest positive association with the
speech disturbances. Exploration of the relationship of the Taylor Manifest
Anxiety scale to the disturbances suggested that the ah is functionally distinct
from the other speech disturbances.


This paper is concerned with an investiga- 
tion of the relationship between experi- 
mentally induced anxiety and certain kinds 
of speech disruptions. In approaching the 
problem of psychological analysis of human 
speech and its content, one can take, roughly, 
two general directions. The first one stresses 
the manifest content of the message and relies 
more or less on its face validity. Moreover,
when it becomes quantitative, it seems to
assume an isomorphic relation between a
particular behavioral state (e.g., the intensity
of fear) and the quantitative aspects of the
content (e.g., the frequency of statements
about fear). The direction of the second
approach leads one to emphasize the
instrumental function of language and speech,
which may frequently lie in the seemingly
irrelevant, nonlexical aspects of the message.
Thus language may have signal properties
which lie beyond its symbolic nature. The
intent of the experiment described below was
to demonstrate the usefulness of the latter
approach without arguing, except perhaps
indirectly, the weaknesses of the former.
Language is seen as an expressive behavior
system with instrumental and reactive
properties; the speech disturbances, the “flustered”
speech, are taken to be a nonlexical attribute
that is relatively free from both linguistic and
general social control. That is, these aspects
of speech are assumed to be linguistically
irrelevant.


SPEECH DISTURBANCES AND PREVIOUS 
WORK WITH THEM 


High-fidelity tape recordings of
psychotherapy interviews have revealed that
instances of confused or “flustered” speech can
be analyzed into a variety of separate, identifi- 
able forms of disturbances. Table 1 presents 
the eight empirically formulated categories 
and gives an example of each. The reader is 
referred to a previous publication (Mahl, 
1956b) for a more detailed description of 
these disturbances. 

The speech disturbances have no conventional
semantic function and are not a part of
the intentional conscious communication oc- 
curring between people. These disturbances 
are surprisingly commonly occurring phenom- 
ena, found in the speech of psychoneurotic 
patients, therapists, Yale undergraduates, and 
faculty members alike (Mahl, 1956c). Table 2 
presents the relative frequencies of the various 
speech-disturbance categories. In terms of 
absolute frequencies, one of the disturbances 
occurs, on the average, for every 16 “words” 
spoken; this is equivalent to one disturbance 
for every 4.6 seconds the individual spends 
talking (Mahl, 1956a). 
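
The two absolute frequencies quoted above (one disturbance per 16 "words" and one per 4.6 seconds of talking) together imply an average speaking rate. The short Python sketch below only works through that arithmetic; the derived rates are not stated in the source and are shown purely as a consistency check on the two figures.

```python
# Figures quoted in the text (Mahl, 1956a).
WORDS_PER_DISTURBANCE = 16
SECONDS_PER_DISTURBANCE = 4.6

# Derived quantities (not given in the source).
words_per_second = WORDS_PER_DISTURBANCE / SECONDS_PER_DISTURBANCE
words_per_minute = words_per_second * 60
disturbances_per_minute = 60 / SECONDS_PER_DISTURBANCE

print(f"{words_per_second:.2f} words per second of talking")
print(f"{words_per_minute:.0f} words per minute of talking")
print(f"{disturbances_per_minute:.1f} disturbances per minute of talking")
```

Under these figures the implied rate is roughly three and a half "words" per second of actual talking time, which is plausible for conversational speech.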

Previous findings suggest the potential
usefulness of the speech disturbance as an index
of ongoing anxiety. An analysis of the
profiles of speech disturbance levels within
therapy hours has revealed nonchance
fluctuations, as well as significant relationships with
independent ratings of the changes in anxiety
during therapeutic interviews (Mahl, 1956b).
Global ratings of anxiety during intake
therapy interviews revealed a significant
positive relationship with the general speech
disturbance level (excluding the “ah”) for
female patients and a nonsignificant negative
one for the male patients (Mahl, 1956).
Dibner’s (1956) cue counting, a procedure
which pays attention to similar disruptions
in speech, also reveals a positive association
with anxiety.

TABLE 1
DEFINITIONS AND ILLUSTRATIONS OF THE SPEECH-DISTURBANCE CATEGORIES

1. “Ah.” Wherever the ah sound occurs it is scored. Less frequent
   variants are “eh,” “uh,” “uhm.”
   Example:  Well . . . ah . . . when I first came home.

2. Sentence change. A correction in the form or content of the
   expression while the word-word progression occurs. To be scored,
   these changes must be sensed by the listener as interruptions in
   the flow of the sentence.
   Examples: Well she’s . . . already she’s lonesome.
             That was . . . it will be 2 years ago in the fall.

3. Repetition. The serial, superfluous repetition of one or more
   words, usually of one or two words.
   Examples: ’Cause they . . . they get along pretty well together.
             He was . . . he was sharing the office.

4. Stutter.
   Example:  It sort of well l . . . l . . . leaves a memory.

5. Omission. Parts of words, or rarely entire words, may be omitted.
   Contractions not counted. Most omissions are of final one or two
   parts of words and are associated with sentence change and
   repetition.
   Examples: She mour . . . was in mourning for about 2 years.
             Then their anni . . . wedding anniversary comes around.

6. Sentence incompletion. An expression is interrupted, clearly left
   incomplete, and the communication proceeds without correction.
   Example:  Well I’m sorry I couldn’t get here last week so I
             could . . . ah . . . I was getting a child ready for camp
             and finishing up swimming lessons.

7. Tongue-slips. Includes neologisms, the transposition of entire
   words from their “correct” serial position in sentence, and the
   substitution of an “unintended” for an intended word.
   Examples: We spleat the bitches (for “split the beaches”).
             He was born in their hou(se) . . . hospital and came to
             their house.
             The reason that I don’t . . . didn’t seem to feel the love
             for him (son) that I felt for J——— (daughter).

8. Intruding incoherent sound. A sound which is absolutely incoherent
   to the listener. It intrudes without itself altering the form of
   the expression and cannot be clearly conceived of as a stutter,
   omission, or neologism (though may be such in reality).
   Example:  If I see a girl now I’d like to take out I
             just . . . dh . . . ask her.

TABLE 2
CATEGORY PERCENTAGES BASED ON DATA FROM PATIENTS IN INITIAL AND THERAPY
INTERVIEWS, INTERVIEWERS, UNDERGRADUATES, AND FACULTY MEMBERS

Speech-disturbance     | Average percentage   | Cumulative
category               | of all disturbances  | percentage
“Ah”                   |        40.5          |    40.5
Sentence change        |        25.3          |    65.8
Repetition             |        19.2          |    85.0
Stutter                |         7.8          |    92.8
Omission               |         4.5          |    97.3
Sentence incompletion  |         1.2          |    98.5
Tongue-slip            |         0.7          |    99.2
Incoherent sound       |         1.2          |   100.4
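
The last column of Table 2 appears to be a running total of the category percentages. A small Python sketch can check that reading. Two of the per-category values (0.7 for tongue-slip and 1.2 for incoherent sound) are assumptions inferred from the running totals, because those cells are damaged in the available copy; the remaining values are as printed.

```python
from itertools import accumulate

# (category, average percentage of all disturbances)
# The last two percentages are inferred from the cumulative column.
table_2 = [
    ("Ah", 40.5),
    ("Sentence change", 25.3),
    ("Repetition", 19.2),
    ("Stutter", 7.8),
    ("Omission", 4.5),
    ("Sentence incompletion", 1.2),
    ("Tongue-slip", 0.7),
    ("Incoherent sound", 1.2),
]

# Running total, rounded to one decimal as in the printed table.
cumulative = [round(total, 1) for total in accumulate(p for _, p in table_2)]

for (category, pct), total in zip(table_2, cumulative):
    print(f"{category:<22}{pct:>6.1f}{total:>8.1f}")
```

The total slightly exceeds 100, which would be expected if each entry is an independently rounded average.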

The object of the present study is to
explore, in an experimental setting and in
greater detail, the relation of these speech
disruptions to anxiety.


METHOD 
Design 


The subjects in the experiment, 25 experimental
and 20 control subjects, were Yale undergraduates.
They were paid volunteers, recruited from the [...]
of the university bureau which provides part-time
employment for students. The subjects seemed quite
homogeneous on intelligence and socioeconomic
background. This homogeneity was desirable but
not necessary since the speech-disturbance levels
do not seem to be related, in a heterogeneous
population of 39 neurotic patients, to sex, age,
education, or socioeconomic status (Mahl, 1956a).

The experimental procedure utilized an interview
sequence in a room with facilities for sound record- 
ing and observation (Mahl, Dollard, & Redlich, 
1954). The subjects also took the individual form 
of the MMPI (Hathaway & McKinley, 1951) and 
their palmar moisture was recorded five times during 
the entire experiment. The procedure for administer- 
ing the palm moisture test, described by Mowrer, 
Light, Luria, and Zeleny (1953), is fully standardized 
and objective. 

The following is a more detailed account of the 
experimental procedure. The subject is brought into 
the interview room with facilities for sound record- 
ing and observation. After the first palm moisture 
test, the subject is seated in a comfortable armchair 
and puts on a set of earphones. This becomes the 
“neutral” interview (Interview A), lasting about 30
minutes. During one half of the interview, the
experimenter is in the same room with the subject;
during the other half, the experimenter is in the 
monitor room (separated from the interview room 
by a wall and a one-way mirror and containing the 
recording and monitoring apparatus). During the 
whole interview the subject has on a set of ear- 
phones through which he hears the experimenter. 
The above procedure is controlled for sequence 
effects, so that one half of the subjects are alone 
in the interview room during the first half of In- 
terview A and with the experimenter during the 
second; for the other half of the subjects, the 
Sequence is reversed. When the experimenter is in 
the monitor room, he hears the subject through a 
microphone hidden in a lamp on a table next to
the armchair. The subject is told where the
microphone is located and that the interview is being
recorded. All through Interview A, the subject is
put at ease and encouraged to talk freely about
school, extracurricular activities, his background,
and so on. Direct and “searching” questions are
avoided. When Interview A is terminated, the
experimenter administers the second palm moisture test.
Thereafter the subject takes the MMPI and then [...]


The second session, following a period of at least
24 hours after the first one, is started by the
administration of the third palm moisture test.
Thereafter the subject completes several intellectual tasks,
irrelevant to this presentation, and the fourth
palm moisture test is then administered. At this
point the subject is left in the interview room and
the experimenter goes into the monitor room. This
is the setting for the “anxiety” interview (Interview
B). The experimenter manipulated anxiety in two
ways: presenting each subject with the same set of
anxiety-provoking stimuli and suggesting anxiety-provoking
topics for discussion. The standard set of
anxiety-provoking stimuli included the following: revealing
to the subject at that point the presence of the
one-way mirror through which he is being observed;
remarking that the long test he took was an “adjust- 
ment inventory” and then asking him what he 
remembers about it; announcing that certain topics 
of the experimenter’s choice will now be probed in 
greater detail; interspersing the discussion with oc- 
casional silences at natural breaks in the conversa- 
tion where the subject might have reasonably ex- 
pected the experimenter to say something. The other 
kind of manipulation was not standard across all 
subjects because the writers believed that the same 
set of discussion topics would not necessarily arouse 
anxiety in all subjects. Consequently, the experi- 
menter used his knowledge of each individual sub- 
ject, gained from Interview A and from the MMPI, 
to suggest areas for discussion which he believed 
would prove anxiety-provoking to the particular 
subject. The probing questions were phrased as 
“invitations to explore” some specific issue; direct
questions, answerable in short sentences, were 
avoided. Interview B lasted about 30 minutes and 
was terminated by the administration of the fifth 
palm moisture test.

At the end of the experiment, each subject was 
briefly told about its purpose and was assured 
that all of the experimenter’s statements were part
of the experiment and were addressed to all of the 
subjects and that what the experimenter had said had 
nothing to do with what he really thought of the 
subject. The subject was then asked to express any 
feelings, comments, or questions that he might have. 
Finally, the subject was dismissed when it appeared 
that any possible aftereffects of the experiment 
were gone. 

Twenty control subjects were included in the 
design of the experiment. These subjects came for
two sessions and went exactly through the same 
interview sequence as did the experimental subjects, 
the only difference being that they were given the 
same freedom of choice of conversation topics 
during the second session as during the first; that 
is, both sessions were “neutral.” Ten of the control 
subjects were also given four palm moisture tests 
before and after Interviews A and B. The control
subjects were not given any of the other tests. 

The taped recordings of the interviews were 
transcribed and the typescripts were then corrected 
for inaccuracies by listening again to the
high-fidelity tapes. Thereupon, the speech disturbances
were scored, using both the tapes and the
typescripts. Three different speech-disturbance measures
were used: the ah ratio—the number of “ah's” in 
speech, divided by the number of words spoken; 
the non-ah ratio—the sum of all the disturbances, 
except the ah, divided by the number of words 
spoken; the ratios for the individual speech dis- 
turbance categories. The ah measure was isolated 
from the other categories because previous work 
had indicated that it behaves in a manner different 
from that of the other disturbances (Mahl, 1955). 
It also merits special consideration because it is the
most frequent category (on the average, but not 
for every subject) and untrained but sensitive
listeners are most aware of it in ordinary
conversation.

TABLE 3
INTERCORRELATIONS OF THE PALMAR SWEAT DATA

Administration of the |        |             |        |
palmar sweat test     | Second | Third       | Fourth | Fifth
First                 |  .82   |  .70        |  .62   |  .64
Second                |   -    | [illegible] |  .82   |  .80
Third                 |        |   -         |  .58   |  .10
Fourth                |        |             |   -    |  .84
Fifth                 |        |             |        |   -
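
The three speech-disturbance measures described above are all simple ratios to the number of words spoken. A minimal Python sketch, assuming the scorer has already tallied disturbances by category, could look like this. The function name, the category labels, and the counts are hypothetical and are not taken from the study.

```python
def disturbance_ratios(counts, words_spoken):
    """Return the ah ratio, the non-ah ratio, and per-category ratios.

    counts: mapping of category name to number of scored disturbances.
    words_spoken: total number of words in the speech sample.
    """
    if words_spoken <= 0:
        raise ValueError("words_spoken must be positive")
    ah = counts.get("ah", 0)
    # Non-ah ratio: every scored disturbance except the ah.
    non_ah = sum(n for category, n in counts.items() if category != "ah")
    return {
        "ah_ratio": ah / words_spoken,
        "non_ah_ratio": non_ah / words_spoken,
        "category_ratios": {c: n / words_spoken for c, n in counts.items()},
    }


# Made-up tallies for a single 3,000-word interview.
example_counts = {"ah": 90, "sentence_change": 60, "repetition": 45, "stutter": 15}
ratios = disturbance_ratios(example_counts, words_spoken=3000)
print(ratios["ah_ratio"], ratios["non_ah_ratio"])
```

Keeping the ah count out of the non-ah ratio mirrors the separation the text motivates: the two measures can then move independently across interview conditions.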

The procedure for recording the subject’s palmar
moisture yields fingerprints which vary in darkness. 
The darkness of these prints, reflecting the level 
of palmar moisture, was measured by a densitometer. 
This instrument was constructed according to the 
directions given by Mowrer et al. (1953). 


RESULTS AND DISCUSSION 
Reliabilities 


In order to determine the reliability of the 
densitometer, the scores for 10 randomly 
selected subjects were used. This represents 
50 separate administrations of the palm mois- 
ture test. The reliability of the densitometer 
is .99, reflecting the agreement between two 
scorings of the same prints by the same scorer 
and, of course, suggests that the densitometer 
does not introduce any perceptible error. The 
reliability of the palm moisture test itself is 
not reported by Mowrer et al. (1953), and 
is not known to the writers. For comparison 
purposes, however, Table 3 may be consulted. 
It shows the intercorrelations of the palm 
moisture test for the five administrations and 
can be viewed as reflecting high stability of 
individual differences. (The intercorrelations 
of the first three administrations are based on 
the data from both the 25 experimental and 
10 control subjects. The remaining correla- 
tions reflect only the data from the experi- 
mental subjects because the conditions were 
no longer comparable for the two groups of 
subjects.) The values in Table 3, moreover, 
can be seen as a lower bound for a test-retest 
reliability estimate of the palm moisture 
measure. This is because in estimating test- 
retest reliability, one tests under approxi- 
mately the same conditions, whereas the suc- 
cessive administrations of the measure in the 
experiment were all under somewhat different 
conditions, hence the lower bound. 

The reliabilities of scoring the speech dis- 
turbance ratios are as follows: total ratio .99, 
ah ratio .99, non-ah ratio .96, sentence-change 
ratio .87, repetition ratio .97. These reli- 
abilities represent the agreement of two 
independent judges, using the tapes and un- 
scored typescripts of five randomly selected 
subjects and scoring 36 randomly selected 
pages of transcripts of these subjects’ speech. 
In other words, the unit here was a double- 
spaced typewritten page of transcript and 
the correlation was between the two independ- 
ent scores for a particular disturbance for that 
page. These reliabilities are about the same 
as those that had been obtained previously 
with other trained scorers (Mahl, 1956b). 
The remaining typescripts were scored by one 
writer and checked by the other. Table 4 
presents the intercorrelations of the two major 
speech-disturbance ratios. The 20 control 
subjects are included in the data for Interview 
A inasmuch as the interview conditions for 
the control and the experimental subjects 
were equivalent. 

The mean length of the typescripts of the 
two interviews for the 45 experimental and 
control subjects was 6790 words (SD = 2021). 
The experimental and control subjects did not 
differ much in their average verbal output 
during Interview A (3687 and 3847 words, 
respectively) but did show a sizable difference 
for Interview B (2675 and 3478 words, re- 
spectively). The latter difference is a con- 
sequence, in part, of the experimental manipu- 
lation: during Interview B, the experimenter 
intentionally allowed occasional silences to 
develop during the conversation. The absence 
of the experimenter during one half of Inter- 
view A had no effects on the verbal output 
of the subjects; their average was 1895 words 
as compared with 1863 words for the other 
half, during which the experimenter was 
present. The verbal output of the experi- 
menter was minimal and did not differ sig- 
nificantly across the different conditions. 


TABLE 4 
INTERCORRELATIONS OF THE MAJOR SPEECH-DISTURBANCE RATIOS 

                                       1      2      3      4      5      6 
Interview A (experimenter present) 
  1. Ah ratio                          —     .49*   .85*   .39    .31    .64 
  2. Non-ah ratio                             —     .50*   .81*   .37    .88 
Interview A (experimenter absent) 
  3. Ah ratio                                        —     .43*   .85    .60 
  4. Non-ah ratio                                           —     .33    .82 
Interview B (experimenter absent) 
  5. Ah ratio                                                      —     .52 
  6. Non-ah ratio                                                         — 

Note.—Boldfaced correlations reflect the stability of individual differences. 
* Correlations are based on all 45 experimental and control subjects. The remaining 
correlations are based on 25 experimental subjects. 


Validation of the Experimental Manipulation 


It is being assumed that the palm moisture 
data reflect the subjects’ anxiety levels and 
that, consequently, they can be used to 
demonstrate anxiety differences between neu- 
tral Interview A and anxiety-provoking Inter- 
view B. Figure 1 shows the mean palmar 
sweat values for the 25 experimental subjects 
during the five separate administrations. The 
units in that figure are standard scores based 
on the mean and standard deviation of the 
raw palmar sweat scores for the 25 subjects 
across all five administrations. 

[Figure 1. Vertical axis: Amount of Palmar Sweat (z scores), with the mean for 
all Ss and all five administrations marked. Horizontal axis: Administrations of 
the Palm Moisture Test (1st, pre-A; 2nd, post-A; 3rd; 4th, pre-B; 5th, post-B).] 
FIG. 1. Mean levels of palmar sweat for the 25 experimental subjects at different 
stages of the experiment. 

The experimental manipulation would lead 
one to expect no change or some decrease in 
palmar sweat during Interview A and a 
significant increase during Interview B. The 
data support these expectations. During Inter- 
view A, 17 out of 25 subjects decreased in 
their palmar sweat (p < .06), whereas 18 
out of 25 subjects increased in their palmar 
sweat during Interview B (p = .02). The dif- 
ference between the decrease during Inter- 
view A and the increase during Interview B 
is also significant (χ² = 6.5, p < .02). The 
10 control subjects on whom palmar sweat 
data were collected showed only random 
fluctuations in their palmar sweat. During 
the first interview, 5 subjects showed an in- 
crease and 5 subjects a decrease in the amount 
of sweat. During the second interview, 6 out 
of the 10 subjects showed an increase in the 
amount of sweat. 
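
The probabilities attached to the counts of subjects who decreased or increased in palmar sweat are of the kind a sign test yields. The text does not name the test, so the sketch below rests on an assumption: a one-tailed binomial (sign) test with a chance probability of .5. Under that assumption the counts 17 of 25 and 18 of 25 give values of about .054 and .022, in line with the reported p < .06 and p = .02. The function name is hypothetical.

```python
from math import comb


def sign_test_one_sided(successes, n):
    """P(X >= successes) for X ~ Binomial(n, 0.5), i.e. a one-tailed sign test."""
    if not 0 <= successes <= n:
        raise ValueError("successes must lie between 0 and n")
    favorable = sum(comb(n, k) for k in range(successes, n + 1))
    return favorable / 2 ** n


# Counts taken from the text: 17 of 25 decreased during Interview A,
# 18 of 25 increased during Interview B.
p_decrease_a = sign_test_one_sided(17, 25)
p_increase_b = sign_test_one_sided(18, 25)
print(round(p_decrease_a, 3), round(p_increase_b, 3))
```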

A second procedure was adopted to check 
upon the presence of anxiety in Interview B. 
Since this method was relatively time con- 
suming, it was applied to only a small sub- 
sample of the experimental subjects. The 
tapes for six subjects were reedited in such a 
way that a new tape contained 10 1-minute 
segments of the subject’s speech (with the 
experimenter’s questions and remarks erased), 
5 from each interview. These segments were 
randomly selected and randomly distributed 
so that the two judges rating the amount of 
anxiety present would have no clues available 
to them other than those coming from the 
subject’s speech. The interjudge reliability 
was satisfactory, albeit not high (r = .77); 
the 30 segments from Interview B were rated 
significantly higher on anxiety than the 30 
segments from Interview A (p < .04). 


Changes in the Speech-Disturbance Ratios 


The major hypothesis of the study predicts 
a higher frequency of the non-ah speech dis- 
turbances during the anxiety-provoking Inter- 
view B than during the neutral Interview A. 
The results strongly support the hypothesis. 
Twenty-four of the 25 experimental subjects 
showed an increase in their non-ah ratios (p 
< .0001). Of the 20 control subjects, only 12 
showed such an increase from the first to the 
second interview (ns). The difference between 
the experimental and the control groups is 
significant at the .004 level, using Fisher’s 
exact test. 
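
The comparison of the two groups can be retraced with a short calculation of Fisher's exact test on the 2 x 2 table of subjects who did or did not show an increase (24 and 1 in the experimental group, 12 and 8 in the control group). The sketch assumes a one-tailed test, which the text does not state; under that assumption the result is close to the reported .004. The function name is hypothetical.

```python
from math import comb


def fisher_exact_one_sided(a, b, c, d):
    """One-tailed Fisher exact probability for the table [[a, b], [c, d]].

    Returns the hypergeometric probability, with all margins fixed, of
    observing a count of at least `a` in the top-left cell.
    """
    row1 = a + b            # size of the first group
    col1 = a + c            # total number showing an increase
    total = a + b + c + d
    upper = min(row1, col1)
    tail = sum(
        comb(col1, k) * comb(total - col1, row1 - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(total, row1)


# Experimental group: 24 increased, 1 did not. Control group: 12 increased, 8 did not.
p = fisher_exact_one_sided(24, 1, 12, 8)
print(round(p, 4))
```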

The non-ah ratio is a measure which lumps 
together seven different and distinct speech- 
disturbance categories. Are all of these cate- 
gories sensitive to the presumed difference in 
anxiety between Interviews A and B, or are 
there only a few of the categories which are 
primarily responsible for the large increase 
in the non-ah ratio? The data reveal that all 
seven categories show a significant increase 
from Interview A to B (p < .01, using t tests). 
The increases in the case of the infrequent 
categories are quite small but consistent 
enough to yield significant results. This sug- 
gests that combining the various speech cate- 
gories is a meaningful procedure for obtaining 
a single index, the non-ah ratio, which re- 
flects changes in anxiety. The comparable 
changes for the 20 control subjects showed 
random variations for all categories except 
“sentence change”: here, the ratios during the 
second interview were higher than the ratios 
during the first (p < .05). There is no ap- 
parent explanation for this isolated finding. 

No predictions were made about the ah 
ratio inasmuch as past work (Mahl, 19[year illegible]) 
has suggested that the two kinds of speech 
disturbances, the ah ratio and the non-ah 
ratio (the remaining categories), are func- 
tionally different and may not be equally 
sensitive to anxiety. 

In the present study, the ah ratio showed 
no significant differences, either among the 
experimental or the control subjects, between 
the first and the second interviews. Thus, 
the ah ratio was apparently not sensitive to 
the anxiety manipulation. However, an inter- 
esting finding comes to light when one com- 
pares the effects of the presence versus 
absence of the experimenter within Interview 
A. This difference, which leaves the non-ah 
ratio unchanged, has a strong effect on the ah 
ratio: 32 out of the 45 subjects had higher 
ah ratios when the experimenter was absent, 
in the monitor room, 2 subjects showed no 
changes, and only 11 subjects had higher ah 
ratios when the experimenter was present, to- 
gether with the subject in the interview room 
(p < .001). All 45 subjects are used here be- 
cause the first interview was the same for both 
experimental and control subjects. 

In summary, then, we have two kinds of 
speech disturbances: the ah ratio, which ap- 
parently does not change with experimentally 
produced anxiety but is sensitive to a change 
from a face-to-face to a telephonelike con- 
versation; the non-ah ratio, which is sensitive 
to anxiety but does not change when one 
switches from a normal to a telephonelike con- 
versation. The remainder of this paper is de- 
voted to a more detailed exploration into the 
meaning of these two ratios. 


Non-Ah Ratio and Other Indices of Anxiety 


The average increase per subject in the 
non-ah ratio, going from Interview A to B, 
was 34.2%. Clearly, the experimental manip- 
ulation of anxiety had a sizable effect. Two 
other partial indicators of anxiety can be 
obtained from the available data: the palm 
moisture index and the Taylor Manifest 
Anxiety (MA) scale, derivable from the 
MMPI (Taylor, 1953). 

The palm moisture index is not an un- 
ambiguous indicator of anxiety and certain 
obvious questions arise: does the amount of 
recorded sweat accurately reflect the sweating 
during the preceding 5 minutes, 10 minutes, 
or 30 minutes? Are the absolute values across 
individuals meaningful or should one use only 
relative changes in these values? Such ques- 
tions must be kept in mind as one looks at 
Table 5, which presents the major results 
concerning the relationship of the frequency 
of flustered speech to palmar sweat. The 
non-ah ratios during Interviews A and B are 
related to two indices of sweat: as assessed at 
the end of each interview and as a deviation 
from each subject’s own basal or “resting” 
level. (The third palm moisture test was 
taken as the best approximate indicator of 
the subjects’ “normal” level of sweat. See 
Figure 1.) The data for Interview A include 
the 10 control subjects inasmuch as up to 
that point the test situations were comparable 
for both the experimental and control groups. 
The two alternate indices of sweat are only 
moderately correlated with each other: the 
correlation is .45 (N = 35) for Interview A 
scores and .32 (N = 25) for Interview B 
scores. This suggests that the alternate meas- 
ures are not interchangeable. 

The data in Table 5 reveal a mild positive 
association between flustered speech and 
palmar sweating. Moreover, the last row in 
that table demonstrates that subjects who 
showed a relatively large increase in sweat 
during Interview B, as compared with their 
change in sweat during Interview A, had a 
larger increase in the non-ah ratio than sub- 
jects who showed a relatively small increase 
in sweat. 

The findings concerning the relationship of 
the MA scale to the speech disturbances are 
a good deal more tentative. The scores on the 
MA scale correlated −.53 with the K scale 
of the MMPI and a statistical correction was 
applied (Guilford, 1956, p. 367) to partial 
out defensiveness, which is presumably re- 
flected by the K scale. Henceforth, any 
reference to the MA scale will mean only 
these scores corrected for defensiveness. 

TABLE 5 
RELATIONSHIP OF THE NON-AH RATIO (NUMBER OF 
SPEECH DISTURBANCES PER 1000 WORDS) TO 
SEVERAL INDICES OF PALMAR SWEAT 

                                          Amount of sweat 
Non-ah ratio                              High         Low          df    t 
Mean non-ah ratio during Interview A 
  Sweat measured at the end of            45.8         39.2         33    [unclear] 
    Interview A                           (N = 17)     (N = 18) 
  Sweat measured as the deviation of      48.2         35.6         33    [unclear] 
    the value obtained at the end of      (N = 19)     (N = 16) 
    Interview A from each subject’s 
    own basal level 
Mean non-ah ratio during Interview B 
  Sweat measured at the end of            57.2         44.4         23    [unclear] 
    Interview B                           (N = 13)     (N = 12) 
  Sweat measured as the deviation of      59.8         43.1         23    [unclear] 
    the value obtained at the end of      (N = 12)     (N = 13) 
    Interview B from each subject’s 
    own basal level 
Mean net increase in non-ah ratio 
from Interview A to B 
  Sweat measured as the net compari-      [illegible]  [illegible]  23    2.27** 
    son of the change in sweat during 
    Interview A [remainder of label illegible] 

[The t values legible in the scan for the first four rows are 2.45, 1.62, and 2.22; 
their row assignment and significance markers could not be recovered.] 
* p = .06. 
** p = .02. 
*** p < .01. 

The data reveal no linear relationship be- 
tween the non-ah ratio and the MA scale. 
However, if the subjects are divided into three 
groups with high, intermediate, and low scores 
on the MA scale, then a hint of a curvilinear 
relationship appears. These data are presented 
in Table 6. The curvilinear relationship was 
not anticipated and as yet does not par- 
ticularly make sense. However, Schwarzen- 
burg, Feldstein, and Jaffe (1963) report a 
curvilinear relationship between speech dis- 
ruption and uncertainty contained in the 
passages which were read. Thus our finding 
may not be an isolated one. 

TABLE 6 
RELATIONSHIP OF THE NON-AH RATIO (NUMBER OF 
SPEECH DISTURBANCES PER 1000 WORDS) TO 
THE TAYLOR MANIFEST ANXIETY SCALE 

                                     Subjects reporting: 
                                     Moderate     Either high or 
Non-ah ratio                         anxiety      low anxiety        t 
Mean non-ah ratio during 
  Interview A                        46.8         34.0               1.99* 
Mean non-ah ratio during 
  Interview B                        63.1         44.3               2.44** 
Mean net increase in non-ah 
  ratio from Interview A to B        16.4         10.3               1.53 

* p = .06. 
** p = .02. 


Meaning of Ah in Speech 


It has been seen that the ah ratio, which 
did not change with experimentally produced 
anxiety, is highly sensitive to a change from 
a face-to-face to a telephonelike conversation: 
the average change in the ah ratio for the 45 
subjects is a 41.2% increase from the inter- 
view-room situation (experimenter is present) 
to the monitor-room situation (experimenter 
is absent). In this comparison, sequence ef- 
fects are randomized and experimentally pro- 
duced anxiety remains constant. 

The striking increase in the ah ratio is a 
finding which requires some explanation. One 
may think of ordinary conversation as a 
process of communication between two people 
in which vocal activity constitutes only a part 
of the information exchanged. Various facial 
expressions, movements of the head, and so 
on, are all additional cues which may tell 
the speaker how he is progressing and how 
his speech is understood and received, even 
though the listener may not actually be saying 
anything with words. When the experimenter 
is in the monitor room, the subject is deprived 
of all these additional cues. The experi- 
menter’s silence tells him nothing. Conse- 
quently, the subject is more uncertain about 
the effects his speaking has on his listener and 
the use of ah enables him to pause briefly 
and determine what he wants to say next. 
To be sure, the ah cannot act as a substitute 
for the information the listener’s many non- 
verbal cues could give him; it can only en- 
able him to deliberate more carefully. The 
work of Goldman-Eisler (1958, 1961) and 
Maclay and Osgood (1959) on the relation- 
ship of hesitations and pauses to the pre- 
dictability of words from context tends to 
support the above interpretation of the func- 
tion of ah in speech. 


Stability of Individuals’ Speech-Disturbance 
Profiles 


It has been the experience of those working 
with speech disturbances (Mahl, 1956c) and 
hesitations (Goldman-Eisler, 1961; Maclay & 
Osgood, 1959) that the various speech phe- 
nomena under study show stable, character- 
istic patterns. Moreover, the work of the 
above writers suggests that deviations from 
habitual patterns may be indicative of anxiety 
and other emotions. 

In the present study, the three most fre- 
quent categories—the ah, sentence change, 
and repetition—were used to construct a 
speech-disturbance profile for each individual. 
Such profiles for Interview A (for the half 
during which the experimenter was absent) 
were then compared with those for Inter- 
view B. Subjects showing a change in their 
profiles were those for whom the ordering 
of the frequency of usage of the three cate- 
gories did not remain the same across the two 
interviews. There were 13 subjects who 
showed such a change. Under the hypothesis 
that the instability of profiles is in part a 
function of anxiety, the subjects with stable 
and unstable profiles were compared on the 
MA scale and the palmar sweat measures. 
While the latter indices of anxiety yielded no 
significant relationships, it was found that on 
the MA scale, subjects with unstable profiles 
had higher scores (biserial correlation = [value illegible], 
p < .02). 

In investigating the characteristics of sub- 
jects with unstable speech-disturbance pro- 
files, an additional striking finding came to 
light. Subjects who are highly labile on the 
palm moisture test (that is, they show high 
intraindividual variation in their scores across 
the several administrations) are stable with 
respect to their speech-disturbance profiles, 
and vice versa (biserial correlation = −.68, 
p < .0001). This relationship suggests that 
palmar sweat and disruption of fluent speech 
may be alternate ways in which each subject 
reacted to anxiety fluctuations which occurred 
during the experiment. 
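
The profile comparison described in this section (whether the ordering of the three most frequent categories stays the same across the two interviews) can be sketched as follows. This is a minimal illustration and not the authors' scoring procedure: the function names are hypothetical, the example ratios are invented, and the alphabetical tie-breaking rule is an assumption, since the text does not say how ties were treated.

```python
def category_ordering(ratios):
    """Return category names ordered from most to least frequent.

    Ties are broken alphabetically; this rule is an assumption.
    """
    return tuple(sorted(ratios, key=lambda name: (-ratios[name], name)))


def profile_changed(interview_a, interview_b):
    """True if the frequency ordering of the categories differs between interviews."""
    return category_ordering(interview_a) != category_ordering(interview_b)


# Hypothetical ratios per 1000 words for two subjects (not data from the study).
stable_a = {"ah": 30.0, "sentence change": 12.0, "repetition": 8.0}
stable_b = {"ah": 28.0, "sentence change": 20.0, "repetition": 11.0}
labile_a = {"ah": 30.0, "sentence change": 12.0, "repetition": 8.0}
labile_b = {"ah": 15.0, "sentence change": 22.0, "repetition": 9.0}

print(profile_changed(stable_a, stable_b))  # same ordering in both interviews
print(profile_changed(labile_a, labile_b))  # "sentence change" overtakes "ah"
```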
Children’s free descriptions of other children were elicited in 3 camps for the 
underprivileged. A reliable coding scheme was developed to determine the 
categories used by the children. 3 types of analyses were made: 1st, 1 child 
described 2 other children; 2nd, 2 children described the same child; 3rd, 2 
children described 2 different others. Descriptions with a common perceiver 
had the highest overlap of categories; next came descriptions with a common 
perceived person, followed, surprisingly closely, by descriptions with only a 
common culture. The results are interpreted as emphasizing the importance of 
the perceiver’s cognitive structure. 


This paper reports an empirical approach 
to the nature of the individual’s structuring 
of his interactions with other people. More 
specifically, it will describe a method for 
determining the salient cognitive categories 
employed by a person in his descriptions of 
others. Substantively, we shall report an 
analysis of the categories of interpersonal 
cognition employed by children, emphasizing 
the relative strength of category determina- 
tion by the perceiver and by the person per- 
ceived. 

If each person employs a rather consistent 
set of categories for the interpretation of his 
social world, the same general categories 
should be present in descriptions of other 
people who vary considerably in their char- 
acteristics. On the other hand, category selec- 
tion might be so heavily determined by the 
stimulus object that there would be little 
reappearance of categories as one person de- 
scribes a number of others. 

thank the Association for the Aid of Crippled 
Children and the Herald Tribune Fresh Air Fund 

Researchers interested in social perception 
or person cognition (the terms often are used 
with a common meaning) have long been 
concerned with the relative impact of the 
perceiver and the perceived, but the emphasis 
of their empirical research has been on varia- 
tions in the psychological state of the per- 
ceiver. Most commonly, investigators have 
used experimental manipulations (for ex- 
ample, frustrating one group) or taken ad- 
vantage of individual differences on some 
trait (for example, authoritarianism) and 
then observed differences in the perception 
of a social object. This orientation is con- 
sonant with prevailing conceptions of ac- 
curacy, distortion, projection, and empathic 
ability. As we have noted elsewhere (Hastorf, 
Richardson, & Dornbusch, 1958), the desire 
for quantification and control has led re- 
searchers to specify the categories upon which 
the subject is required to report his per- 
ceptions. In these types of studies, the rele- 
vance of the categories has been defined by 
the investigator, not by the subject. 

Our approach demands an exploration of the 
cognitive structure of the perceiver, fol- 
lowing interaction in a natural setting with a 
series of individuals. To operationalize the 
problem, what categories does he use in de- 
scribing other persons when the investigator 
does not specify the relevant dimensions? 
The veridicality of his reports is not evalu- 
ated. Instead, prior questions are explored. 
What categories do people employ in a free 
response situation? Do these change as dif- 
ferent individuals are described? 

We shall study reappearing categories as 
our index of salience. The term “salience” 
refers to that which a perceiver tends to 
notice in others or that which tends to be 
noticeable about a perceived person. When a 
describer uses a category in several descrip- 
tions, we assume that the specific category 
is salient to him. If several perceivers use the 
same category in describing a particular per- 
son, we assume that this is a salient char- 
acteristic of the perceived person in his inter- 
action with others. Salience in one case 
refers to cognitive or perceptual organization 
and in the other to prominence in the per- 
ceived stimulus object, yet, using reappear- 
ance as the measure, one can explore the more 
general issue of the relative strength of the 
perceiver and the perceived in determining 
category usage. 

The analysis of verbal reports in unstruc- 
tured interviews will necessarily center on that 
sample of categories used by the respondent. 
Clearly, questions may arise in regard 
to subjects withholding certain descriptive 
[illegible] or to the operation of certain 
hypothetical unconscious processes. We shall 
be concerned only with the manifest con- 
tent of the descriptions, the words used by 
the [illegible]. In Heider’s sense, we are seeking 
information about the naive psychology of the 
[illegible]. 

Our focus is on children in this beginning 
of a cumulative series of researches. They 
[illegible] to be less sophisticated, less able 
to [illegible] material, and perhaps less moti- 
vated to control the impression they make 
on the interviewer. It is assumed that all 
people tell us something about themselves as 
they describe others. It is also assumed that 
children will be somewhat less guarded. Study- 
ing children has some further advantages. 
They talk less than adults and use simpler 
concepts. This simplifies our task of develop- 
ing reliable content analysis procedures. 

The remainder of this report will describe 
the methods by which we obtained and 
analyzed a set of descriptions of others in free 
response interviews. 


METHOD 
Sample 


The children whose descriptions of others are 
reported here came from the lower-class areas of 
metropolitan New York. They were white, Negro, 
and Puerto Rican boys and girls between the ages 
of 9 and 11. Three summer camps operated by a 
charitable organization provided the setting. The 
children were referred to a camp by social agencies 
and spent 2 or 3 weeks in residence engaging in 
organized recreational activities. 

The three camps differed primarily in the char- 
acteristics of the children in attendance. Therefore, 
we have used their diversity to provide us with 
opportunities to replicate the findings. Data analysis 
never combines either camps or sexes. Camp A 
accommodated both handicapped and able-bodied 
children. It had four different encampments, but 
we must combine encampments to develop a large 
enough sample. It contained separate groups of 
boys and girls. The tent, which had five or six 
children, is the basic unit for interaction and the 
planning of activities. The assignment of children 
to tents was specifically designed by us to provide 
heterogeneity of race and handicap in each tent, 
while seeking homogeneity of age and sex. Children 
were not previously acquainted with their tent mates. 
Camps B and C had no handicapped children, and 
each had only one sex. Camp B had only boys, 
and C had only girls. The general organization and 
activities of the three camps were similar. Non- 
handicapped children showed no statistically sig- 
nificant differences in age, occupation of father, 
ethnicity, or other bases for assignment to the three 
camps. 


Method of Interviewing 

The interviews were conducted during four encamp- 
ments at Camp A, two at Camp B, and three at 
Camp C. The interviewers were trained by the 
authors prior to reaching the camp. Each interviewer 
was present during various camp activities before he 
began interviewing. He was careful not to act as a 
counselor. The child knew his interviewer as a 
member of the camp community who was inter- 
ested in children, but not as a representative of 
the camp authorities. Before each interview, the 
child was encouraged to play with the tape recorder 
and listen to his voice played back. The child was 
further assured that the other children and the coun- 
selors would not be told what he had said. The 
interviewer explained to the child that he was 
interested in learning about children in the camp. 

TABLE 1 
FIRST ORDER CATEGORIES 

Demographic variables 
1. Spatial location 
2. Age 
4. Ethnicity 
5. Religion 

Organic variables 
6. Handicap 
7. Health 
8. Physical description 
9. Physical attractiveness 

Recreational variables 
10. Physical recreation 
11. Nonphysical recreation 

Aggression 
12. Physical aggression 
13. Verbal aggression 
14. General aggression 

Quality of interaction 
15. Described to describer 
16. Describer to described 
17. Others to described 
18. Described to others 
19. Described and describer to others 
20. Others to describer and described 
21. Reciprocal relationships 

Frequency of interaction 
22. Described and describer 
23. Others and described 
24. Described, describer, and others 

Interpersonal relations 
25. Relations with describer 
26. Relations with father 
27. Relations with mother 
28. Relations with siblings 
29. Relations with other family members 
30. Relations with nonfamily adults 
31. Relations with nonfamily children 

Group status 
32. Membership in a specific collective 
33. Inclusion or exclusion by others in a group 
34. Executive-holding an office in a specific 
collective 
35. Socioeconomic status 

Modes of interaction 
36. Generosity 
37. Giving aid 
38. Needing aid 
39. Cooperation 
40. Competition 
41. Trust 
42. Humor 
43. Dominance-submission 
44. Interpersonal skill 

Moods of interaction 
45. Happiness 
46. Excitability 
47. Confidence 

Total personality 
48. Adult terms 
49. Children’s terms 
50. Undifferentiated positive comments 
51. Undifferentiated negative comments 

Abilities 
52. Verbal communication 
53. Physical ability 
54. Mental ability 
55. School ability 

Norms 
56. Grooming and manners 
57. Honest-dishonest 
58. Swearing, cursing, drinking, smoking 
59. Conformity to expectations of peers 
60. Conformity to expectations of family adults 
61. Conformity to expectations of nonfamily adults 

Miscellaneous 
62. Possessions of the child 
63. Possessions of the family 
64. Magic and autism 
65. Occupation of child 
66. Occupation of adult 
67. Family 
68. Sexual 
69. Residual 

The interviews were completely nondirective, so 
that the descriptions given would not be influenced 
by the responses of the interviewer. The entire 
interview was tape recorded, transcribed, and later 
coded. Each child in the study was interviewed 
twice—first, a few days after arrival at camp, and 
second, shortly before departure. There was usually 
a 1-week interval between interviews. Information 
solicited in each interview included a description 
of two other children in the child’s tent. The second 
interview followed exactly the same procedure, except 
that two different children were described. Thus, each 
child described four children. The children to be 
described were assigned in order by random selection. 

The interviewer began by stating that he was 
interested in learning about the camp and the chil- 
dren in the camp. Following this, he said, “Tell me 
about Johnny Doe.” The interviewer was not per- 
mitted to illustrate the type of statement desired. 
If the child asked for more specific instructions, the 
answer was, “Just tell me anything you can think 
of.” When the respondent stopped talking, the probes 
were variants of, “Anything else?” and “Can you 
tell me any more?” The next description was re- 
quested only when a probe did not elicit further 
verbal material. 

College students of the same sex as the respondent 
were used as interviewers. Neither sex nor personal 
variability of interviewers had a major effect on 
the findings of this study, for camp and sex 
differences in Table 2 are negligible. 


Content-Analysis Procedure 


Our goal in content analysis was to stay as close 
to the child’s verbal report as possible. It would be 
impossible to analyze the verbatim record of the 
children’s responses without a more abstract method 
of categorizing the words they used. Yet, the very 
act of categorizing for content analysis may distort 
the perceptual categories of the children. In this 
dilemma, we developed an empirically based set of 
content-analysis categories from an exploratory 
study at Camp A. The coding scheme consists of 
two parts: 69 content categories (68 plus a residual 
category), and 19 somewhat more abstract second- 
order categories, which emphasize the descriptive 
style of the describer. This paper is concerned only 
with the 69 content categories, as listed in Table 1. 

The unit of coding is an idea, and a sentence 
may contain several such units. For example, “He 
played baseball with his father,” would be coded 
as both “physical recreation” (He played baseball) 
and “relations with father.” A single idea unit can 
be coded in only 1 of the 69 mutually exclusive 
categories. Ideas not codable in the first 68 cate- 
gories were coded in the sixty-ninth, or residual 
category. We analyzed only descriptions with six 
or more idea units so that we could assume at least 
minimal ability to communicate. 

It should be noted that many categories are really 
continuous. For example, “He can run very fast,” 
and “He is the slowest runner in our tent,” would 
both be coded “physical ability.” We were interested 
in the describer’s making use of a category, rather 
than where he located another on a specific 
continuum. 

The proportion of residual statements provides 
some test of the relevance of our coding scheme. If 
the coding scheme departed drastically from the cate- 
gories employed by the children, the proportion of 
residuals would be high. The average residual was 
11.8% of the total number of idea units and was 
mainly a product of lengthy anecdotes straying from 
the topic. 

* The coding manual has been deposited with the 
American Documentation Institute. Order Document 
No. [illegible]014 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress, Wash- 
ington, D. C. 20540. Remit in advance $1.75 for 
microfilm or $2.50 for photocopies and make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 

The reliability of the coding is high. Specific 
[illegible] for reliability analysis were selected at 
random. Studies of reliability by the two coders 
were performed throughout the coding process. There 
was no tendency for reliability to change as the 
coding continued. Reliability using the strictest cri- 
terion was 86.4%. This means that for any idea 
which one interviewer coded in a specific category, 
more than 17 out of 20 times the other interviewer 
coded in the identical category. If idea units on 
which both coders agreed on residual classification 
were included in the reliability analysis, the coding 
reliability is above 90%. 

A second problem of analysis is the measurement 
of overlap or reappearance of categories. A number 
of indices could be used. We selected two indices that 
appeared to have meaningful relationships to our 
concept of salience. Let us examine the psychological 
bases of these indices before turning to the specific 
measures. 

First, a category may be considered salient if it
is employed in a number of descriptions. This is a 
binary concept, present or absent, related to reap- 
pearance or nonreappearance. This definition views 
salience in terms of a sampling process which the 
describer uses to select samples from some finite 
number of categories available to him. Of two de- 
scriptions we compared, salient categories would then 
be those which appear in both descriptions. 

A second conception views salient categories as 
more likely to be frequently employed. Instead of 
noting merely the reappearance of a category used 
in another description, this conception takes account 
of the number of times it has been used. Viewed 
in the same sampling terms, this notion of salience 
incorporates not only the reappearance criterion, but 
also the number of elements in each sample assigned 
to the reappearing category. This second conception 
of salience appears to be superior. One possible dis- 
advantage, however, is the extent to which it can be 
affected by individual differences in linguistic style. 

These two conceptions of salience would dictate 
differing measures. Both these measures could be 
employed for the determination of those categories 
which are salient for a particular perceiver (he tends 
to notice these things) or salient about a particular 
social object (these things about him are noticeable)
or even salient in a specific interaction situation. The 
focus of attention in this report, however, is not on 
the level of specific salient categories, but rather on 
the more abstract question of the relative influence 
of the perceiver and the perceived. For example, if 
we compare two descriptions given by the same 
perceiver of two different persons, do we find more 
reappearing categories than when we compare two 
descriptions given by different perceivers of the same 
person? This provides a comparison of the relative 
influence of the perceiver and the perceived on the 
amount of category overlap or reappearance. 

The two measures used can now be specified. The
“presence” measure is the mean proportion of the 
categories used in one description that were used at 
least once in the other description. The denominator 
for the computation of the proportion of reappear- 
ing categories varies with the number of categories 
in the description selected as the base. Therefore, 
the mean proportion is used, with each description 
in turn as the base. For example, if eight categories 
are used in one description and six categories in 
another, the presence of three reappearing categories
would produce the mean of 3/8 and 3/6, or .44. 

The second, a “frequency” measure, is the mean 
proportion of idea units in reappearing categories.
This measure considers both the reappearance of 
categories and the distribution of idea units among 
the categories. With 15 units and 10 units in the two 
descriptions, for example, 6 idea units in the re- 
appearing categories would give the mean of 6/10 
and 6/15, or .50. 
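The two measures can be illustrated with a short computational sketch. It is not part of the original analysis: the function names, and the representation of a description as a list of category labels (one label per idea unit, residual units already dropped), are assumptions made here for illustration. The frequency function counts the idea units in reappearing categories separately for each description, which is one reading of the definition above.

```python
def presence_overlap(desc_a, desc_b):
    """Mean proportion of categories in one description that reappear in the other.

    Each description is a list of category labels, one per idea unit
    (a hypothetical representation, not the authors' data format).
    """
    cats_a, cats_b = set(desc_a), set(desc_b)
    shared = cats_a & cats_b
    # Each description serves in turn as the base (denominator).
    return (len(shared) / len(cats_a) + len(shared) / len(cats_b)) / 2


def frequency_overlap(desc_a, desc_b):
    """Mean proportion of idea units that fall in reappearing categories."""
    shared = set(desc_a) & set(desc_b)
    units_a = sum(1 for c in desc_a if c in shared)
    units_b = sum(1 for c in desc_b if c in shared)
    return (units_a / len(desc_a) + units_b / len(desc_b)) / 2


# Worked example from the text: 8 and 6 categories, 3 of them shared.
a = ["c0", "c1", "c2", "c3", "c4", "c5", "c6", "c7"]
b = ["c0", "c1", "c2", "d0", "d1", "d2"]
print(presence_overlap(a, b))  # mean of 3/8 and 3/6, reported in the text as .44

# 15 and 10 idea units, 6 units of each in the shared categories.
x = ["s1"] * 3 + ["s2"] * 3 + ["a%d" % i for i in range(9)]
y = ["s1"] * 4 + ["s2"] * 2 + ["b%d" % i for i in range(4)]
print(frequency_overlap(x, y))  # mean of 6/15 and 6/10, reported in the text as .50
```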


RESULTS 


Before presenting data on the relative 
strength of the perceiver and the perceived 
in determining reappearance, it may be ap- 
propriate to give the reader a general idea 
of the most frequently used categories. For 
this purpose, the high incidence categories for 
the nonhandicapped white boys and girls in 
Camp A will be noted. It is interesting that 
of the top 12 categories for the boys, 11 of 
these were also among the top 12 categories 
for the girls. They were: relations to de- 
scriber, relations to peers, relations to non- 
family adults, conformity to peers, conformity 
to nonfamily adults, physical recreation, 
physical ability, humor, giving aid, verbal 
communication, and undifferentiated positive 
comments. The boys also stressed physical 
aggression, while happiness appeared to be 
more salient for the girls. 

Two caveats are in order here. First, the 
near identity in the set of the top 12 cate- 
gories in usage for both boys and girls does 
not indicate equivalence in category usage. 
There are, in fact, major sex differences which 
will be discussed in a separate report. Second, 
no single category is used so frequently in 
these groups that overlap between descrip- 
tions is a product of its incidence, for the 
most used category is only 12% of the idea 
units.

To return to the central problem, the rela- 
tive influence of the perceiver and the per- 
ceived, the descriptions can be organized to 
attack this issue. Let us now define “1 on 2,” 
“2 on 1,” and “2 on 2.” One on 2 refers to 
overlap for a single describer when com- 
paring 2 of his descriptions of others. Two 
on 1 refers to the amount of overlap found 
when 2 perceivers have described the same 
person. Two on 2 refers to the overlap when 
2 different perceivers have each described dif- 
ferent persons. Let us use A, B, C, and D as 
symbols for persons to illustrate these concepts.
If A and B both describe C, we get
2 on 1. If A describes both B and C, the
result is 1 on 2. When A describes B, and
C describes D, the result is 2 on 2. These
three measures of overlap make it possible to
ask such questions as: “Is there greater overlap
when a single perceiver describes two
different people than when two different perceivers
describe the same individual?” or,
more simply, “Is there greater overlap with
a common perceiver, or is there greater
overlap with a common person as object?”
The 2 on 2 analysis provides some evidence
as to the amount of overlap that can be
ascribed to what might be called “common
culture.”
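The three pair types can be stated as a small classification rule. The sketch below is illustrative only; the function name and the (perceiver, perceived) tuple representation are assumptions, and the text does not say how a pair in which two persons describe each other was treated.

```python
def pair_type(desc_1, desc_2):
    """Classify a pair of descriptions as '1 on 2', '2 on 1', or '2 on 2'.

    Each description is a (perceiver, perceived) tuple, a hypothetical
    representation chosen for this example.
    """
    (perceiver_1, person_1), (perceiver_2, person_2) = desc_1, desc_2
    if perceiver_1 == perceiver_2 and person_1 != person_2:
        return "1 on 2"  # one perceiver, two persons described
    if perceiver_1 != perceiver_2 and person_1 == person_2:
        return "2 on 1"  # two perceivers, one person described
    if perceiver_1 != perceiver_2 and person_1 != person_2:
        return "2 on 2"  # no common participant
    return None  # same perceiver describing the same person twice


print(pair_type(("A", "C"), ("B", "C")))  # A and B both describe C
print(pair_type(("A", "B"), ("A", "C")))  # A describes both B and C
print(pair_type(("A", "B"), ("C", "D")))  # A describes B, C describes D
```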

It is tempting to look at these data within
the framework of how much overlap can be
attributed to the perceiver, how much to the
perceived, and how much to “culture.” It is,
however, crucial to note at this point that
there clearly is not a common object of
description in this study. Each person participates
in relatively unique interaction with
another, so that no two persons can be said
to have the same experiential basis for describing
John Jones. In 1 on 2, the common
participant in the interaction is the perceiver.
In 2 on 1, he is the person perceived. In 2
on 2, there is no common participant. The
results of our overlap analysis are presented
in Table 2.

There is remarkable order in these overlap
data. There are no exceptions to the finding
that 1 on 2 is higher than 2 on 1, and that
2 on 1 is higher than 2 on 2. Not only is
this regularity found for both presence and
frequency measures, but the three distributions
do not overlap in range. The lowest
1 on 2 proportion, for example, is higher than
the highest 2 on 1. Quite clearly, the greatest
overlap is obtained with a common perceiver.

The interpretation of these data, attributing
the greatest overlap to the existence of a
common perceiver, is complicated by the possibility
that the respondent is affected by
his own immediate prior responses. To say of
a girl that “She is a good student” may make
it markedly more probable that a description
of another girl a few minutes later will mention
school ability. Are we studying salience
TABLE 2
CATEGORY OVERLAP BY CAMP AND SEX

                          Percentage of overlap
                     Presence                  Frequency
Camp group       1 on 2  2 on 1  2 on 2    1 on 2  2 on 1  2 on 2
A, Males            ?       ?      34        59      47      39
                  (251)   (444)   (80)     (251)   (444)   (80)
B, Males            ?      38       ?        58      45       ?
                  (223)   (452)   (40)     (223)   (452)   (40)
A, Females         47      38       ?         ?       ?       ?
                  (284)   (536)   (74)     (284)   (536)   (74)
C, Females          ?       ?       ?        57       ?       ?
                  (100)   (162)   (40)     (100)   (162)   (40)
A + B, Males        ?       ?       ?         ?       ?       ?
                  (474)   (896)  (120)     (474)   (896)  (120)
A + C, Females     47      38       ?         ?       ?       ?
                  (384)   (698)  (114)     (384)   (698)  (114)
Total              48      39      33        57      45      38
                  (858)  (1594)  (234)     (858)  (1594)  (234)
Note. Figures in parentheses represent the number of pairs averaged in a group.


or merely the recall of a prior response in a 
somewhat similar situation? Recall is a pos- 
sible complication only for 1 on 2, since 2 
on 1 and 2 on 2 employ different describers. 
A test of this possible deficiency is provided 
by the separation of a week between the first 
and second interviews. A random sample of 
40 boys and 40 girls was selected from 
Camp A. For each, both presence and fre- 
quency overlap were computed for two de- 
scriptions by a single perceiver within the 
same interview and for two descriptions given
in interviews 1 week apart. Table 3 reports
these data.

The differences in overlap for descriptions
separated by only a few minutes as compared
to descriptions given a week apart are so
slight that the high 1 on 2 overlap is clearly
not a function of a temporary set.


DISCUSSION


The ordering of the overlap data is highly
consistent: reference to Table 2 indicates the
presence and the frequency measures maintain
a constant relationship to each other,
with the frequency measure consistently
higher. For the frequency measure, the total
1 on 2 overlap is 57%, while the 2 on 1 is
45% and the 2 on 2 is 38%. Obviously,
the absolute values of these measures could
shift rather easily with a different coding
scheme. Yet, these figures are remarkably
high, given the fact that there are 68 categories
used in the analysis (the residual
category is excluded).

In one sense, the 2 on 2 measure repre- 
sents the model for overlap due to chance, 
given the distribution of category usage in 
the population. One might ask what overlap 
would be present if all categories were equally 
likely to occur. For both the presence and 
the frequency measures, the chance overlap 
is the ratio of the mean number of cate- 
gories in a description to the 68 categories 
in the analysis. Applied to these data, the
chance presence and frequency measures 
would be .12. As in the research of Yarrow 


TABLE 3
WITHIN-INTERVIEW AND BETWEEN-INTERVIEW OVERLAP

                   Within-interview overlap    Between-interview overlap
Camp A             Presence     Frequency      Presence     Frequency
Males (N = 40)       41%          59%            45%          56%
Females (N = 40)     48%          51%            45%          53%
Total (N = 80)       47%          55%            45%          55%
and Campbell (1963), the chance model for
the reappearance of categories can be rejected,
for the lowest overlap is .32. In this paper,
we attempt to go beyond the rejection of
chance to the development of techniques for 
measuring the relative contribution of the 
perceiver and the perceived. 
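The equal-likelihood chance model can be checked with a brief sketch. It is an illustration, not the authors' computation: the function names are hypothetical, and the figure of 8 categories per description is an assumption chosen only because 8/68 rounds to the reported .12 (the mean number of categories is not stated in the text).

```python
import random

N_CATEGORIES = 68  # content categories in the analysis, residual excluded


def chance_presence(k_a, k_b, n=N_CATEGORIES):
    """Expected presence overlap when categories are drawn uniformly at random.

    The expected number of shared categories is k_a * k_b / n, so with
    description A as base the expected proportion is k_b / n, and vice versa.
    """
    return (k_b / n + k_a / n) / 2


def simulate_presence(k_a, k_b, n=N_CATEGORIES, trials=5000, seed=0):
    """Monte Carlo estimate of the same quantity, as a sanity check."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        a = set(rng.sample(range(n), k_a))
        b = set(rng.sample(range(n), k_b))
        shared = len(a & b)
        total += (shared / k_a + shared / k_b) / 2
    return total / trials


# Assumed 8 categories per description (illustrative, not from the source).
print(chance_presence(8, 8))
print(simulate_presence(8, 8))
```

Under this assumption the analytic value is 8/68, far below the lowest observed overlap of .32.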

Special stress should be placed on the rela- 
tive differences indicated by our measures. 
Let us compare 2 on 1 with 2 on 2; in both
situations, we have different perceivers, but
in 2 on 1 we have a common individual being
described. Yet, the increase in overlap over 
the 2 on 2 situation is only 6 and 7%, 
whereas if attention is shifted to the 1 on 2 
analysis (where two different individuals are 
being described by a single perceiver), the 
increase over the 2 on 2 is 15 and 19%. Cer- 
tainly, the magnitude of the 2 on 2 finding 
is a function of the homogeneity of the camp 
situation, as well as of the common general 
culture the children share. However, the small 
difference introduced by specifying a common 
social object reinforces our finding of the 
greater impact of the perceiver. 
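The increments quoted above follow directly from the Total row of Table 2. The sketch below reproduces that arithmetic; the dictionary layout and the function name are assumptions made for illustration.

```python
# Total row of Table 2, in percentage points.
TOTALS = {
    "presence": {"1 on 2": 48, "2 on 1": 39, "2 on 2": 33},
    "frequency": {"1 on 2": 57, "2 on 1": 45, "2 on 2": 38},
}


def gains_over_baseline(row):
    """Increase over the 2 on 2 baseline from a common person or a common perceiver."""
    return {
        "common perceived (2 on 1)": row["2 on 1"] - row["2 on 2"],
        "common perceiver (1 on 2)": row["1 on 2"] - row["2 on 2"],
    }


for measure, row in TOTALS.items():
    print(measure, gains_over_baseline(row))
```

A common person being described adds 6 and 7 points, while a common perceiver adds 15 and 19 points.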

In the process of comparing the relative 
contribution to overlap of the perceiver and 
the perceived, we want to emphasize an 
important next step for investigation. It will 
be important to explore the relative contribu- 
tion of the perceiver and the perceived to 
overlap of specific categories. Are there cer- 
tain categories that appear to be mainly 
perceiver determined, whereas others may be 
mainly a function of the perceived? This is 
an interaction situation, but it still may be 
possible to separate certain categories as 
being mainly determined by the object of 
description, whereas others are more a func- 
tion of the perceiver. Evidence of this sort 
should contribute to the exploration of inter- 
personal perception, especially in terms of the 
categories selected to be the basis of ratings 
in experimental situations. For example, one 
might ask the effect of various sets upon the 
descriptions given. Providing an “objective 
stimulus set” ought to increase the number 
of categories that are a function of the per- 
ceived, while altering the cognitive orienta- 
tion of the perceiver should have the most 
impact on perceiver-determined categories. 

A further question that needs investigation
is the effect of the interpersonal relationship
on category usage. For example,
how does the description of another vary with
one’s own interpersonal attraction to that 
person? Does category usage shift when a 
liked and a disliked other are described, or 
does one merely shift position on category con- 
tinua? Questions can also be raised as to the 
relation between category usage and inter- 
personal behavior. Do people who behave 
aggressively talk a lot about aggression in 
their descriptions of others? 

A number of questions can also be raised 
as to group differences in category usage in 
the description of others. How do descriptions
of others differ between rural and urban
groups, sociocultural groups, etc.? Sex differ- 
ences in category usage should be of con- 
siderable interest. Beach and Wertheimer 
(1962) have demonstrated the existence of 
category usage differences as a function of sex. 

A final comment: our findings indicate that 
the most powerful influence on interpersonal 
description is the manner in which the per- 
ceiver structures his interpersonal world. 
However, we wish to draw attention to the 
complement of each of the proportions in 
Table 2. If there is overlap for 1 on 2, there 
is also shift, as another person is described. 
It is irresponsible to treat this shift as merely 
“noise.” In this same-status, same-age, same- 
sex world, the descriptions do change as each 
describer discusses a new person with whom 
he interacts. The world of our subjects is
not just a Rorschach card. A major task of
social psychology, we believe, is the development
of concepts to order the interaction phenomena
that lead to these changes, for we have
here focused only on stability.

3 experiments are reported in which a total of 124 college males wrote TATs
at 3 points during social drinking, either in living-room discussion groups or at 
stag cocktail parties. 62 comparable Ss wrote under similar conditions when 
only nonalcoholic beverages were served. In contrast to the control or “dry” 
Ss whose protocols showed almost no changes, the protocols of the “wet” Ss 
showed significant increases in various sentient thoughts, particularly with 
moderate drinking, and decreases in various inhibitory thoughts with heavier 
drinking. Among the former, meaning contrast and physical aggression thoughts 
increased up to a maximum at 3-4 drinks (containing 1.5-oz. shots of 86-proof
alcoholic beverage). They then decreased and were replaced by an ever increasing
number of physical sex thoughts from 6 drinks on. The decreases in
inhibitory thoughts—aggression restraint, fear-anxiety, and time concern— 
occurred regularly only after heavy drinking from 6 drinks on. Physical aggres- 
sion thoughts recurred again at high frequency in those Ss who drank very 
heavily (10 drinks and up). An n Sentience score based on the preparty TAT, 
consisting of the sentient categories which increased, less some of the inhibitory 
categories which decreased, predicted the amount of alcohol that would
subsequently be consumed.


The research literature on the psycho- 
logical effects of alcohol consumption is 
singularly unenlightening as to why so many 
people drink small amounts of alcohol, often 
daily, in social settings. It reveals primarily 
that alcohol impairs functioning at all levels 
—starting with blurring the higher mental 
processes and ending with psychomotor inefficiency.
Yet it is hard to believe that
people drink in order to think, talk, or act in
an uncoordinated, inefficient way (Takala,
Pihkanen, & Markkanen, 1957). On the contrary,
such confusion should theoretically
create anxiety, if anything, because the individual
finds himself unable to control his

1 This report is the first in a series on the causes
and effects of social consumption of alcohol supported
by a grant to D. C. McClelland by the

behavior. Yet the traditional scientific expla- 
nation for social drinking is the precise 
opposite: people drink to reduce anxiety, 
tension, or conflict. “Alcohol solves the prob- 
lem of anxiety reduction [Horton, 1945, p. 
162].” “As complex society generates in- 
creasingly greater tension, the alcoholic means 
of solving it becomes more prominent [Jel- 
linek, 1945, p. 27].” (See also McCord & 
McCord, 1960.) Without even reviewing the 
criticisms that have been made of the cross- 
cultural (cf. Field, 1962) or animal studies 
(Weiss, 1958) one may question whether the 
average person who has a drink or two before 
dinner in the evening is doing so in order to 
reduce his anxiety. At least this is not what 
people say universally when asked why they 
drink. Straus and Bacon (1953) report that 
college students mention positive reasons for 
drinking (“to get high,” “for a sense of well- 
being”) as well as the traditional ones in- 
volving reduction in conflicts (“not to be 
shy,” “to meet crises”). Is it possible that 
the American puritan tradition has led even 
scientists to overlook some of the “positive” 
effects of social drinking in stressing that the 
only value it has is to deaden the individual 
to real life, to anesthetize him a little so that
he suffers less from anxiety? 

The present investigation was undertaken 
to discover empirically why people drink in 
normal amounts in natural social settings. It 
differs, therefore, from most previous alcohol 
research in several respects: it is designed to 
discover what happens after two or three 
drinks; it therefore employs a measure which 
is sensitive to slight psychological effects 
(that is, fantasy), since most types of be- 
havior are not much affected at low levels 
of alcohol consumption; it allows people to 
drink as they normally do—in normal 
amounts, in a natural social setting, in the 
mixtures and at the rate to which they are 
individually accustomed. Most studies in this 
field have a physiological orientation in which 
predetermined amounts of alcohol (by body 
weight) are administered according to a fixed 
time schedule, preferably by a continuous in- 
travenous drip (cf. Frankenhaeuser, Ungsten, 
& Jarpe, 1962), so that the person will not 
know whether he is getting alcohol or a 
placebo solution. The purpose of such studies 
is to determine the effects of alcohol per se 
(that is, apart from “suggestion”) in psycho- 
logical functioning. But it is impossible to 
eliminate suggestion in psychological research. 
One can only work with different settings that 
influence the subject in different ways. Cer- 
tainly strapping the subject down and admin- 
istering an unknown substance to him (maybe 
alcohol) intravenously while his every reac- 
tion is being checked must have a powerful 
effect on him: very probably it makes him 
anxious at this strange, threatening procedure 
which takes place in a laboratory or hospital 
with all its negative associations in our 
culture. Thus the results obtained are obvi- 
ously a product of the interaction of the set 
and setting with alcohol. The expectation is, 
of course, that the effects of alcohol can be 
isolated because the same set and setting oper- 
ate on the subjects who received only a 
placebo. But a little reflection will show how 
unwarranted such an expectation may be, par- 
ticularly if the effects are slight and require an 
anxiety-free atmosphere to develop. Suppose, 
for the sake of argument, that alcohol con- 
sumption in interaction with Settings B, C, 
and D gives rise to feelings of good cheer. Is it 
surprising that in interaction with Setting A
(anxiety-arousing intravenous administration)
the investigators (Frankenhaeuser et
al., 1962) found no increase in the frequency
with which the subjects who had had alcohol
said they felt “happy” or “carefree” as compared
with control subjects? It was a totally
“cheerless” experience for all subjects: to
expect a little alcohol to overcome this
massive effect of the set and setting is unreasonable.
Obviously, what is needed is a
series of studies in which alcohol is taken
in a variety of settings so that the range
of interaction effects can be studied. What
makes no methodological sense is to assume
that one type of set and setting (no knowledge
of what is being ingested, laboratory
conditions) is somehow better or “purer” for
studying the effects of alcohol than another.
After all, the results are only generalizable to
that particular condition, and who ingests
alcohol in that way? If one were to pick a
set or setting as “best,” it would seem most
logical to pick the one that is modal for the
culture. Hence we have decided to study the
effects of social drinking, without any illusions
that we (or anyone else) can study the
effects of alcohol apart from the conditions
under which it is consumed.


EXPERIMENTS I AND II

Procedure

Young male subjects were recruited from two
different educational institutions to take part in
some psychological research “on the effects of various
social atmospheres on people’s imaginativeness
or fantasy.” It was not mentioned that alcohol
was the focus of interest because it was felt that
subjects might be set to react in certain ways by
this information. Instead, the social settings were
chosen so that either alcoholic or nonalcoholic beverages
could be served without being considered
unusual.

Subjects in Experiment I were first- and second-year
male students in a graduate school of business
administration, averaging about 25 years of age.
They were invited to attend a discussion in the
living room of an apartment in groups of three to
six at 4 p.m. or at 8 p.m. Alcoholic beverages were
served to nine such groups (totaling 39 subjects)
chosen at random, and nonalcoholic beverages to
five other groups (totaling 24 subjects). The former
will be referred to as the wet condition and the
latter as the dry condition.

Subjects in Experiment II were male undergraduates
belonging to two fraternities (average age
= 19.5 years) at a liberal arts college for men. They
were invited to attend one of two parties being 
given at 4 P.M. on a Friday afternoon by the 
researchers “to study the effects of a party atmos- 
phere on fantasy.” At both parties there was food 
(snacks) and music, but one served alcoholic (wet 
condition) and the other soft drinks (dry condition). 
One half of the men in each fraternity was randomly 
assigned to the wet (N = 35) and dry (N = 38)
parties. Unfortunately, most subjects had expected 
alcohol to be served at both parties—which was 
normal on such occasions—so that the subjects in 
the dry condition were quite angry when they dis- 
covered no liquor was available. To keep their co- 
operation, the experimenter had to promise them a 
keg of beer after the party was over, but obviously 
their mood was not ideal for a control or “neutral” 
condition.

A few subjects in each study did not drink any 
alcohol in the wet condition. They were classified 
as dry subjects.3 

When the subjects arrived, it was explained that 
they were going to be given a test of imagination, 
since the experimenters were interested in finding 
out what the effects of various social situations were 
on imagination. They were further told that this 
was simply one type of social situation that was 
being studied (that is, a living-room discussion or 
a fraternity party, depending on the study) and 
that other situations were also being investigated. 
After some further introduction to the TAT as a
measure of imaginativeness, they wrote a set of
stories to four different pictures (TAT I). The social
interaction was then set in motion for about 25
minutes, during which alcoholic or nonalcoholic
beverages were served as naturally as possible.
Subjects then wrote stories to four new pictures
(TAT II), conversed again socially for another 25
minutes, and finally wrote four more stories to a
new set of pictures. During the 25-minute periods
of social interaction, subjects in Experiment I were
engaged by the experimenter in a discussion of what
it means to be a businessman; that is, Why had
they come to the business school? What was their
attitude toward making money? What is the businessman’s
place in society? etc. In Experiment II,
the periods of social interaction proceeded without
much help from the experimenters in the context
of a stag party, complete with a jazz combo and bar.

In both studies, alcohol was served in the form
of the usual drinks: scotch and soda, martinis, etc.
In the living-room discussions, the subjects were
encouraged simply to serve themselves whenever
they felt like it. One experimenter was present
in the room, ostensibly taking notes on the conversation,
though actually he had trained himself to estimate
and record accurately the amount of alcohol

3 It could be argued that abstainers are probably
different from subjects who are not offered alcoholic
beverages. However, the fact that only two subjects
in Experiment I and three in Experiment II were
so classified makes it unlikely that the decision could
have affected the results in any significant way.

consumed from various sized glasses, with and
without ice cubes. In the fraternity party, partici- 
pants could go to a bar for drinks, each of which 
contained two shots (3-ounce, 86-proof beverage). 
In order to get one, they had to sign a slip so
that a record of amount of alcohol obtained by
each subject was available after the experiment. It
should be noted that the alcohol estimates are only
approximate in both studies, because in Experiment I,
there was undoubtedly some observer error and in
Experiment II, the record was for the amount
obtained (say before TAT II), not for the amount
consumed (as in Experiment I). Often subjects 
would be sipping a drink while writing their stories. 

The instructions for taking the TAT were the 
standard ones used for experimental purposes (see 
Atkinson, 1958). Subjects were allowed about 5 
minutes for each story. Twelve pictures were 
specifically chosen for the present study, so that 
three sets of four different pictures could be available 
for each subject. The themes suggested by the 
pictures were chosen to represent areas of thought 
which alcohol might be expected to affect—namely, 
sex (S), aggression (A), elation (E), conflict (C), 
and “mysteries of life” (M). Since there were 12
pictures and three TAT administrations, 3 pictures 
were chosen to represent two themes each. A short 
description of the pictures and the themes which 
they were meant to elicit follow: 


1. Young couple walking on a sidewalk. (S)
2. Couple crossing the street in a snowstorm. (S)
3. Blonde model sitting pensively on a rotten,
decaying boat. (S)
4. Boss leaning back in a desk chair looking at
blonde secretary who is taking dictation. Picture
of wife and children on the desk. (C)
5. Hoodlum in leather jacket leaning against a
lamp post. Young, college couple walking by,
young man turned around toward the “hood.”
(C, A)
6. Boxer in gym, looking pensive. In background
someone shadowboxing. (E, A)
7. Negro in a race riot, taunted by a number
of white fellows. (C, A)
8. Two racing cars cutting a corner. (E)
9. Ski jumper in midair. (E)
10. Man looking into a tent. (M)
11. Man looking into a tunnel. (M)
12. Boatman rowing on a lake on moonlit night.
(M)

The order of pictures was arranged so that: each 
of four themes (S, E, C, M) occurred in each 
4-picture TAT; each 4-picture TAT also contained 
one aggression picture; the order of appearance of 
the four thematic categories was random in each 
4-picture TAT; the order of the three pictures constituting
a category was random across the different
4-picture TATs.
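The ordering constraints can be expressed as a small constrained randomization. The sketch below is one possible way to generate such an arrangement and is not the authors' documented procedure: the function name and the retry approach are assumptions, and pictures 5, 6, and 7 are filed under their first listed theme (C, E, and C), with A treated as an added tag.

```python
import random

# Picture numbers by primary theme, as listed above.
THEMES = {"S": [1, 2, 3], "C": [4, 5, 7], "E": [6, 8, 9], "M": [10, 11, 12]}
AGGRESSION = {5, 6, 7}  # pictures that also carry the aggression (A) theme


def make_tat_sets(rng):
    """Return three 4-picture sets meeting the stated ordering constraints."""
    while True:
        # Randomize which picture of each theme goes to which administration.
        order = {theme: rng.sample(pics, len(pics)) for theme, pics in THEMES.items()}
        sets = []
        for i in range(3):
            pictures = [order[theme][i] for theme in THEMES]
            rng.shuffle(pictures)  # random order of themes within a set
            sets.append(pictures)
        # Keep the draw only if every set holds exactly one aggression picture.
        if all(sum(p in AGGRESSION for p in s) == 1 for s in sets):
            return sets


for number, pictures in enumerate(make_tat_sets(random.Random()), start=1):
    print("TAT set", number, pictures)
```

Because picture 4 is the only conflict picture without an aggression theme, it must share a set with picture 6; a random draw satisfies this about one time in three, so the retry loop ends quickly.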


Coding Scheme 

Categories for coding the imaginative stories 
written were chosen partly on theoretical and partly 
on empirical grounds, and underwent several re- 
TABLE 1

AVERAGE FREQUENCY OF VARIOUS SENTIENT THOUGHTS IN FOUR-STORY PROTOCOLS OBTAINED AT INTERVALS
DURING MALE SOCIAL INTERACTION


PER Í 
peri Experiment II 
Relee discussion) (Fraternity party) 
RT Difference Atn | TATm | Difference : 
N Tarr TA gana TAT N ene T. Aa n ae TAT 
PaaS cia 39 469 85 1.33 +.87**| 35 69 1.26> | 1.06 +.37 
Dry 24| 75 | i54 | 79 | +004 |38| (97 | ‘92 | <63 | —34 
Physical sex å a 
39 64 51 36 —.28 35 .80 2.54 2.91 | +211 
De 24 aus 38 29 +.12 38 1.45 2.00 2.37 +.92" 
a as me 39 | 2.26 2.72 2.08 —.18 35 | 3.20 3.60 3.00 —.20 
Dry 24.) 1.96 2.08 1.54 —.42 38 | 3.39 2.76 2.68 —.61 
a Riera 39 .69 74 69 00 | 35 83° 1.40 | 1.20 +.37 
ry 24 .50 .63 .15 +.15 38 1.71 1.34 1.39) —.32 
‘cal fy ; 
Seas aggression | 39 | 2.38 | 2.26 | 251 | +13 | 35 | 217 | 1.80 | 1.60 | -5m 
Dry 24 | 2.46 2.08 2.58 +.12 38 1.96 1,66 2.42 46 
Cumulative average 1.61 4.07 7.2 14.1 
alcohol consumed 
(ounces, 86-proof 
beverage)® 


a Standard deviations range from .8 for means around .5 to 2.3 for means around 2.00.
b Difference TAT1r — TATY is significant at p < .05 in the predicted direction. 
® Difference between means of dry and wet conditions significant at p < .05. 3 Tm (p <0 
4A physical-nonphysical aggression score for each subject shows a significant average increase from TAT: to TATu 


in the predicted direction), 
® For the wet conditions only, 


* p < .05 by t tests for differences between correlated means.
** p < .001 by t tests for differences between correlated means.


visions. A number of categories thought to be of
theoretical importance were dropped because of 
infrequent occurrence, low interscorer reliability, or 
no changes in frequency of occurrence as a function 
of social drinking. Among the categories extensively 
tested and discarded were: provoking press (for 
example, temptation to sex or challenge to aggression);
conflict (in particular, approach and avoidance
conflicts over sex or aggression); impulsive conflict
resolution (that is, impulsive, unplanned action,
without response-produced guilt); elation (including 
all words connoting positive affect); pleasant 
physical impulse, deception (for example, villainy, 
immorality, illicit love); and poetic usage (for ex- 
ample, metaphors, similes, hyperbole, etc.). The 
coding system reported here developed out of some 
of these categories, but it is worth recording as a 
guide to future research what types of “associative 
chains” proved unreliable or invalid in this study. 
The scoring scheme which finally evolved con-
tained two major classes of coding categories, one
containing sentient thoughts, the other restraining
thoughts. The class of sentient thoughts was made
up of “sex,” “aggression,” and “meaning contrasts.”
Sex was defined as any reference to sex, love, or
romance. Such references were classified as physical
and nonphysical. Physical sex included any reference
containing a physical or body image. For example,
“beautiful knees,” “he wants to sleep with her.”
Nonphysical sex included all references not classified
as physical. For example, “he asks her for a date,”
“they are in love.”

Aggression was defined as any reference to human
aggressive thought, emotion, or action. As in sex, such
references were classified as physical or nonphysical.
Examples of physical aggression are: “his face [illegible]
with anger,” “he slapped him.” Examples of non-
physical aggression are: “he is angry,” “he insulted
him.”

Meaning contrasts were themes of sharply con-
trasting ideas where the contrast is immediate (in
the same or adjacent sentences) and involves the
meaning of life or some major life experience. For
example, “death provides meaning to life,” “do you
take the easy road in life or get caught up in
society’s rat race?” “he placed 23rd, but he is
happy.”

* Only a brief indication of coding definitions can
be included here. The complete set of scoring
instructions is on file with the American Documenta-
tion Institute. Order Document No. 8302 from
ADI Auxiliary Publications Project, Photoduplica-
tion Service, Library of Congress, Washington, D. C.
20540. Remit in advance $1.75 for microfilm or
$2.50 for photocopies and make checks payable to:
Chief, Photoduplication Service, Library of Congress.

The class of restraining thoughts included “sex and 
aggression restraints,” “fear,” and “time concern.” A 
restraint was anything that involves an avoidance 
reaction to sex or aggression. Restraints may be ob- 
jective frustrations, subjective fears, regret, guilt, or 
punishment for sexual or aggressive activities. 

The fear category to some extent includes refer- 
ences also coded under the previous two categories, 
although it is broader in the sense that it includes 
all references to anxiety of any sort. 

Time concern was defined as references to waiting, 
having enough time, keeping ahead of time, etc. For 
example, “He is almost late now,” “It seems an 
interminable time to the tournament.” 

These categories were worked out using a small 
number of protocols from the first study to test for 
coder reliability and validity in discriminating be- 
tween wets and drys. It was then applied to the 
remainder of the stories from this study and to all 
those from the second study. The stories were 
detached from the test booklets and randomly mixed 
together so that the coders could not be biased in 
their judgments. In the end, the results from the
40-odd stories used to develop the code were in-
cluded in calculating means, since they differed in
no way from scores on the remaining stories and 
since belief in the dependability of the results rests 
largely on cross-validation anyway. Two coders were 
trained in the use of the coding scheme, and each 
scored half the stories. Intercoder reliability checks 
were made at the beginning and at the end of the 
coding. Agreement coefficients were computed by the
formula: 2 × number of agreements / total number
scored by both judges. The respective median coding
reliabilities were .94 and .96, when agreements on
absence of a category were included. However, as
will appear later, agreement on presence of meaning 
contrasts was considerably lower and ultimately 
required a revision in its scoring definition. 
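
The agreement formula in this paragraph can be read as twice the number of agreements divided by the total number of scorings made by both judges. The Python sketch below is one possible reading of it and is not taken from the original scoring instructions: the function name, the representation of each story as a set of category labels, and the sample codings are all illustrative assumptions.

```python
def agreement_coefficient(judge_a, judge_b):
    """Return 2 x agreements / total scored by both judges.

    Each argument is a list with one set of category labels per story
    (hypothetical data layout). Only agreement on the presence of a
    category is counted here.
    """
    if len(judge_a) != len(judge_b):
        raise ValueError("both judges must score the same stories")
    # An agreement is a category both judges marked in the same story.
    agreements = sum(len(a & b) for a, b in zip(judge_a, judge_b))
    # Total number of scorings made by the two judges together.
    total = sum(len(a) for a in judge_a) + sum(len(b) for b in judge_b)
    return 2 * agreements / total if total else 1.0


# Hypothetical codings of two stories by two judges.
judge_a = [{"physical sex"}, {"fear", "time concern"}]
judge_b = [{"physical sex"}, {"fear"}]
coefficient = agreement_coefficient(judge_a, judge_b)
```

Because this version counts presence only, it would give lower values than the medians of .94 and .96, which also credit agreement on the absence of a category.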


Results and Discussion 


Table 1 summarizes the mean levels of
various sentient thoughts for wet and dry 
groups for the three different TAT adminis- 
trations. So far as the relaxed discussion 
groups are concerned, most of the categories 
showed no significant shifts, despite their 
initial promise based on scoring the trial 
protocols. No significant shifts appear in the 
inhibitory thoughts either, the findings on 
which are summarized in Table 2. This result 
is very important in a negative sense, since 
it casts doubt on the most widely held theory 
as to why people drink. Certainly there is no 
evidence here of a decrease in inhibitory or 
anxious thoughts after one or two drinks in a 
normal living-room discussion. Nor is there 
any evidence of a release of id thoughts deal- 
ing with sex and aggression (which might be 


TABLE 2

AVERAGE FREQUENCY OF VARIOUS INHIBITING THOUGHTS IN FOUR-STORY
PROTOCOLS OBTAINED AT INTERVALS DURING MALE SOCIAL INTERACTION


Experiment II 
eee elon) 3 (Fraternity party) 
“ea Difference 
Diffe ATi 
w | TAT | Tate TATu | T. Mur wo] TAT: f Toge Tape | AT 
© Sex restraints M ie 
Wet A Ta at | Sats] eae | eau +. 
A Dry Aa ed ne od eee De ec) (ae 
ggression restraints nt ai i aradi 
s 9 | 56 | 49 a2 O paon tasi ii $ - 
ae Era E Are We ak ant Se mee teen | 13 
aaa a A 47 | —.52* 
v ; s4 | +08 | 35] 09 | - : 
„P? aie Ve at E aai RBH fe OE 16.60 8 58-78 F15 
ime concern ý H 
—26 |35 | 214 | 240 | 9 ' 
Dy BA iz on i Za | 3g | 182 | 1.63 | 153 | —.29 
Cumulative average 1.61 4.07 7.2 14.1 
alcohol consumed 
(ounces, 86-proof 
beverage)> 


a The number of sex restraints in the dry condition is significantly higher than in the wet condition, but so is the number of
sex-related stories in which such imagery could occur; see Table 1.
b For the wet conditions only.
attributable to a decrease in inhibitions not
reflected in the protocols).

Meaning contrast is the one category which
showed a marked, regular, and significant in-
crease in the wet condition (TAT II > TAT I,
p < .05; TAT III > TAT II, p < .05; TAT III
> TAT I, p < .0001) and not in the dry con-
dition. It is a newcomer to theorizing about
the effects of alcohol and contains the first 
hint as to why people indulge in drinking 
small amounts of alcohol in social settings 
of this kind. Apparently it acts to allow or 
encourage them to think in contrasting terms 
about large or existential life issues. Many 
of the themes have a meditative and ironic 
twist to them, as in such poetic statements as, 
“Paths of glory lead but to the grave” or 
“To be or not to be, that is the question.” 
One can think of these contrasts as involving 
primary process thinking—that is, entertain- 
ing contradictory thoughts or feelings at the 
same time—or as indicating a yearning for 

on some of life's basic issues 
(success-failure, life-death, pleasure-unpleas- 
ure, etc.) which is encouraged or released 
by alcohol in small quantities under social 
conditions.

However, it should be recalled that the 
living-room discussions had focused around 
such “existential issues” as why they had 
come to the business school, the meaning of 
business in national life, etc. Perhaps this line
of thinking was just picked up more easily
by the wets because they were in a more sug-
gestible state. However, the findings for the
cocktail party, where there was no organized
discussion, do not support this view. There
too stories showed a significant increase in
meaning contrasts from TAT I to TAT II after
an average consumption of 7.2 ounces of 86-
proof beverage, followed by a slight decrease
from TAT II to TAT III. The increase in mean-
ing contrasts as a result of social drinking is
cross-validated.


Unfortunately in view of the importance 
and novelty of this finding, real difficulties 
were encountered in maintaining high inter- 
judge agreement for the meaning-contrast 
category, particularly for the fraternity party 
protocols. Judge B (a woman) found very 
few instances of meaning contrast in Experi- 
ment II—less than 4 the number coded by 
Judge A (a man)—which produced low inter-
scorer agreement. It was therefore decided to
have Judge A score all the protocols in
Experiment II so that the means in Table 1
represent his judgments exclusively. A year
later two new judges, C and D, recoded all
these protocols to check the result. Again
Judge D (a woman) did not recognize as
many instances of meaning contrast, making
agreement with Judge C (a man) low. How-
ever, the results obtained by Judge C were
very similar to those obtained by Judge A.
Even when the means per subject are based
only on the half of the stories scored by
Judge C, the increase in meaning-contrast
thoughts from TAT I to TAT II is highly sig-
nificant (t = 3.91, p < .0001), leaving little
doubt as to the “reality” of the increase 
shown in Table 1. However, this experience 
convinced us that the coding definition had 
to be revised since, while two male judges 
had obtained the result, two female judges 
had not. 

Other changes in fantasy in Experiment II
are marked and in line with general expecta-
tions. After moderate to heavy drinking (up
to 9 drinks containing 1.5-ounce shots), there
is a marked increase in references to physical
sex and a not quite so marked increase in
references to physical aggression, among young
college males at a stag party. The increase in
physical sex references also occurs in the dry
condition, although the increase in the wet
condition is significantly greater. Unfortu-
nately this finding is marred by the fact that
the drys also started out higher in physical
sex and perhaps could not show as great an
increase. Perhaps the reason why dry subjects
were somewhat (though not significantly)
higher was their anger, which may have made
them more impulsive. Whatever the reason,
the nature of the control group does present a
real problem for the interpretation of the rela-
tively greater increase of the wet group in
physical sex. On the basis of the results pre-
sented so far it is difficult to decide un-
equivocally whether alcohol has a real effect
on physical sex or whether such imagery in-
creases just as much in a party setting where
subjects do not consume alcoholic beverages.
Further evidence, however, indicates that
alcohol does have a real effect on physical sex.


Analyses within the wet group were carried
out (see below) and a significant positive
relationship between amount of alcohol con-
sumed and number of physical sex themes was
found. Also, a recent study by Kalin (1964)
with a more adequate control group showed
that alcohol does raise the level of physical
sex above that of a dry control.
Some confusion is introduced into the ag-
gression comparisons by the fact that the
irritation of the dry subjects at not being
served liquor showed up in their first TAT,
producing a significantly higher mean on
physical (but not nonphysical) aggression.
Thus while the wet subjects showed a sig-
nificant increase in physical aggression from
TAT I to TAT II, they could not really be
compared with the dry subjects who started
at a significantly higher level. What seems
more appropriate and less influenced by in-
itial level is the ratio of physical to non-
physical aggression. It starts out low (mean
of .83 to 2.17) and increases significantly in
the wet condition (1.20 to 1.60) whereas it
decreases in the dry condition perhaps as the
subjects’ anger wore off. What is interesting
both for sex and aggression is that the key
changes occurred not in any references to sex
and aggression but only in those references
having a physical aspect—using body, sensory,
or feeling imagery.
Table 2 shows a significant decrease in
aggression restraints in the protocols written
in the wet condition in Experiment II. The
increase in sex restraints is only ap-
parent, since a glance at Table 1 shows that
the increase could easily be due to the much
larger number of sex-related stories in the
wet condition for TAT II and TAT III. The
significant reduction in fear-anxiety is direct
confirmation of the most prevalent theory as
to the psychological effects of alcohol. Finally
“time concern” also shows a big drop after
the heaviest drinking, suggesting that a key
[illegible] of “reality constraint” drops out with
consumption of alcohol, which also in-
terferes with many other ego functions like


Experiment III


It was decided to check the results for the
fraternity cocktail party, since, unlike the discussion
groups, the subjects were all exposed to a single
social atmosphere which may have been unique in
some way. More subjects would also provide data
for a more detailed analysis of the effects on fantasy
of different levels of alcohol consumption.

Protocols were therefore obtained from 50 more 
male undergraduates at another college as they 
took part in cocktail parties normal for that in- 
stitution—12, 17, 8, and 13 subjects from each of 
four different parties. Conditions differed from
those reported for Experiment II in the following
respects: (a) A faculty member was not present, nor
was either of the two experimenters used in the
other two studies. A naive undergraduate experi-
menter was used to eliminate any possible effects of


been drinking before they arrived 
failed to write enough on one or 
any 


of alcohol consumption, the protocol
subjects were not used in this study.*
above, of course, do not include sul


ly 
the subject of study. (d) Alcohol was provided by
the bartender in 1-ounce shots to cut down the
amount of drinking and provide a condition in
which average consumption was between that of
Experiment I and Experiment II. (e) A number of


TABLE 3

AVERAGE FREQUENCY OF VARIOUS THOUGHTS IN FOUR-STORY PROTOCOLS
OBTAINED DURING MALE UNDERGRADUATE SOCIAL DRINKING

                              | TAT I | TAT II | TAT III | Difference TAT III - TAT I
Meaning contrast              |  .82  |  1.04  |   .78   |  -.04
Physical sex                  |  .96  |  2.56  |  3.04   | +2.08*
Physical aggression           |  .94  |  1.10  |  1.00   |  +.06
Aggression restraints         | 2.24  |   .96  |   .72   | -1.52*
Fear-anxiety                  |  .90  |   .38  |   .22   |  -.68*
Time concern                  | 1.02  |   .68  |   .50   |  -.52*
Cumulative average alcohol    |       |  4.42  |  8.84   |
consumed (ounces, 86-proof
beverage)

Note. N = 50.
* p < .0001.


marked and significant increase in references
to physical sex (but not nonphysical sex)
and marked decreases in references to aggres-
sion restraints, fear-anxiety, and time con-
cern, just as in Experiment II. It should be
noted, however, that there are marked differ-
ences between the living-room discussions
and these cocktail parties. Consumption of
2-3 drinks (a little over 4 ounces of 86-proof
alcoholic beverage) had none of these effects
on TAT III in the living-room discussions but
all of them on TAT II in the cocktail party
setting.

The results for meaning contrast and phys-
ical aggression suggest curvilinearity as they
do in Table 1. The effect shows up
clearly for meaning contrasts in Figure 1,
which was constructed in the following way.
First, an attempt was made to establish a
base line for appearance of the category with-
out ingestion of alcohol by averaging the
frequency of its occurrence for TAT; and
TAT under the dry condition (available
only in Experiments I and II) and TAT I
[Figure 2. Vertical axis: mean number of physical aggression images. Horizontal axis: alcohol consumed (oz. 86-proof beverage). Plotted group: four smaller cocktail parties.]

Fig. 2. Average frequency of physical aggression images in TATs as a function of
alcohol consumed in cocktail parties.


under the wet condition (to include the same
subjects whose later protocols contribute to
other points on the curve). Then the number
of meaning contrasts in TAT II or TAT III
was classified according to how much alcohol
the person had consumed prior to writing the
stories. Four levels of consumption were estab-
lished across the three experiments—.1-3.99
ounces; 4-6 ounces; 6.1-12 ounces; and over
12 ounces 86-proof alcoholic beverage—and
each wet subject contributed two values to
points of the curve depending on how much
he had drunk prior to TAT II and prior to
TAT III. Obviously there are flaws in this
procedure: more subjects in Experiment I
drank lightly (less than 4 ounces 86-proof)
and few in Experiments II and III drank
lightly; also those who drank heavily in
Experiment I or lightly in the cocktail party
probably differed in some key respects from


the other subjects. Nevertheless, the results 
are instructive because in all three experi- 
ments there is a sharp increase in the number 
of meaning-contrast images to a maximum 
after ingestion of 4-6 ounces (3-4 drinks) 
followed by a regular decrease in the two 
experiments in which drinking continued 
beyond this point. The increase after two
drinks is not quite significant (t = 1.47, p <
.10, in the predicted direction for all results
combined), but it is very significant after 3-4
drinks (t = 3.42, p < .01). The finding is
not so marked in Tables 1 and 3 because at 
any given TAT administration, individuals 
are writing who have consumed very different 
amounts of alcohol. 
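
The grouping of protocols by amount consumed can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the function names are hypothetical, the handling of amounts that fall between the printed ranges (for example 3.995 or 6.05 ounces) is a guess, and the 1.5-ounce drink size is the shot size reported for Experiment II rather than a universal constant.

```python
SHOT_OUNCES = 1.5  # assumed drink size (the 1.5-ounce shots of Experiment II)


def drinks_from_ounces(ounces, shot_ounces=SHOT_OUNCES):
    """Convert ounces of 86-proof beverage to a number of drinks."""
    return ounces / shot_ounces


def consumption_level(ounces):
    """Assign an amount consumed to one of the four levels in the text.

    Boundary handling between the printed ranges is an assumption.
    """
    if ounces <= 0:
        return "none"
    if ounces < 4:
        return ".1-3.99 oz"
    if ounces <= 6:
        return "4-6 oz"
    if ounces <= 12:
        return "6.1-12 oz"
    return "over 12 oz"


# Hypothetical amounts drunk by one subject before two later TATs.
levels = [consumption_level(3.0), consumption_level(7.5)]
```

With a 1.5-ounce shot, the 4-6 ounce level corresponds to roughly 3-4 drinks, which matches the equivalence given in the text.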

The results for images of physical aggres- 
sion, when plotted in the same way in Figure 
2, are not so easily understood. There are no 
significant changes by amount of alcohol con-
sumed for the discussion groups, but several
changes occur in the cocktail parties. The 
pattern is so irregular, one would be inclined 
to dismiss it as due to chance fluctuations 
except that it is identical in shape for the two 
experiments and when the results are com- 
bined nearly all the shifts are significant. 
First, with small amounts of drinking, there 
seems to be a decrease in physical aggression 
images (t = 1.46, p < .20), followed by a
sharp increase (t = 3.19, p < .01), then
a decrease (t = 4.13, p < .01), followed
by a final increase (t = 2.03, p < .05) with
very heavy drinking. The first increase is
significantly above the base line (t = 2.43,
p < .02) and the decrease at an average
9-ounce 86-proof consumption is significantly
below the base line (t = 2.26, p < .05). These
changes cannot readily be explained by refer- 
ence to alternating patterns in writing about 
aggression, as they do not occur markedly in 
Tables 1 and 3. The pattern of changes in 
nonphysical aggression images is irregular and 
quite dissimilar for the two groups, as is the 
pattern of changes in differences or sums 
of physical versus nonphysical aggression 
images. So one must apparently accept the 
results in Figure 2 at face value: In cocktail 
parties with increasing consumption of alcohol 
physical aggression images first increase in 
young males (when they are “high” or ex- 
cited), then decrease (when they are “sod- 
den”), and finally increase again (when they 
are really “drunk”). These findings need to 
be checked in other settings. 

The curves for the other variables when 
plotted in this way show more or less linear 
changes in the cocktail parties and no sig- 
nificant changes in the discussion groups. Of 
considerable interest is the point in the 
course of drinking at which the trends really 
begin to be marked. So far as physical sex 
images are concerned the increase appears 
immediately, at the lowest levels of drinking 
recorded in the cocktail parties, but it rises 
significantly above the high base line that 
exists in the dry party in Experiment II only 
after about 5-6 drinks containing 1.5-ounce 
shots of 86-proof beverage. No such increase 
appears in the discussion groups but perhaps 
it would have if alcohol consumption had been 
continued to higher levels. 
There is some evidence for a decrease in
aggression restraint images in Experiments II
and III at moderate levels of drinking, but it
really becomes marked and significant in
both cases only at around a consumption of
9 ounces and up (6 drinks or more)—a level
not reached by anyone in the discussion
groups. The same can be said for fear-anxiety
and time images, though a really good case
for a decrease in the latter can be made only
at even higher levels of consumption (15
ounces and up or 10 drinks or more).

To summarize, at the level of consumption
of a single drink (a 1.5-ounce “shot” of 86-
proof beverage), no changes in fantasy were
detected; at 2 drinks, themes of meaning con-
trast appear more often, both in the discus-
sion groups and in the cocktail parties; at 3-4
drinks, the subjects appear to be “high” [illegible]
aggression; at 5-6 drinks, they think more
about physical sex than usual and physical
aggression drops out. At 7 drinks or more,
meaning contrasts begin to disappear and the
decreases in inhibitory thoughts become
clearly apparent for the first time—that is,
aggression restraint, fear-anxiety, and time
concern—and there may be a final increase
in physical aggression, this time combined
with fewer thoughts about aggression restraint.
The results match fairly well the stages of
alcoholic intoxication as commonly under-
stood or described in the clinical literature.
What has not been previously stressed is an
early “positive” stage in which subjects think
more often in emotionally contrasting terms
about the meaning of life. If this type of
thinking is regarded as pleasurable, it may
help explain why so many people drink small
amounts of alcohol socially. At this time, one
can only speculate about the reason why it
should be pleasurable to be able to think in
contrasting terms about large existential
issues. The possibility of considering these
contrasts as involving primary process think-
ing was suggested above and it is a common
psychoanalytic assumption that primary proc-
ess thinking is associated with pleasure. It is
also possible that alcohol diminishes the
obsessiveness and the need for logical and
affective consistency characteristic of the
person engaged in everyday dealings with
reality. An idea very similar to this inter-
pretation is expressed by Levy (1958) who on
the basis of clinical evidence derived from 
dealing with alcoholics says that “alcohol 
diminishes the need of the ego for synthesis, 
for logical consistency, for unambivalence 
[p. 652].” 

The anxiety-reduction hypothesis is sup- 
ported by these data only for very heavy 
drinking. 

One final note: in previous research of 
this kind, it has been found useful to compute 
scores for individuals consisting of the fre- 
quency with which they normally inject 
images into their stories of the type associated 
with changes in the experimental variable. It 
turned out here that the tendency to use 
these images in TAT I, before drinking
started, is associated with the amount of sub- 
sequent drinking. This is particularly true of 
the images that increase with alcohol con- 
sumption—that is, meaning contrast, physical 
sex, and physical aggression. Subjects whose 
initial protocols contain only two or fewer 
of these images less often drank heavily in all 
three experiments—42% above the median in
the discussion groups (χ² = 2.92), 38% in
the large cocktail party (χ² = 2.30), and 31%
in the smaller cocktail parties (χ² = 1.92).
The combined chi-square is significant in the
predicted direction (χ² = 7.14, df = 3, p <
.05). Or to put it the other way, those who
had more “sentient” thoughts to start with 
drank more subsequently. There was also a 
tendency for those who had more inhibitory 
thoughts to start with to drink less. This was 
particularly true in the cocktail parties for
the categories of aggression restraint and
time concern. The fear category was incon-
sistently related to subsequent drinking in
the two cocktail party experiments. So a total
n Sentience score (cf. Murray, 1938) was
computed for TAT I by adding the number
of instances of meaning contrast, physical
sex, and physical aggression, and subtracting
the number of instances of aggression restraint
and time concern. This score was significantly
related to subsequent drinking in Experi-
ments II and III. In the large cocktail party,
of those who scored high in n Sentience (+1 
or more), 80% drank more than average, as 
compared with only 25% of those who scored
low in n Sentience (χ² = 10.3, p < .01). In
the smaller parties of Experiment III, the
same comparison yielded a 75% versus 35%
split (χ² = 12.3, p < .01), although the pro-
portion of people scoring +1 or more was
considerably smaller (32% versus 43%). In 
short, in these parties if the sentience-inhibi- 
tion ratio is positive to start with, the person 
is likely to drink more. The same tendency 
exists in the discussion groups (64% versus 
44%), but at an insignificant level, perhaps 
because not enough drinking occurred to 
sharpen up the difference. The interpretation 
of this finding is uncertain; it may be a kind
of conditioned response evoked in anticipation 
of drinking, since the initial testing is done in 
a party setting. That is, subjects may antici- 
pate the effects of drinking from past experi- 
ence and project the thoughts it will produce 
into their initial TATs. Those who project 
more such thoughts are those who are more 
in a “party mood”—readier to drink more at 
the moment. On the other hand, the n Sen- 
tience score may represent a more stable 
personality characteristic signifying an in- 
dividual who is more open to sentient experi- 
ence and likely to drink more. If so, it may 
provide a clue to why some people drink 
heavily and perhaps even become alcoholics. 
The next step is to get a measure of it away 
from the party setting and correlate that with 
drinking behavior and other personality 
variables. 
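
The two calculations described in this paragraph, the n Sentience score and the combined chi-square, can be sketched in Python as follows. This is a minimal illustration, not the authors' procedure: the function names, the dictionary layout, and the sample protocol are hypothetical, and treating each of the three reported chi-squares as carrying one degree of freedom is an assumption inferred from the reported df of 3.

```python
# Categories added and subtracted in the n Sentience score, as listed in the text.
SENTIENT = ("meaning contrast", "physical sex", "physical aggression")
INHIBITORY = ("aggression restraint", "time concern")


def n_sentience(counts):
    """Sentient-image count minus inhibitory-image count for one first TAT."""
    plus = sum(counts.get(name, 0) for name in SENTIENT)
    minus = sum(counts.get(name, 0) for name in INHIBITORY)
    return plus - minus


def scores_high(score):
    """The text treats a score of +1 or more as high."""
    return score >= 1


# Hypothetical category counts from one subject's first TAT.
protocol = {
    "meaning contrast": 1,
    "physical sex": 2,
    "physical aggression": 0,
    "aggression restraint": 1,
    "time concern": 1,
}
score = n_sentience(protocol)  # (1 + 2 + 0) - (1 + 1)

# Combined chi-square: the three reported values are summed, and the
# degrees of freedom add up (assumed to be 1 per comparison).
chi_squares = [2.92, 2.30, 1.92]
combined_chi_square = round(sum(chi_squares), 2)
combined_df = len(chi_squares)
```

The sum of the three reported chi-squares reproduces the combined value of 7.14 given in the text.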
With the emphasis placed upon possible aversive consequences of risk taking
(actual physical pain coupled with monetary loss), group consensus achieved
through discussion was displaced in the risky direction relative to the average
of the group members’ prior individual decisions. Subsequent private decisions
also exhibited this shift toward greater risk taking. These results were explained
in terms of a process of responsibility diffusion. This interpretation was further
reinforced by the findings observed for other experimental conditions, which
effectively ruled out the alternative possibilities that greater risk taking en-
sued from its social desirability in a male sample or derived from affiliative
propensities in a context where group members expected to experience
aversive consequences in the company of like-minded others.


In two previous studies of individual and 
group decision making, Wallach, Kogan, and 
Bem (1962, 1964) found that group decisions 
reached through discussion and consensus 
tended to be more risky than decisions made 
by the group members as individuals. In our 
first investigation, the decisions involved a 
number of hypothetical life situations in which 
a protagonist was faced with the choice be- 
tween a more risky and a less risky course of 
action. The second experiment inquired 
whether these risk-taking tendencies in groups 
would be found in a decision situation in
which the group members were actually ex-
posed to the consequences of their decisions.
Using risks and payoffs based on monetary
gain and loss for problem-solving perform-
ance, we observed that groups were con-
siderably more likely than individuals to
select the more difficult, higher payoff (for
correct solution) problems, even though prob-
lem solving itself was carried out by a single
group member. In both of the above experi-
ments, the “risky shift” phenomenon was in-
terpreted as the outcome of a process of re-
sponsibility diffusion.

National Science Foundation. We are indebted to
invaluable aid, and to A. E. Myers and D. L.
Now at Carnegie Institute of Technology.

The present experiment seeks to extend 
the generality of our previous findings to a 
type of group decision making in which nega- 
tive consequences are emphasized. In order 
to accomplish this end, we selected physical 
pain and discomfort, coupled with monetary 
loss, as the potential negative outcomes of 
risk taking. A second purpose of the present 
investigation was to examine in more detail 
whether processes other than a diffusion of 
responsibility might account for, or con- 
tribute to, a risk-taking shift in group decision 
making. We consider two possible alternative 
processes in turn below. 

First, there is the possibility that risk 
taking, by connoting boldness, may be more 
socially desirable than conservatism. Such an
association might be quite strong in the 
present experiment where physical pain and 
discomfort are being risked, since conserva- 
tism under such circumstances could imply 
cowardice. Further, the association may be 
especially likely to appear in a group setting 
where one’s “image” is on public display. 
Accordingly, an experimental condition in-
volving anticipated disclosure of one’s de-
cisions to others has been incorporated in the
study design.

Second, an expectation that the conse-
quences of one’s decisions will not be experi-
enced alone, but in the company of group
members who are undergoing the same ex- 
perience, may enhance the level of risk 
selected. This possibility might well be de-
rived from Schachter’s (1959) observations
concerning subjects’ strong preference for 
awaiting impending painful stimulation in the 
company of others in the same predicament. 
In the present case, subjects may be inclined 
toward greater risk taking if they know that 
others will be present to serve as a potential 
source of comfort during the course of any 
negative consequences ensuing from their 
risk-taking behavior. Accordingly, we have 
included an experimental condition in which 
subjects were informed that like-minded peers 
would be present during any experimental 
session involving possible aversive stimula- 
tion. 

The two conditions outlined above might 
or might not be sufficient, when considered 
separately, to elicit the “risky shift” effect. 
Suppose, however, that the processes at work 
in the preceding two conditions were allowed 
to operate simultaneously. Would such a state 
of affairs yield a shift in the direction of 
greater risk taking? To explore this third pos- 
sibility, we added an experimental condition 
in which subjects would make their decisions 
on the basis of an anticipated discussion to 
consensus. The possible social desirability of 
high risk taking would be expected to emerge 
even more dramatically under these circum- 
stances, for beyond having one’s decisions 
disclosed to others, one anticipates defending 
the selected risk level in the discussion to 
follow. In addition, the presence of others 
while consequences of the decisions are being 
experienced is also anticipated in this con- 
dition. Thus, only the discussion to consensus 
itself is omitted in the present case. 

If we can show that the conditions stated 
above are unable or insufficient to account 
for the observed shift of group decisions in 
the risky direction, then we will possess a 
strong basis for proposing that the group dis- 
cussion to consensus is an active causal factor. 
Such a finding would, of course, reinforce the 
“diffusion of responsibility” interpretation 
discussed earlier. On the other hand, if the 
conditions described produce a risky shift as 
large as that obtained under group discussion 
to consensus, then we shall know that diffu- 
sion of responsibility cannot be the sole ex- 
planatory principle. 


METHOD 


Subjects and General Procedure 


One hundred and twenty-six male subjects, stu- 
dents at the summer session of the University of Col- 
orado, were recruited to serve in an experiment dis- 
guised as a study of various “physiological effects on 
problem solving.” The mean age of the subjects was 
22.6 years, and they were randomly assigned to five 
treatment conditions. The written description of the 
experiment used for recruiting subjects was de- 
signed to minimize self-selection; in particular, 
neither the risk-taking nature nor the possible 
aversive features of the experiment were mentioned. 
Potential subjects were told that there would be 
two sessions, a preliminary session of .5 hour during 
which they would fill out nonpersonal information 
forms, and a 1-hour experimental session to be 
arranged at their convenience sometime later. Pay- 
ment of $2.00 was offered for the preliminary session 
in addition to payment (of unspecified amount) 
for the experiment itself. The “preliminary session,” 
in actual fact, constituted the complete experiment. 
Subjects were paid $2.50 after all participants had 
completed the experiment, at which time the decep- 
tion was fully explained. 

The experimental procedures were administered to 
groups of three previously unacquainted subjects at 
a time by a male experimenter. The subjects were 
seated along one side of a long table in a small 
experimental room which contained assorted pieces 
of mechanical and electrical apparatus. Movable 
wooden partitions separated the subjects’ work spaces 
at the table. As soon as the subjects were seated, 
they were given a paper-and-pencil questionnaire 
with the request that they read the initial instruc- 
tions and the first item and then wait for further 
instructions. The 6-item questionnaire was entitled, 
“Preliminary Session for Psychophysiological Ex- 
periment,” and its instructions read as follows: 


With the recent interest in manned space ex- 
ploration, scientists have become increasingly con- 
cerned with the effects of various physiological 
stimulations on the ability to perform various 
tasks. For example, it is well known that back- 
ground noise interferes with the ability to solve 
simple arithmetic problems. In our research we 
are interested in having people undergo various 
physiological stimulations for a period of fifteen 
minutes before engaging in a forty-five minute ses- 
sion of solving simple verbal and mathematical 
problems. An experimental session, then, lasts a 
total period of one hour. 

. . . the six experiments we are conducting. Through 
a random drawing procedure, you may be selected 
to participate in one of these; no person will participate 
in more than one. If you are so selected, the in- 
formation you provide on these sheets will then be 
consulted for selecting the exact procedure to be 
employed. Within the next few weeks, you will be 
informed by mail whether or not you have been 
selected to participate in one of these experiments. 
Whether you are selected or not, you will be paid 
$2.00 for the present preliminary session at that 
time. 

We are employing this rather unusual preliminary 
session and selection procedure in order to partially 
overcome a difficulty which has impeded the re- 
search. This is the occurrence of unwanted dis- 
ruptive side effects which sometimes result from 
the physiological stimulations under investigation. 
When such side effects do occur, it is impossible 
for the individual to provide meaningful results on 
the problem-solving and his participation in the 
experiment must then necessarily be terminated. 
Although we do know that all of the side effects 
subside within an hour and never leave any perma- 
nent effect, it is still not possible to prevent such 
side effects nor to predict in advance who is likely 
to suffer them. For these reasons, then, we are 
offering more money to those willing to undergo 
stimulation which is more likely to produce a side 
effect, with the understanding that those suffering 
the side effect cannot be used in the problem- 
solving and, hence, will be excused from the ex- 
periment and paid a flat fee of $1.00 for their 
time and inconvenience. That is, the greater com- 
pensation is paid only to those who complete the 
experiment after undergoing stimulation which was 
more likely to have produced the side effect, but 
did not do so. 

The forms of stimulation, side effects, and 
schedule of payments are described on the follow- 
ing sheets. For each experiment you are asked to 
indicate your preference on the stimulation to be 
employed; you are also given the option in each 
experiment of not undergoing the stimulation at 
all, but participating only in the problem-solving 
portion of the experiment. 


The first “experiment” was entitled, “Olfactory 
Stimulation,” and follows in its entirety: 


In the olfactory experiment, we are interested 
in assessing the effects on subsequent problem- 
solving of a 15-minute exposure to an odor. 
Some of the odors we are testing are known to 
produce side effects of nausea and intense sinus 
pain in a portion of the population. The side effects 
subside within an hour and leave no permanent 
effect. The table below indicates the approximate 
percentage of the population likely to suffer these 
side effects. You will note that the payment is 
larger if you are willing to choose an odor for 
which the likelihood of the side effects is greater. 
That is, greater compensation is offered to those 
who complete the experiment after experiencing a 
stimulation which had a greater likelihood of pro- 
ducing the side effects. If you happen to suffer the 
side effects, you will be excused from the experi- 
ment and paid a flat fee of $1.00 for your time 
and inconvenience. 

Please select one of the odors by circling its 
identification code letter. If you are selected for 
an olfactory experiment, this will be the odor 
which you will experience. If you prefer not to 
expose yourself to any odor but only to participate 
in the problem-solving part of the experiment, 
circle “none.” 


            Percentage of popu-       Pay in addition to 
            lation experiencing       75¢ for the hour 
Odor        side effects 

None              —                        $0.00 
a                10%                       $2.80 
b                20%                       $3.15 
c                30%                       $3.60 
d                40%                       $4.20 
e                50%                       $5.00 
f                60%                       $6.25 
g                70%                       $8.35 
h                80%                      $12.50 
i                90%                      $25.00 


The six experiments described in the question- 
naire were identical in format to the one just pre- 
sented, differing only in the stimulation employed 
and the possible side effects which might occur. The 
five other experiments were: (b) “chromatic stimula- 
tion,” in which bright color patterns would be 
presented for 15 minutes by means of special goggles 
containing prisms. Possible side effects were severe 
headache and intense burning sensation in the eyes. 
(c) “Movement stimulation,” in which a motor- 
driven apparatus used for flight simulation would 
provide bodily and vibratory movement. Possible 
side effects were dizziness and loss of muscular con- 
trol in the limbs. (d) “Taste stimulation,” in which 
various tastes would be presented by means of 
flavored sticks held in the mouth. The possible side 
effect was an intense burning sensation in the mouth. 
(e) “Auditory stimulation,” in which complex sound 
patterns would be presented through high-fidelity 
earphones. Possible side effects were intense, throb- 
bing headache and ringing in the ears. (f) “Odorless 
gases,” in which nontoxic odorless gases would be 
presented for the 15 minutes preceding the problem 
solving. The possible side effect was stomach cramps. 

After subjects had read the instructions and ex- 
amined the first experiment, further verbal clarifica- 
tion of the procedures was given. The following 
points were covered: 


1. Subjects for the various experiments will be 
selected by a random procedure; in other words, 
responses on the questionnaire will not enter into 
the selection. Each individual will serve either in 
one or none of the experiments. 

2. Responses on the questionnaire will be held 
confidential and the experimental sessions them- 
selves will be private; you will undergo the 
stimulation and problem-solving alone. 

3. We are equally interested in all of the stimula- 
tions; therefore, you should be guided only by 
your own preferences in making your decisions. 

4. The side effects are all-or-none affairs; it is 
easy to tell when one is suffering from them, they 
appear almost immediately, and they cannot be 
hidden. 

5. The scales of stimulations do not represent 
intensity scales; that is, the side effect from odor 
“a,” for example, would be just as severe as the 
side effect from odor “i.” The only feature which 
varies as one goes down the scale is the prob- 
ability that the given side effect will occur. Simi- 
larly, the stimulations themselves are not un- 
pleasant—including the movement stimulation— 
nor do they differ very much from one another 
within a particular experiment. It is only the side 
effects which would be unpleasant. 

6. The salary scale has been arranged to suit 
the percentages. Thus, for example, if you select 
the 90% level, you have a 10% chance of being 
paid—of not suffering the side effect; if you select 
the 80% level, you then have a 20% chance of 
being paid. Since the chances have doubled, the 
salary has been halved. All of the salaries have 
been arranged in this way so that we can calculate 
our budget ahead of time. 


These points were all emphasized so that they 
could not arise as new information in a group dis- 
cussion and lead, thereby, to an artifactual shift in 
risk taking under the group-discussion condition. 
As the last point made clear to the subjects, the 
probability of being paid multiplied by the size of the 
payoff is a constant. Therefore, since the side effects 
have negative utilities, the expected values of the 
various alternatives decrease as one becomes more 
risky. Hence, a “rational” decision-making strategy 
of maximizing expected values would lead to con- 
servatism in the present situation. 5 
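
The constant-product arrangement of the salary scale can be checked with a few lines of arithmetic. The Python sketch below is illustrative only: the dollar amounts are those printed in the olfactory payment table, the 60% level paired with $6.25 is an assumption based on the regular 10-point spacing of the scale, and the names `PAY_SCALE` and `chance_times_pay` are hypothetical.

```python
# Payment scale from the olfactory "experiment": probability of the side
# effect (percent) mapped to pay in dollars. The 60% entry is an assumption
# inferred from the regular 10-point spacing of the scale.
PAY_SCALE = {10: 2.80, 20: 3.15, 30: 3.60, 40: 4.20, 50: 5.00,
             60: 6.25, 70: 8.35, 80: 12.50, 90: 25.00}


def chance_times_pay(side_effect_pct, pay):
    """Probability of being paid (no side effect) times the pay."""
    return (100 - side_effect_pct) / 100 * pay


# Product of "chance of being paid" and "pay" at every level of the scale.
products = {pct: round(chance_times_pay(pct, pay), 3)
            for pct, pay in PAY_SCALE.items()}

for pct, value in products.items():
    print(f"{pct}% side-effect level: chance x pay = ${value:.3f}")
```

Under these figures every product lies between $2.50 and $2.52, so the expected monetary gain is essentially flat across levels while the chance of the aversive side effect rises. That is the sense in which maximizing expected value favors the conservative choices.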

This first administration of the questionnaire pro- 
vided the individual base lines against which all 
subsequent shifts in risk taking were evaluated. After 
all subjects had completed the initial questionnaire, 
they either were told to return the following week 
or were given a second questionnaire (as noted below 
in the appropriate treatment conditions). 


Test-Retest Control (N = 18 Subjects) 


Subjects in this control condition were asked to 
return the following week in order to receive their 
salary and, possibly, to provide additional informa- 
tion. At that time they were told that selection of 
subjects had not yet been made and that a number 
of people had expressed a desire to change their 
responses either up or down after thinking about 
their participation. Since it is very important that 
nobody be asked to undergo stimulation which they 
do not really want, the experimenter explained fur- 
ther, they were being encouraged to make any 
changes they desired before experimental subjects 
were actually selected. Thus, change was encouraged 
rather than discouraged. Separating partitions re- 
mained in place as in the first administration. This 
condition provides data on the test-retest reliability 
of the questionnaire as well as a practice or famili- 
arity control for any shifts in risk levels observed 
in the other conditions. 


Discussion to Consensus (N = 18 Groups) 


Group decisions. This condition was designed to 
answer the basic question: will discussion to con- 
sensus lead to increased risk taking in the present 
decision-making context? Subjects were told: 


This questionnaire I have just handed you is 
identical to the one you have just completed. It 
is to be used for a second phase of our research. 
In this second phase, we are interested in ex- 
amining an even more important problem than 
individual problem-solving, that of the effects of 
various stimulations on the ability of a crew or 
group of men who must perform joint tasks, tasks 
which require them to coordinate their efforts. 
It is for this reason that three of you were asked 
to be here at the same time. If you are selected 
for an individual experiment, then the information 
you gave on your first questionnaire will still be 
consulted; but we are also selecting groups of 
three, at random, to participate in a group 
problem-solving session after undergoing the same 
stimulation. In these sessions, if one person suffers 
a side effect, he will be paid his dollar and dis- 
missed and a stand-in will replace him; if two 
suffer the side effects, then two replacements will 
be used. In any case, if you do not suffer the 
side effect, you will be solving problems with two 
other individuals who have experienced the same 
physiological stimulation. If you are selected for 
a group experiment, you will not also serve in an 
individual experiment. 

Since this group may be selected for one of the 
six experiments, we would like you to go through 
the questionnaire and decide among yourselves 
which stimulation you would like to experience in 
each case. That is, you should discuss each experi- 
ment and come to a unanimous decision regarding 
the stimulation to be employed; be sure the 
decision is unanimous and that a majority is not 
just bulldozing the third member into something 


At this point, the partitions which had separated 
the subjects’ working space on the table were re- 
moved. As in our previous experiments, the subjects 
were encouraged to take ample time for their 
deliberations. 

Private decisions. In order to assess the possible 
effects on individual risk-taking levels of the dis- 
cussion to consensus, the above subjects went over 
the questionnaire again following the group decisions 
and indicated their private opinions. They [. . .] 
the subjects were told, since 


it is important to us that nobody in this research 
be required to undergo an experience he has not 
chosen for himself, if possible. . . . If it is at all 
possible, then, we will put you in a group with 
others who have selected the same level of stimula- 
tion rather than placing you in this group. 


Partitions were again placed between the subjects. 


Anticipated Public Disclosure (N = 18 Sub- 
jects) 

As one check on the possibility that the social 
desirability of increased risk taking might enhance 
the risk levels selected, this condition was identical 
to the test-retest control except that the subjects 
were further told at the retest session that each 
person’s decisions would be made public and dis- 
cussed after they filled out the questionnaire, since 


a number of people have expressed an interest in 
knowing what the other people have been deciding. 


In addition, the wooden partitions which had previ- 
ously separated the work spaces were not present 
this time. Again, change was encouraged rather than 
discouraged. Note that subjects still anticipated that 
the experimental session itself would be private. 


Anticipated Presence of Others (N = 18 Sub- 
jects) 


This condition examined the effect on risk-taking 
levels of expecting to undergo the consequences of 
one’s decisions in the company of others who have 
selected the same levels of risk. Subjects received 
the second questionnaire immediately after the first 
one. The first paragraph of instructions was identical 
to the first paragraph for the condition of dis- 
cussion to consensus—group decisions. The subjects 
then were requested to go through the questionnaire 
and 


mark the stimulation in each experiment you 
would prefer to undergo for this group problem- 
solving. If you are selected for an experiment on 
group problem-solving, then you will participate in 
the experiment with two other individuals who 
have selected the same stimulation. 


Anticipated Discussion to Consensus (N = 18 
Subjects) 


This condition contained all the features of the 
discussion-to-consensus condition except for the dis- 
cussion itself. Subjects were given the second ques- 
tionnaire immediately following the first one. Again, 
the first paragraph of instructions was identical to 
the first paragraph for the condition of discussion 
to consensus—group decisions. The instructions then 
continued: 


If this group is selected for an experiment, then 
it will be necessary for the three of you to reach 
agreement as to which stimulation you will 
undergo. For this reason, we would like each 
of you to go through the questionnaire, marking 
the stimulation in each experiment which you 
prefer to undergo in the group experience. 
When and if you are selected for an experiment, 
your decisions on that experiment will be 
[. . .] to the three of you so you can 
discuss them and make a unanimous selection on 
which stimulation is to be employed. In other 
words, you should now mark the stimulation in 
each experiment which you would want to suggest 
to the group for the stimulation to be used. 


RESULTS 


In the presentation of results, risk scores 
on the questionnaire represent the percentage 
level (probability of side effect) selected by 
a subject or group for the average of the six 
“experiments.” Thus, scores can range from 
0% (for the “none” alternative) to 90%, 
with higher scores reflecting greater risk 
taking. A shift score for an individual is 
computed by subtracting his score on the first 
administration of the questionnaire from 
his score on the second administration 
(N = number of subjects). When group de- 
cisions are under consideration, a shift score 
represents the group questionnaire score 
minus the mean of the questionnaire scores 
obtained by the same three individuals in 
the first administration (N = number of 
groups). When private decisions after group 
consensus are under study, a shift score repre- 
sents the mean of the postconsensus private 
decisions by a group’s members minus the 
mean of their scores on the first administra- 
tion (N = number of groups). All subjects 
thus serve as their own controls. Positive shift 
scores represent shifts in the risky direction. 
One-sample t tests of the difference scores 
(Walker & Lev, 1953, pp. 151-153) are 
used to evaluate the null hypothesis that the 
mean shift for a condition is zero. 
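
The scoring rules just described can be sketched in a few lines of Python. This is a minimal illustration, not the authors' computation: the response values are hypothetical, the function names are invented for the example, and the t statistic uses the common textbook form (sample standard deviation with N - 1 in the denominator), which may differ in detail from the Walker and Lev procedure cited above.

```python
from math import sqrt
from statistics import mean, stdev


def risk_score(choices):
    """Average percentage level chosen over the six 'experiments' ('none' = 0)."""
    return mean(choices)


def individual_shift(first, second):
    """Second-administration score minus first-administration score."""
    return risk_score(second) - risk_score(first)


def group_shift(group_choices, members_first):
    """Group questionnaire score minus the mean of the members' initial scores."""
    initial = [risk_score(choices) for choices in members_first]
    return risk_score(group_choices) - mean(initial)


def one_sample_t(shifts):
    """t statistic and df for the null hypothesis that the mean shift is zero."""
    n = len(shifts)
    return mean(shifts) / (stdev(shifts) / sqrt(n)), n - 1


# Hypothetical responses, for illustration only (not data from the study).
first = [60, 70, 50, 60, 80, 70]
second = [70, 70, 60, 60, 80, 80]
print(individual_shift(first, second))  # positive value = shift toward risk

members_first = [[60] * 6, [70] * 6, [50] * 6]
print(group_shift([70] * 6, members_first))

print(one_sample_t([2.0, 4.0, 6.0]))
```

Because each shift is a difference between two scores from the same subject or group, the test is applied to one sample of difference scores rather than to two independent samples.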

The mean initial risk score for the 126 
subjects was 65.5%, with a standard devia- 
tion of 16.8%. Table 1 displays the mean 
shift score obtained in each of the conditions, 
and its statistical test. 

With regard to the test-retest control 
(Table 1, Row 1), permitting subjects to 
reconsider their decisions after a period of 
1 week and under circumstances that encour- 
age change, does not lead to any systematic 
shift in either direction. The r between the 
first and second administrations of the ques- 
tionnaire is .79, thus indicating that the test- 
retest reliability of the instrument is quite 
satisfactory. 

TABLE 1 
PERCENTAGE OF DEGREE OF SHIFT IN RISK TAKING FOR EACH OF THE EXPERIMENTAL CONDITIONS 

Condition                                           Mean shift      SD        t 

1.  Test-retest control 
    (N = 18 subjects)                                 +1.11        12.78     0.36 
2a. Discussion to consensus—group decisions 
    (N = 18 groups)                                   +5.43         7.82     2.86*** 
2b. Discussion to consensus—private decisions 
    (N = 18 groups) a                                 +3.97         6.02     2.71*** 
3.  Anticipated public disclosure 
    (N = 18 subjects)                                 +1.85        11.86     0.64 
4.  Anticipated presence of others 
    (N = 18 subjects)                                 -1.30         2.52     2.12* 
5.  Anticipated discussion to consensus 
    (N = 18 subjects)                                 -4.91         8.14     2.48** 

a Test based on subjects pooled within groups in order to preserve independence. Hence df = 17. 
* p < .05, two-tailed test. 
** p < .025, two-tailed test. 
*** p < .02, two-tailed test. 


Concerning discussion to consensus, group 
decisions are significantly more risky than the 
mean of the decisions made by the group 
members as individuals (Table 1, Row 2a). 
The groups selected levels of risk which aver- 
aged over one-half a scale step more risky 
than individual decisions. Sixteen of the 18 
groups in this condition display a shift in the 
risky direction, which argues for the con- 
sistency of the effect. Row 2b of Table 1 
indicates, in turn, that the private decisions 
of these same subjects, obtained after com- 
pletion of the group discussions, also shifted 
significantly in the risky direction. Thus, sub- 
jects did not revert to their original pre- 
discussion decisions, but rather showed a high 
degree of personal acceptance of the greater 
risk taking ensuing from discussion to 
consensus. 

Regarding anticipated public disclosure, 
there is no evidence for either a risky or 
conservative shift due to knowing that one’s 
decisions are to be disclosed to others rather 
than being held confidential (Table 1, Row 
3). The results are quite comparable to those 
for the test-retest control condition. 

The data for anticipated presence of others 
(Table 1, Row 4) indicate that individuals 
actually tend to be more conservative when 
they anticipate undergoing the consequences 
of the decisions in the company of others 
who have selected the same levels of risk. 
Although the magnitude of the shift is small, 
it is statistically significant, given the con- 
sistency of the effect. Note the very small 
standard deviation relative to that of the 
other conditions. 
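
The role of the small standard deviation can be made concrete by recomputing t from the summary figures in Table 1. The sketch below rests on a stated assumption: the tabled standard deviations are taken to use N in the denominator, so that the standard error is SD / sqrt(N - 1). This convention closely reproduces the tabled t values, but it is not stated in the text, and the function name `t_from_summary` is hypothetical.

```python
from math import sqrt


def t_from_summary(mean_shift, sd, n):
    """One-sample t computed from a mean and a standard deviation.

    Assumption: sd was computed with n in the denominator, so the
    standard error of the mean is sd / sqrt(n - 1).
    """
    return mean_shift / (sd / sqrt(n - 1))


# Mean shift and SD as printed in Table 1 (N = 18 in both conditions).
disclosure = t_from_summary(1.85, 11.86, 18)  # Row 3, public disclosure
presence = t_from_summary(-1.30, 2.52, 18)    # Row 4, presence of others

print(f"Row 3: t = {disclosure:.2f}")
print(f"Row 4: t = {presence:.2f}")
```

Although the Row 4 shift is smaller in absolute size than the Row 3 shift, its much smaller standard deviation gives a t of roughly 2.1 in absolute value, against roughly 0.6 for Row 3.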


Finally, concerning anticipated discussion 
to consensus, decisions made under this expec- 
tation are considerably more conservative 
than decisions made by those same indi- 
viduals under conditions of confidentiality 
(Table 1, Row 5). The shift in the conserva- 
tive direction under anticipated discussion to 
consensus is about as large as the shift in 
the risky direction which appears when the 
discussion is actually held. 


Discussion 


The present investigation has demonstrated 
in a context of aversive consequences that 
unanimous group decisions concerning matters 
of risk show a shift toward greater risk taking 
when compared with individual decisions, and 
postdiscussion individual decisions that follow 
group consensus reflect the risky shift of the 
group rather than the original prediscussion 
decisions. Hence, the general conclusions 
drawn from our two previous experiments 
(Wallach et al., 1962, 1964) can now be 
extended to include decision-making contexts 
emphasizing negative consequences.³ 


3 . . . present experiment was indeed successful in shifting 
the focus of the decision making to the negative or 
aversive consequences of risk taking. In fact, 
monetary payoffs were rarely mentioned in achieving 
consensus; rather, discussion centered around the 
relative aversiveness of the various side effects for 
the group members. 


The proposal that a diffusion of responsi- 
bility is the process underlying such group- 
induced risky shifts received further support 
in the present experiment. Each of the al- 
ternative explanations that has been suggested 
in order to account for the group-induced 
risky shift phenomenon has been found in- 
applicable. 

Consider first the proposal that higher risk 
taking has greater social desirability than 
conservatism. If this were the causal factor 
at work in the group discussion, one would 
also expect higher risk taking to appear when 
a person knows that his decisions, rather than 
remaining private, will be made available to 
others for inspection. Yet, in the anticipated- 
public-disclosure condition, such enhanced 
risk taking failed to appear. Furthermore, a 
dramatic conservative shift appeared in the 
anticipated-discussion-to-consensus condition 
where each subject not only knew that his 
decisions would be disclosed to others, but 
also knew that he would be expected to 
defend his decisions before the other members 
of the group. 

Second, the proposal that increased risk 
taking might be caused simply by knowing 
that one would be experiencing the conse- 
quences of one’s decisions in the company of 
others who had made similar decisions, also 
was clearly disconfirmed. Such a proposal fol- 
lowed from the consideration that a person 
might accept greater risk of aversive conse- 
quences if he knew that others would be 
present as a source of potential sympathy 
and comfort during the period when those 
aversive consequences might be experienced. 
We find, however, a significant conservative 
shift in the anticipated-presence-of-others 
condition. Perhaps male subjects faced with 
the uncertainty of how they will “take” the 
side effects of physiological stimulation are 
concerned about the possibility of conveying 
an impression of weakness in the presence 
of peers. Such a process would enhance 
conservatism. 

In a previous experiment (Wallach et al., 
1964), a conservative shift was displayed by 
individual decision makers when responsibil- 
ity for others was introduced in the absence 
of group interaction. The conservative shift 
observed in the present experiment’s 
condition of anticipated discussion to con- 
sensus seems to be an example of the same 
phenomenon, even though here the indi- 
vidual’s decisions are not yet binding on the 
other members of the group. In the present 
condition, the individual is proposing a 
decision which he presumably intends to urge 
upon the group, a decision whose conse- 
quences, therefore, will be experienced by all. 
It seems likely that a feeling of responsibility 
for others would be generated under such 
circumstances. When no discussion to con- 
sensus has yet occurred but is only expected, 
the effect of these forces is to make the indi- 
vidual favor conservative decisions. When the 
discussion to consensus actually takes place, 
on the other hand, increased risk taking is 
the result. These findings offer direct support 
for explaining the group-induced risky shift 
phenomenon in terms of a diffusion of 
responsibility.⁴ 

In our previous paper (Wallach et al., 
1964), we touched upon some of the implica- 
tions of our work for committee decision 
making concerning national and military 
policy. While recognizing that these latter 
concerns introduced risks and deterrents 
likely to be qualitatively different from those 
capable of study in a laboratory context, we 
nevertheless felt that our research might be 
of some relevance to the decision making 
characterizing affairs of state. Indeed, the 
present experiment may be viewed as a closer 
approximation to the real-life cases cited 
above, for the reason that the risk of aversive 
consequences is here an important ingredient 
of the decision-making process. It is precisely 
such risks that underlie the deterrence policies 
of the major powers. Of course, the decisions 
reached in this experiment affect only the 
group members participating in the study; 
the decisions have no impact on the larger 
populations from which the subjects are 
drawn. Despite this important qualification, 
it would be most surprising if the shifts 
toward risk taking observed here and in our 
previous experiments did not have some 
counterpart in the actions of governmental 
decision-making bodies. Such matters would 
clearly be worthy of careful study. 


4 Wallach, Kogan, and Bem (1962) reported low 
but significant positive correlations between initial 
risk-taking levels and perceived influence exerted in 
the group discussion. While this finding could well 
represent an outcome of a responsibility diffusion 
process, the possibility remains that high risk takers 
may dominate the group discussion and hence exert 
a disproportionate influence in the risky direction 
on the other members of the group. Although 
quantitative data on this point were not obtained in 
the discussion-to-consensus condition of the present 
study, the experimenter observed that group members 
appeared eager to defer to and sympathize with any 
member who found a given side effect particularly 
aversive. There appeared to be little indication of 
particular group members urging an across-the-board 
strategy of high risk taking. 


EVIDENCE FOR PERCEPTUAL DEFENSE UNCONTAMINATED 
BY RESPONSE BIAS ¹ 


RICHARD R. BOOTZIN ² AND THOMAS NATSOULAS ³ 


University of Wisconsin 


This paper describes an experimental procedure which tends to minimize 
differences in response bias for neutral and anxiety-arousing words and 
presents some evidence of poorer recognition for anxiety-arousing than for 
neutral words unaccounted for by differences in response bias. When an 
anxiety-arousing or neutral word is presented tachistoscopically S must choose 
his response, respectively, from a pair of anxiety-arousing words or a pair of 
neutral words. For a stimulus duration of .03 sec. a poorer accuracy of recogni- 
tion for anxiety-arousing words is found in the absence of a differential response 
bias between the 2 classes of words. With repeated exposures to the words, 
this perceptual defense effect is found to decline to nonsignificant levels. Extreme 
groups with respect to scores on personality scales provide unexpected results. 


Only rarely have students of perception 
attempted to demonstrate that their experi- 
mental results are a consequence of how sub- 
jects perceive as opposed to how they tend 
to respond under the given conditions. This 
state of affairs probably arises from the 
difficulties inherent in the investigation of 
central processes which are not observed 
directly. As Guttman (1963) has written, 

most of the occurrences of real consequence to psy- 
chology are in a rough but important sense internal 
to organisms and subject to investigation only by 
the most tedious and inferential means [p. 120]. 

The relative neglect of the experimental dif- 
ferentiation between perceptual and response 
effects may have another related basis, the 
strict operationism which has gained wide 
acceptance in psychology. This methodo- 
logical principle would identify the meaning 
of a statement about perception with the 
evidential basis for determining its truth or 
falsity. It has been challenged by numerous 
philosophers of science (for example, Pap, 
1962) as well as by Garner, Hake, and 
Eriksen (1956). 

The above statements do not apply, how- 
ever, to the experimental area usually labeled 
perceptual defense. In fact the history of 
this area might be well characterized as a 
collective search for experimental procedures 
and designs to reveal the workings of percep- 
tion unconfounded by other variables. A 
recent phase of this search, an experiment 
by Goldstein (1962), is the point of departure 
for the present study. 


1 This research was supported in part by the 
Research Committee of the Graduate School of the 
University of Wisconsin with funds provided by the 
Wisconsin Alumni Research Foundation. We thank . . . 

Goldstein attempted to distinguish experi- 
mentally between the contribution of percep- 
tual defense and the contribution of response 
bias to differential accuracy in identifying 
anxiety-arousing and neutral words (pre- 
sented tachistoscopically). For the purpose of 
estimating the effect of response bias on dif- 
ferential accuracy, a “stimulus-absent” group 
of subjects was included in the experiment as 
well as the usual “stimulus-present” group. 
The stimulus-absent group was stimulated on 
every trial with typewritten hash marks, al- 
though the subjects were led to believe that 
words were being exposed for subliminal dura- 
tions. Furthermore, they were required on 
every trial to choose a response from a list of 
eight words, four neutral and four anxiety 
arousing (as independently determined for 
each subject), just as the stimulus-present 
subjects were required. Comparisons of the 
responses of the stimulus-absent group with 
the stimulus order of the stimulus-present 
group provided pseudoaccuracy scores for the
two types of words.

Both experimental groups displayed an
equal response bias against choosing anxiety-arousing
words. This response bias was of
sufficient magnitude to yield mean pseudoaccuracy
scores favoring the neutral words.
For Goldstein to have concluded that his 
results gave evidence for perceptual defense, 
the accuracy scores of the stimulus-present 
group would have had to favor the neutral 
words even more than was the case for the 
stimulus-absent group. This was found but 
not to a statistically significant degree. 
Goldstein concluded: no evidence for per- 
ceptual defense over and beyond differences 
produced by response bias. 

The use of differences in performance 
between stimulus-absent and stimulus-present 
conditions as indicators of perceptual effects 
has been questioned by Goldiamond (1962) 
on the following grounds. The presentation of 
true stimuli (as opposed to hash marks or 
blanks) increases the information received 
which in turn affects response tendencies. If 
two conditions differ in response biases, then 
differences between them in performance, for 
example, in the differential accuracy of identi- 
fying neutral and anxiety-arousing words, 
cannot be ascribed readily to a perceptual 
process. A difference in response bias between 
stimulus-present and stimulus-absent groups 
was not shown in Goldstein’s experiment; 
therefore, presumably a difference in per- 
formance, had it occurred, could be assigned 
to perception.

The present experimental procedure differs 
from that of Goldstein (1962) in that it 
avoids pitting anxiety-arousing words against 
neutral words as responses, a condition likely 
to contaminate perceptual with response ef- 
fects. On every trial the subject must make 
a choice of response from a pair of words 
of which both are either anxiety arousing or 
neutral. This is an attempt to eliminate any 
tendency to avoid anxiety-arousing words, 
thus hopefully equating response bias across 
anxiety-arousing pairs and their matched 
neutral pairs. The procedure does not elimi- 
nate the possibility of differences in response 
bias between such pairs; it seeks to make a
difference unlikely. If it is successful, the 
results will show differential accuracy without 
differential response bias between anxiety- 
arousing and neutral words. 

Another important point of difference from 
Goldstein’s study is that whereas their words 
and hash marks were presented at 50-100
milliseconds below the “lower bound of the
range of S’s visual sensitivity [p. 24],” here
the words are presented for a duration resulting
in imperfect, yet well above chance, accuracy.
The basis for this choice of duration
is the argument by Blum (1954) that perceptual
defense operates to exclude materials
from awareness; thus, if the material is presented
too briefly, the defense phenomenon
will be less in evidence.

A final, significant difference between the 
two experiments pertains to the points at 
which the response alternatives are exposed 
to the subject. In Goldstein’s procedure the 
eight words were made known to the subject 
immediately at the start of the tachistoscopic 
session. A list of them was attached to the 
front of the tachistoscope and could be re- 
ferred to between trials as well as at the point 
of response. Goldstein, Himmelfarb, and
Feder (1962), discussing their own experiment,
imply a criticism of Goldstein’s procedure:
“Motivation for perceptual defense”
can be “undermined” by having the subject
perceive the stimulus word every time prior
to the presentation in the tachistoscope. A
similar, related criticism is that with repeated 
exposures in a brief time the anxiety reaction 
to a word may habituate, removing an impor- 
tant determinant of perceptual defense. In 
the present study an attempt is made to 
reduce the rate at which habituation occurs 
by presenting the response alternatives only 
after each stimulus presentation. 


METHOD 
Subjects


At preliminary sessions 86 women and 39 men,
for the most part students of introductory psychology,
participating in exchange for points on their
final examinations, were administered the Hysteria
(Hy), Psychasthenia (Pt), and Suppressor (K) scales
of the MMPI. The subjects’ scores were determined
by subtracting K-corrected scores on the Pt scale
from scores on the Hy scale. This composite score
has been used in clinical, experimental research as
a measure of the tendency to repress emotionally
disturbing material (cf. Dahlstrom & Welsh, 1960,
pp. 379-382). Only the 7 subjects of each sex scoring
highest and the 7 of each sex scoring lowest served
in the remainder of the experiment. For the men
scores were +15, +14, +14, +11, +10, +10, +8, -7,
-14, -15, -19, -19, -24, and -25; for the women,
+18, +15, +13, +13, +10, +10, -17, -19, -!%
-20, -25, -29, and -30. These 28 students were
recruited for the remainder of the experiment on
the grounds that they belonged to a random sample
of the subject pool; an association between the
two parts of the experiment was avoided. The two
personality groups will be referred to as “high-positive
scorers” and “low-negative scorers.”
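The selection of extreme groups described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the scoring procedure actually used: the record fields and function names are hypothetical, and the K correction is taken to be the conventional one for the Pt scale (raw Pt plus 1.0 times K), which the text does not spell out.

```python
from dataclasses import dataclass


@dataclass
class Respondent:
    ident: str
    sex: str   # "F" or "M"
    hy: int    # raw Hysteria score
    pt: int    # raw Psychasthenia score
    k: int     # raw K (Suppressor) score


def composite(r: Respondent, k_weight: float = 1.0) -> float:
    """Hy minus K-corrected Pt; k_weight = 1.0 is an assumption."""
    return r.hy - (r.pt + k_weight * r.k)


def extreme_groups(pool, n=7):
    """Return (high, low): the n highest and n lowest composite scorers of each sex."""
    high, low = [], []
    for sex in sorted({r.sex for r in pool}):
        ranked = sorted((r for r in pool if r.sex == sex),
                        key=composite, reverse=True)
        high.extend(ranked[:n])   # most positive composite scores
        low.extend(ranked[-n:])   # most negative composite scores
    return high, low
```

With n = 7 and two sexes this yields 14 subjects per personality group, matching the 28 students retained for the perceptual task.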


Stimuli 


At individual sessions each subject was given a
word-association test in which he said the first word
which came to mind in response to hearing a stimulus
word. One hundred and thirty-six words were presented,
comprising 34 groups of four words each,
the members of a quadrad being randomly interspersed
throughout the list. An experimenter selected
two words for each quadrad as likely to be anxiety
arousing, and two as in all likelihood neutral. The
four words of each quadrad were equated for frequency
of usage (Thorndike & Lorge, 1944), length,
and first letter. The subject’s response to each word
and his latency as measured by stopwatch were
recorded. Upon completion of the list, the list was
begun again in the same order, the subject being
instructed to repeat the specific response he had made
the first time through the list.

Two quadrads were selected for each subject. A 
word was judged to be anxiety arousing if the 
latency of response to it was long and there was an 
error of reproduction as well. It was judged to be 
neutral if the subject had a short reaction time to 
it and no reproduction error. This was an ideal 
criterion and a number of exceptions were made in 
the attempt to find quadrads in which two of the 
words had among the longest latencies and two 
among the shortest for the particular subject. 
Table 1 lists the quadrads finally selected for the 
remainder of the experiment. The table includes the 
number of subjects exposed to each quadrad and 
the number of subjects showing a reproduction error 
for each word. Although there were 14 high-positive
scorers and 2 quadrads were selected for each subject,
only 27 quadrads are listed for them in Table 1;
1 quadrad is omitted because an error in experimental
procedure reduced the usable data from 1
subject to that collected for 1 quadrad.


Procedure 


Two to 4 days after the word-association session,
each subject participated in the perceptual task. At
this last session the eight words selected for that
subject were presented singly by means of a
tachistoscope. As soon as a stimulus
was flashed, the subject had to choose as correct one of
two words on the next page of the answer booklet.
The two words included the one flashed and the
other word of the same type in its quadrad. For
example, if an anxiety-arousing word was flashed
the choice was between that word and the other
anxiety-arousing word from the same quadrad. The
answer booklets contained alternate pages of blank
sheets so that only during responding were words
from the booklet exposed. The subjects were told
that since the words would be presented very briefly,
they should not feel disturbed if at times they seemed
to be merely guessing.

The words were presented at two speeds, .01 and
.03 second. A slide with five typed hash marks was
also presented at .01 second. A total of 96 stimulus
presentations were made to each subject. Each
block of 48 included four presentations of each of
the eight words selected for a subject at each speed
and in each block the card with hash marks was
presented 16 times, twice for response in terms of
each word pair. A different random order within
each block was constructed for each subject.

Some of the words used were ones Michael J.
Goldstein has been using. He kindly made his list
available to us.

TABLE 1
QUADRADS USED IN RECOGNITION PART OF EXPERIMENT

Anxiety arousing              | Neutral                       | Frequency

High-positive scorers
HOLE (1)a      HANG (1)       | HURT (1)       HALL (1)       | 1
CRUSH          CHEST          | CURSE (1)      CRUEL          | 1
PINCH (1)      PERIL          | PIANO          PILOT          | 1
SPERM          SLINK (1)      | SHOWY          SUAVE          | 1
INSULT (1)     INDUCE (1)     | IMPOSE         INFANT         | 1
BALLS (1)      BROAD (1)      | BUILD (1)      BREAD          | 1
IMPOSE (2)     INDUCE (1)     | INSULT         INFANT         | 2
THRIVE (1)     TRYING         | THREAT (1)     TRENCH         | 2
ACCUSE (1)     ABSORB         | AROUSE         ASCEND         | 2
TEASE (1)      TOUGH (1)      | TORCH          TIMID          | 2
COWARD (2)     CRISIS (1)     | CRADLE         CARPET         | 3
ACCUSE (1)     AROUSE (2)     | ABSORB         ASCEND         | 4
INSULT (5)     IMPOSE (3)     | INDUCE (2)     INFANT         | 6

Low-negative scorers
SUCK           SOFA           | SCAR           SLIM           | 1
PAIN (1)       PICK (1)       | PULL           POOR           | 1
BEAT           BLOW (1)       | BONE           BURN           | 1
BURY           BARE (1)       | BOIL           BENT (1)       | 1
TEASE          TORCH (1)      | TOUGH          TIMID          | 1
SHOWY (1)      SLINK          | SPERM          SUAVE          | 1
CURSE (1)      CRUSH          | CHEST          CRUEL          | 1
BREAST         BURDEN (1)     | BOUGHT         BASKET         | 1
FORCE (1)      FIGHT (1)      | FIRST          FRONT          | 1
CROSS          CLEAN          | CHIEF          CATCH          | 1
CURSE (1)      CHEST          | CRUSH (1)      CRUEL (1)      | 1
PRICK (1)      PUNCH (1)      | PRUNE          PASTE          | 1
THREAT (1)     THRIVE         | TRYING (1)     TRENCH (1)     | 1
THREAT (1)     TRYING (1)     | THRIVE         TRENCH         | 1
SPERM          SLINK (1)      | SHOWY          SUAVE          | 1
INSULT (1)     INDUCE (1)     | IMPOSE         INFANT         | 1
BALLS          BROAD          | BUILD          BREAD          | 1
ACCUSE         ABSORB (1)     | AROUSE         ASCEND         | 1
INSULT (1)     IMPOSE (1)     | INDUCE         INFANT         | 1
TEASE          TOUGH (1)      | TORCH          TIMID (1)      | 2
ACCUSE (1)     AROUSE (1)     | ABSORB (2)     ASCEND         | 2
EXPOSE (3)     EXCITE (3)     | ESTATE         ENERGY (1)     | 5

a Number of reproduction errors.


5 There were four errors in experimental procedure 
all involving the presentation or pseudopresentation 
of a stimulus word more or fewer times than indi- 
cated in the text. Since proportions were used as 
response measures, it is believed that these errors 
do not affect the results.

TABLE 2
ANALYSES OF VARIANCE OF ACCURACY SCORES AND RESPONSE BIAS SCORES

                                 Accuracy              Response bias
Source                 df       MS         F           MS         F

Personality (P)         1    1054.035     2.76          ?        4.04
Error between (E)      26     381.616                   ?
Word types (W)          1     152.106     1.89       340.?       3.80
W × P                   1      23.295                693.910     7.75
W × E                  26        ?                      ?
Blocks (B)              1     626.835      ?            ?
B × P                   1     277.569     2.35       117.452
B × E                  26     117.991                135.515
Input conditions (I)    2    9233.101    31.47***    321.133     2.?
I × P                   2     218.344                 59.498
I × E                  52     293.358                121.754
W × B                   1       5.521                 19.325
W × B × P               1     200.124     2.67       160.301     2.1?
W × B × E              26      74.859                 73.367
W × I                   2     505.055     3.69*       41.254
W × I × P               2     367.659     2.69        14.402
W × I × E              52     136.789                116.563
B × I                   2     471.645     3.81*      544.562     6.60
B × I × P               2      54.492                154.347     1.87
B × I × E              52     123.625                 82.589
W × B × I               2     161.056                 71.433
W × B × I × P           2     291.807     2.56        18.767
W × B × I × E          52     114.121                 83.194

Note. Arc-sine transformation of proportions to angles.
* p < .05.
** p < .01.
*** p < .001.


Design and Analysis 


The proportion of correct responses was calculated 
for each subject, for each block, for each input 
condition (words at .01 second, words at .03 second, 
and hash marks at .01 second), for each type of 
word (anxiety arousing and neutral). The 28 subjects 
thus provided a total of 336 scores or 12 each. These 
scores were converted by arc sine to angles and
submitted to analysis of variance. Pseudoaccuracy
scores for the trials with the hash-marked stimulus
were determined by comparing each subject’s responses
with a random order of the words used for
that subject. This random order was not the same
one used for presenting true stimuli. A different
random order was used for each subject. The variable
of sex is omitted from the reported analyses
because it has no significant effect on accuracy either 
as a main effect or in interaction with any other 
variable. 
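As a hedged illustration of the transformation step, the sketch below converts a proportion correct to an angle. The text says only that scores "were converted by arc sine to angles"; the particular form used here, arcsin of the square root of the proportion expressed in degrees, is an assumption (some tables use twice that angle), and the function names are hypothetical.

```python
import math


def arcsine_angle(proportion: float) -> float:
    """Angular transformation of a proportion: arcsin(sqrt(p)) in degrees."""
    if not 0.0 <= proportion <= 1.0:
        raise ValueError("proportion must lie between 0 and 1")
    return math.degrees(math.asin(math.sqrt(proportion)))


def cell_score(n_correct: int, n_trials: int) -> float:
    """Transformed score for one subject x block x input condition x word type cell."""
    if n_trials <= 0 or not 0 <= n_correct <= n_trials:
        raise ValueError("need 0 <= n_correct <= n_trials and n_trials > 0")
    return arcsine_angle(n_correct / n_trials)
```

Under this form a chance-level proportion of .50 maps to 45 degrees, and equal steps in the angle correspond to unequal steps in the raw proportion, which is the usual motivation for transforming proportions before analysis of variance.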

In addition a measure of response bias was computed
and submitted to an identical analysis of
variance. The kind of response bias that can be
displayed in this experiment is a tendency to pick
one of a pair of anxiety-arousing or neutral words
more frequently than the other. The proportion of
trials on which the preferred response was chosen
was determined for each input condition, for each
type of word, for each block, for each subject. Again
this resulted in 336 proportions or 12 per subject.
Both analyses of variance, one for accuracy, the
other for response bias, are summarized in Table 2.
Comparisons between pairs of means reported in
the results are all tested by Duncan’s new multiple
range test.
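The response bias measure can be sketched as follows for a single word pair. This is a minimal reading of the description above, assuming that the "preferred response" is simply whichever member of the pair the subject chose more often within the cell; the function name is hypothetical, and how the two pairs of one word type were combined within a cell is not stated in the text.

```python
from collections import Counter


def response_bias(choices):
    """Proportion of trials on which the more often chosen pair member was picked."""
    if not choices:
        raise ValueError("at least one trial is required")
    counts = Counter(choices)
    return max(counts.values()) / len(choices)
```

With two alternatives the measure runs from .50 (no preference) to 1.00 (the same word chosen on every trial), regardless of which word was actually flashed.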


RESULTS AND DISCUSSION 
Input Conditions 


Table 2 shows that the variable of input
conditions had significant effects on the accuracy
of response both as a main effect
(p < .001) and in interaction with word
types (p < .05). The main effect is to be
expected on the basis of how much stimulus
information is reaching the subject; accuracy
should be relatively high when words are exposed
for .03 second, intermediate when words
are presented for .01-second duration, and at
chance level when hash marks are the stimuli.
Comparisons of the six means of the significant
interaction of Input Conditions × Word
Types confirm these expectations in part.
With mean proportions correct of .675 and
.747 for anxiety-arousing and neutral words,
respectively, the condition of .03-second duration
yields greater accuracy than at .01 second
or with hash-marked slides (p < .001 in all
cases). However, accuracy at the .01-second
level does not differ for words as stimuli as
opposed to hash marking; all four values
are within 4% of the 50% value. In the
present experiment, then, when words are presented
at the briefer duration subjects appeared
to be responding as if no words were
in fact presented. That this is more broadly
the case is evidenced by a nonsignificant
effect of Input Conditions and Word Types
× Input Conditions for response bias (see
Table 2) as well as a completely negative
Duncan’s test on the corresponding six
means.
The results to this point do not select 
between approaches to perception which in- 
troduce a theoretical concept of perception 
(for example, Garner et al., 1956) and those 
which seek to explain the results of perceptual 
experiments without resort to such a concept 
(for example, Goldiamond, 1962). In keeping
with the latter approach one might conclude
that at the .03-second duration relatively
large fragments of the flashed words stimulate
the subject and responses associated with such
fragments are produced or chosen.


Word Types


The main effect of word types on accuracy
is not significant but there is the already
noted significant interaction of Word Types
× Input Conditions. It was expected that the
largest difference in accuracy between neutral
and anxiety-arousing words would be found
for words exposed at the .03-second duration,
a smaller effect at .01-second duration,
and no effect when hash marks were presented.
The expectation of no difference in
pseudoaccuracy between neutral and anxiety-arousing
words is based on the fact that
with the present procedure all response biases
of whatever magnitude will result in a mean
proportion correct of .50 under hash-mark
conditions.

At the .03-second duration greater accuracy
was found for neutral words over anxiety-arousing
words (p < .01). No difference was
found between neutral and anxiety-arousing
words under the other two input conditions.
The results at .03-second duration do select
between approaches which introduce perceptual
concepts and those that attempt to do
without. The difference in accuracy found
between anxiety-arousing and neutral words
cannot be attributed to differences in the size 
of the effective stimulus fragments; the same 
stimulus duration was used for both types 
of words. The difference found cannot be 
attributed to different response biases pro- 
duced by the two types of stimuli; as already 
mentioned a Duncan’s test on the means for 
response bias showed no differences. These 
results constitute evidence for a perceptual 
defense effect uncontaminated by response 
bias. 
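The claim made earlier in this section, that any response bias yields a mean pseudoaccuracy of .50 under the hash-mark condition, follows from scoring responses against an order that is unrelated to them and in which each member of a pair is equally likely to be named as the "stimulus." The sketch below illustrates the arithmetic; the function names are hypothetical and the equal-likelihood assumption about the scoring order is ours.

```python
def expected_pseudoaccuracy(bias: float, p_preferred_is_target: float = 0.5) -> float:
    """Expected proportion 'correct' when no word is actually shown.

    bias: probability that the subject picks the preferred member of the pair.
    p_preferred_is_target: probability that the scoring order names the
    preferred member as the stimulus (0.5 for a balanced random order).
    """
    return (bias * p_preferred_is_target
            + (1.0 - bias) * (1.0 - p_preferred_is_target))


def pseudoaccuracy(responses, scoring_order):
    """Observed proportion of responses that match the scoring order."""
    if len(responses) != len(scoring_order) or not responses:
        raise ValueError("sequences must be non-empty and of equal length")
    hits = sum(r == s for r, s in zip(responses, scoring_order))
    return hits / len(responses)
```

For example, a subject who chooses CRUSH on every hash-mark trial, scored against an order that names CRUSH and CHEST equally often, obtains exactly .50. Because both alternatives on a trial belong to the same word type, such a bias cannot favor one word type over the other.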


Blocks 


Table 2 shows a significant improvement 
in accuracy over the halves of the experiment 
(p < .05). Furthermore blocks interact sig- 
nificantly with input conditions (p < .05). 
Comparisons among the six means of this 
interaction result in the conclusion that the 
improvement is specific to the .03-second 
condition (p < .005). The improvement at 
.03-second duration is from a mean proportion
correct of .668 to one of .754, an improvement
of .086. At the same time there
is a significant reduction in response bias for
only the .03-second duration (p < .005).
This reduction in response bias may be responsible
for the improvement or it may be
a consequence of it. The fact that a similar 
reduction is not found under the .01-second 
and hash-mark input conditions (in fact there 
is a minute increase in response bias) strongly 
suggests that the increased accuracy of recog- 
nition results in both higher scores and a 
reduction in response bias. 

The improvement in accuracy of recogni- 
tion may be due to the learning of helpful 
receptor-exposure acts or it may be due to a 
decrease over trials in the perceptual-defense 
effect. There is evidence in favor of the latter 
alternative. The improvement in accuracy 
over the halves of the experiment at the 
.03-second duration takes place to a nonsignificant
degree for the neutral words and
at a statistically significant level for the
anxiety-arousing words (p < .005). In fact
the perceptual-defense effect does not achieve
significance in the second half of the experiment
although the difference in means is still
in the originally observed direction. In the
first half it is present at less than p = .01.
The corresponding mean proportions correct
are .624 and .713 in the first half and .726
and .782 in the second half, respectively, for
anxiety-arousing and neutral words. Again
there are no statistically significant differences
between anxiety-arousing and neutral words
in response bias within each half. The improvement
over halves in accuracy scores for
anxiety-arousing words is accompanied by the
only significant decrease in response bias
(p < .05). This reduction in response bias
has already been interpreted as a consequence
of improved perceptual accuracy on the
grounds that it does not occur under input
conditions other than .03-second duration.

TABLE 3
MEAN PROPORTIONS CORRECT AT LONGER
STIMULUS DURATION

              High-positive scorers      Low-negative scorers
Word type     First half  Second half    First half  Second half
Anxiety         .615        .707           .644        .759
Neutral         .639        .754           .770        .814

The analysis in terms of halves of the 
perceptual task has provided evidence for 
perceptual defense only in the first half of 
the experiment. The improvement in accuracy
of perceiving anxiety-arousing words reduces 
the perceptual defense effect to a nonsignifi- 
cant level. It would seem that repeated 
exposures to the stimuli serve to reduce 
their tendency to evoke anxiety and conse- 
quently eliminate the operations of a defense 
mechanism. 


Personality 


So far as accuracy scores are concerned 
it can be seen in Table 2 that the analysis 
of variance does not show that personality 
differences have any effect whether as a main 
effect or in interaction with the other vari- 
ables. Despite this it was considered appropri- 
ate to perform a finer analysis in terms of 
differences between individual means. The 
grounds for this decision are the expectation 
that high-positive scorers would show the
perceptual-defense effect to a greater extent
than would the low-negative scorers, and the
fact that all the positive results reported thus
far have to do with the longer stimulus duration,
the other input conditions producing
chance accuracies. The following results con- 
cern accordingly the performance of the two 
personality groups in response to the words 
presented for .03 second. 

Table 3 shows the mean proportions correct 
for the two personality groups in each half 
of the recognition task for each type of word. 
In the second half the four means do not 
differ at even the .05 level; neither group 
shows the perceptual-defense effect in the 
second half nor do the two groups differ in 
accuracy. In the first half the low-negative 
scorers do show a difference in accuracy 
between neutral and anxiety-arousing words 
(p < .005), i.e., they show the perceptual-defense
effect. The parallel difference in response
bias between anxiety-arousing and
neutral words is not significant. 

The high-positive scorers do not show the 
perceptual-defense effect in the first half to a 
statistically significant degree; accuracy of 
recognition of anxiety-arousing and neutral 
words is equally low, at the level of accuracy 
of the low-negative scorers for anxiety- 
arousing words. The high-positive scorers 
have poorer accuracy for neutral words in the 
first half than do the low-negative scorers 
(p < .005). This difference in accuracy is associated
with higher response bias (p < .001) on
neutral words for high positives than for low
negatives. This is the main source for the
significant interaction of personality and word
types noted in Table 2 for response bias.

The results for the low negatives are clear
then. They show the perceptual-defense effect
in the first half of the recognition task and
no difference between the two classes of words
in response bias. The high positives, on the
other hand, do not show the perceptual-defense
effect, their accuracy is equally low
for both types of words, and there is a high
response bias on the neutral words. The high
response bias on the neutral words makes
difficult the interpretation of the accuracy
scores. It may be a consequence of the depressed
accuracy and/or a cause of it. That
it is to some extent a contributor to lowered
accuracy is suggested by the presence of the
same bias, but to a lesser degree, for hash-marked
stimuli (p < .05) and when words
are presented for .01-second durations (directional
only). The source of this high response
bias for neutral pairs is not known. It does
not appear to be a result of the choice of
neutral words for the two personality groups:
the neutral pairs of the high positives have
a slightly greater mean within-pair difference
in word-association latency than those of the
low negatives but this difference (.3 second)
is smaller than the smallest unit read from
the stopwatch (.5 second); as shown in
Table 1 both groups show nearly identical
patterns of reproduction errors.

If the difference in response bias between
the groups for the neutral words is assumed
to be a consequence of the difference in
accuracy, this latter difference needs to be
explained. To set the stage for one possible
interpretation consider the fact that from the
first half to the second half there is improvement
in accuracy in recognizing anxiety-arousing
words in the case of both groups
and for recognizing neutral words for the
high-positive scorers. This improvement is sufficient
to eliminate any differences between groups
and between words. In other words, accuracy
with respect to neutral words for the low-negative
scorers shows the same pattern of
results as do the anxiety-arousing words in
the case of both groups. It may be correct
to say that in the case of the high-positive
scorers the perceptual-defense effect is present
for both neutral and anxiety-arousing words,
that it is a more generalized, unselective phenomenon
for that group. This interpretation is
consistent with Eriksen’s (1954) finding of
greater stimulus generalization under stress for
high-positive scorers than for low-negative
scorers. The present experimental situation
would seem to be particularly suitable for the
appearance of this generalized effect because
the neutral and anxiety-arousing words are
interspersed in an order of presentation in
which stimuli are presented one at a time in
fairly rapid sequence.

TABLE 4
MEAN PROPORTIONS CORRECT FOR WORD PAIRS DIFFERING
IN NUMBER OF LETTER CORRESPONDENCES BETWEEN
MEMBERS (FIRST HALF OF EXPERIMENT,
.03-SECOND STIMULUS DURATION)

                       High-positive scorers                Low-negative scorers
Number of          Anxiety pairs    Neutral pairs       Anxiety pairs    Neutral pairs
corresponding      Number  Mean     Number  Mean        Number  Mean     Number  Mean
letters            of      propor-  of      propor-     of      propor-  of      propor-
                   pairs   tion     pairs   tion        pairs   tion     pairs   tion
0                  17      .583     17      .574        15      ?        21      .729
1                   5      .740      8      .750         4      .750      6      .875
2                   1      .750      2      .750         7      .800      1     1.000
3                   4      .563      0                   2      .375      0


An Uncontrolled Factor 


The neutral and anxiety-arousing pairs of 
words were matched with respect to length, 
first letter, and frequency of occurrence in
popular literature. Inadvertently a difference 
between anxiety-arousing pairs and neutral 
pairs might have occurred in similarity of 
the members which would account for the 
above results. Table 4 shows that there were 
in fact a greater number of anxiety-arousing 
pairs in which both members contained the 
same letters in the same positions. However, 
it can also be seen that this similarity between 
members of a pair does not give the expected 
poorer performance. In addition, when one 
examines the mean proportions correct under 
conditions of no corresponding letters between 
members of a pair, the results are the same 
as those shown in Table 3, i.e., as found 
without making the distinctions between pairs 
contained in Table 4. The source for the 
differences found between accuracy scores for 
anxiety-arousing words and neutral words 
does not seem to be a difference between
anxiety-arousing and neutral pairs in the
degree of similarity between members of a 
pair of stimuli. 
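The similarity index tabulated in Table 4, the number of positions at which the two members of a pair share a letter, can be computed as sketched below. The function name is hypothetical. Excluding the first letter is an assumption: every pair was matched on first letter, yet Table 4 lists many pairs with zero correspondences, so the initial letter was presumably not counted.

```python
def corresponding_letters(word_a: str, word_b: str, skip_first: bool = True) -> int:
    """Count positions at which two equal-length words have the same letter."""
    if len(word_a) != len(word_b):
        raise ValueError("pair members were matched for length")
    start = 1 if skip_first else 0
    a, b = word_a.upper(), word_b.upper()
    return sum(1 for x, y in zip(a[start:], b[start:]) if x == y)
```

For instance, HOLE and HANG share no letter position beyond the first, whereas CRUSH and CHEST share one (the S in fourth position).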

It may be said in conclusion that the 
present experiment provides evidence for per- 
ceptual defense uncontaminated by differences 
in response bias for the low-negative scorers 
and in the first half of the perceptual task 
of the experiment. Furthermore analyses 
which do not distinguish between experi- 
mental groups and between halves of the 
experiment continue to show the effect. How- 
ever, high positives do not show the per- 
ceptual-defense effect in the first half of the 
experiment nor do both experimental groups 
in the second half.


INFLUENCE OF MEASUREMENT CONDITIONS ON
SIZE ESTIMATES OF BODY PARTS 1

FRANKLIN C. SHONTZ

University of Kansas

In Exp. I, 48 undergraduate students provided ascending and descending
judgments of the sizes of 9 personal body parts and 6 non-body-part objects.
Body-part estimates proved to be more sensitive to starting position effects
than responses to non-body-part objects. In Exp. II, 72 Ss judged similar
stimuli, using 2 response modes under 3 sets of visual and 2 postural conditions.
All size estimates were found to be sensitive to variations in response mode
and visual conditions. Postural effects were not significant. The results suggested
a patterning of body-part size estimation tendencies that is relatively
independent of objective body-part size and that remains consistent across
many measurement situations.

It has been observed that differences exist
between size judgment responses to personal
body parts and to non-body-part stimuli
(Shontz, 1963a, 1963b). The suggestion has
also been made that body-part size estimation
responses are highly sensitive to artifacts of
the measurement situation (especially to variations
in the availability to the subject of
visual cues) but direct evidence has not been
provided to support this contention (Shontz,
1963b). The present paper reports the results
of two studies, designed to examine the effects
of some conditions under which body-part
size judgment responses may be measured and
the nature of the changes that occur with
systematic alteration of the measurement
situation. Of specific concern were the effects
of variations in: starting position (i.e., ascending
versus descending judgments), type of
response apparatus, visual conditions, and
body position of subject.

The first investigation tested for starting-position
effects. The second investigation
examined the influence of the remaining variables.
Results from both studies were combined
to evaluate the consistency of the
overall pattern of response tendencies to body-part
stimuli.

1 This investigation was supported, in part, by a
research grant from the Vocational Rehabilitation
Administration, Department of Health, Education,
and Welfare, Washington, D. C. Thanks are due to
picts Vigliano, Irene Vlahakos, Douglas Guess, and
Steven Van de Mark for their assistance.


EXPERIMENT I 
Subjects 


Twenty-four male and 24 female undergraduate 
students at a midwestern state university served 
as subjects. All were enrolled in a sophomore-level, 
general psychology course and were fulfilling a 
course requirement by participating in the study. 
With respect to the task at hand, all subjects were 
naive observers. 


Apparatus 


A 4-foot long, 1.25-inch diameter, wooden rod was 
mounted horizontally, 8 inches from the base of a 
wooden frame, shaped like an inverted U, with 
legs 2-feet high. The side of the rod away from the 
subject was calibrated in inches, and the rod was 
equipped with two loose-fitting, 4-inch long cardboard
collars. One side of an approximately equilateral
triangular sheet was fastened along the full
length of the top member of the wooden frame. The 
apex of this sheet was slit, so that it could be 
tied loosely around a seated subject’s neck. In this 
position, the sheet served as a screen, suspended 
between the top of the apparatus and the subject’s 
shoulders. It prevented the subject from viewing 
either the response rod or his own body parts, but it 
did not interfere with his manipulation of the 
movable collars from below the sheet. 

The apparatus, with appropriate supports, was 
mounted on a 30-inch table, placing the response 
rod at a total height of 38 inches from the floor and 
approximately 1 foot in front of the subject seated
before it.


Measurements 


The basic data consisted of deviation scores. A 
deviation score is the difference between a loga- 
rithmically transformed measure of stimulus size 
(minuend) and a similarly transformed measure of 
response size (subtrahend). The measures are 
essentially the same as those previously used by Fink
and Shontz (1960) and by Shontz (1963a). Logarithmic
values were taken to the base 10 and multiplied by
100 for convenience.
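
The deviation score described above can be sketched in a few lines. This is only an illustration: the function name and the example values are hypothetical, and the sign convention is an assumption taken from the Results, where negative scores indicate underestimation (that corresponds to response minus stimulus, whereas the paragraph above names the stimulus measure as the minuend, which would reverse the sign).

```python
import math


def deviation_score(stimulus_inches, response_inches):
    """Return 100 * (log10(response) - log10(stimulus)).

    Assumed sign convention: negative values mean the response was
    smaller than the stimulus (underestimation).
    """
    return 100.0 * (math.log10(response_inches) - math.log10(stimulus_inches))


# Hypothetical example: a 10-inch stick judged to be 9 inches long.
print(round(deviation_score(10.0, 9.0), 2))  # -4.58
```

Because the scores are differences of logarithms, equal proportional errors give equal scores regardless of stimulus size, which is presumably why the transformation was used.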


Procedure


The subject was seated before the apparatus and 
told that he would be asked to estimate the sizes 
of several objects, some of which would be parts of 
his own body. He was shown how to manipulate 
the collars on the rod so that the space between 
their inside edges represented his estimates of the 
required sizes. He was then given a practice trial, on 
which he was required to judge the length of a 
pencil shown by the experimenter. Following this 
familiarization, the apex of the sheet was tied 
around the subject’s neck, as described above. 

A second practice trial with the pencil was then 
administered, and the subject was permitted to ask
questions about the procedure. The experimenter 
then took a position on the opposite side of the 
apparatus and out of the subject’s field of vision. 
The subject was instructed to form “mental pictures” 
of the various stimulus objects that would be de- 
scribed or presented and to adjust the space on the 
rod to correspond exactly to the sizes of these 
mental pictures.

Nine body parts were described: 

Head width (HW), 


the width of your head across the temples, from 
a point just above the tip of one ear on one side 
to a similar point just above the tip of the ear 
on the opposite side, imagine the straight line 
distance between these two points, 


Head length (HL), 


the length of your head, from a point in the
center of your forehead to a point in the center
of the back of your head, imagine the straight line
distance between these points,


TABLE 1
ANALYSIS OF VARIANCE OF DEVIATION SCORES: EXPERIMENT I

Source                             df           MS        F
Starting position order (SPO)       1       262.23        -
Subjects (S) within SPO            46     1,128.73
Stimuli (St)                       14     1,246.47    11.32*
St X SPO                           14        48.90        -
S X St within SPO                 644       110.15
Starting position (SP)              1    21,587.64    63.32*
SP X SPO                            1     2,213.49     6.49*
S X SP within SPO                  46       340.94
St X SP                            14       221.49     3.94*
S X St X SP                       658        56.19
Total                           1,439

*p < .025.

Face (FC),

the height of your face, from the bottom of your 
chin, straight up to the top of your head. 


Waist (W),


the width of your waist, right at your belt-line, 
from one side of your body to the other, imagine 
the straight line distance between these points. 


Hips (HP), 


the width of your hips at the widest place, from 
one side of your body to the other in a straight 
line. 

Arm (A), 


the length of your arm, from the point of your 
elbow straight to the bone bump at your wrist. 

Hand (HD),

the length of the palm of your hand, from the
point where it joins the wrist to the tip of your
longest finger, your middle finger. Imagine your
hand to be outstretched.


Leg (L), 


the length of your leg, from the top of your 
kneecap straight down to the bone that protrudes 
at your ankle on the outside of your leg. 


Foot (FT), 


the length of your foot, from the back of your 
heel to the tip of your longest toe. 


Six non-body-part objects were presented. These
consisted of sticks of .25-inch wooden doweling 2.5,
3, 4, 6, 10, and 18 inches long. Each was placed in
the subject’s hands and he was told to feel it for as
long as he liked, until he had a good mental picture
of its length. The subject then handed the stick
back to the experimenter and made the appropriate
adjustment of the collars on the rod.

Body parts and non-body-part objects were described
or presented in random order, except that stick
stimuli were always estimated on Trials 3, 7, 9, 11,
and 13 of each 15-stimulus series. Twelve males and
12 females provided ascending judgments of all
stimuli (i.e., with the collars of the rod returned
to a closed position after each judgment) followed by
descending judgments of all stimuli (collars returned
to an extreme open position). This procedure was
reversed for the other half of the sample. Subjects
were randomly assigned to one of the two procedural
variations as they appeared for the experiment.


Results


In all statistical analyses, a probability
level of p < .025 was required for significance.
The results of the analysis of variance of
deviation scores are shown in Table 1.


Significant differences were found among stimuli,
between starting positions, and in the Stimuli
(St) X Starting Position (SP) and SP X
Starting Position Order (SPO) interactions.
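
As a rough check on the significant effects just listed, each F in Table 1 can be recomputed as a mean square divided by an error mean square. The sketch below uses values transcribed from Table 1. The pairing of each effect with its error term is an assumption inferred from the reported ratios, not something the text states, and the names are illustrative.

```python
# (effect mean square, assumed error mean square, F reported in Table 1)
TABLE_1 = {
    "St": (1246.47, 110.15, 11.32),        # error: S X St within SPO
    "SP": (21587.64, 340.94, 63.32),       # error: S X SP within SPO
    "SP X SPO": (2213.49, 340.94, 6.49),   # error: S X SP within SPO
    "St X SP": (221.49, 56.19, 3.94),      # error: S X St X SP
}


def f_ratio(ms_effect, ms_error):
    """F statistic as the ratio of two mean squares."""
    return ms_effect / ms_error


for name, (ms_effect, ms_error, reported) in TABLE_1.items():
    computed = round(f_ratio(ms_effect, ms_error), 2)
    print(f"{name}: computed F = {computed}, reported F = {reported}")
```

Under these pairings the recomputed ratios agree with the reported F values to two decimals.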

Starting-position effects were in the expected
direction. Ascending judgments produced a negative
mean deviation score (mds), indicating
underestimation of stimulus sizes (mds per judgment
= -4.13). Descending judgments produced
overestimation of stimulus sizes (mds = +3.61). The
significant SP X SPO interaction resulted from the
fact that position effects were more extreme when
ascending judgments were made first (mds ascending
= -5.80; descending = +4.42) than when descending
judgments were made first (mds ascending = -2.47;
descending = +2.79).

Analysis of the St X SP interaction revealed
that differences between ascending and descending
modes of judgment were significantly greater for
body parts than for non-body-part objects
(Mann-Whitney U = 3.00; n1 = 6; n2 = 9). For body
parts, the median deviation score on ascending
judgments was -5.64; the median deviation score on
descending judgments was +3.09. For non-body-part
objects, the comparable values were -2.11 and +2.21.

Mean deviation scores for non-body-part stimuli,
irrespective of starting position, fell into two
distinct groups: those for small (2.5, 3, 4, and
6 inch) stimuli (mds = +0.50) and those for large
(10 and 18 inch) stimuli (mds = -2.30). No
correlation existed between median objective
body-part sizes and mean deviation scores for the
corresponding body parts (tau = -.22, N = 9). On a
subject-by-subject basis, the median correlation
(tau) between objective body-part sizes and mean
deviation scores was -.11 (Q1 = -.44, Q3 = .22);
the median tau value for non-body-part objects was
-.40 (Q1 = -.60; Q3 = .07). These distributions
differ significantly (sign-test CR = 2.71).

Relative stability of the pattern of mean
deviation scores for the various body parts was
evaluated by comparing mds values obtained on
first-trial judgments only (ascending and
descending series combined) with the values
obtained across all trials. The tau between these
body-part patterns was .67 (p < .025, N = 9). The
tau between the overall ascending and descending
body-part patterns was .78 (p < .025, N = 9).

Details of the mds pattern for body-part stimuli
are presented in a later section of this report.

Discussion

The results of Experiment I were consistent 
with the general proposition that body-part 
size perception differs in certain important 
respects from the perception of non-body-part 
objects. The patterning of mean deviation 
scores for body parts bore little, if any, rela- 
tion to objective stimulus size, and body-part 
size estimation tendencies showed greater 
sensitivity to starting position effects than 
did non-body-part objects.

It remained to be determined whether the 
patterning of mean deviation scores on body 
parts is consistent, or merely a function of 
the apparatus and procedure employed in 
Experiment I. Experiment II was, therefore, 
designed to vary several aspects of the meas- 
urement situation to determine the sensitivity 
of mean deviation scores to alterations in 
procedure and to determine whether a stable 
pattern of body-part size judgment tendencies 
would emerge in spite of these variations in 
method. 




EXPERIMENT II 
Subjects 


Thirty-six male and 36 female subjects, none of 
whom had served in the previous study, were drawn 
from the subject pool described for Experiment I. 
The subjects were randomly divided into 12 groups 
of three males and three females each (see Procedure).

Apparatus


The apparatus for Experiment I was employed, 
both as described above and with several variations. 
For subjects who provided judgments in a standing 
position, the entire apparatus was raised on a special 
table to bring the horizontal rod to a level 52 inches 
from the floor. For subjects who responded in the 
line-drawing mode, the rod was removed and re- 
placed on the table surface by large (18 X 24 inch) 
sheets of newsprint on which response lines were 
drawn with a soft pencil. For subjects who provided 
judgments under the visual-response-feedback- 
present condition, the screen, blocking their view 
of responses, was removed and the subject was 
draped in a large, poncholike sheet that prevented 
him only from viewing his own body parts. Subjects
whose judgments were provided under the
visual-stimuli-absent condition wore a pair of
tight-fitting, blacked-out, cyclist’s goggles.

TABLE 2
ANALYSIS OF VARIANCE OF DEVIATION SCORES: EXPERIMENT II

Source                             df           MS        F
Posture (P)                         1     1,255.07     1.06
Response mode (R)                   1     8,654.00     7.31*
Visual conditions (V)               2    33,174.25    28.01*
P X R                               1     1,558.06     1.32
P X V                               2       950.97        -
R X V                               2     1,247.43     1.05
P X R X V                           2     2,263.99     1.91
Subjects (S) within P, R, and V    60     1,184.14
Stimuli (St)                       14     2,067.51    23.49*
P X St                             14        47.69        -
R X St                             14       102.98     1.17
V X St                             28       172.64     1.96*
Residual (a)                      938        88.01
Trials (T)                          1     1,320.88     8.59*
P X T                               1       253.14     1.65
R X T                               1        51.78        -
V X T                               2       212.13     1.38
Residual (b)                       67       153.73
St X T                             14       533.58    17.62*
Residual (c)                      994        30.29
Total                           2,159

a Contains S X St within P, R, V (df = 840) and higher order
interactions (df = 98).
b Contains S X T within P, R, V (df = 60) and higher order
interactions (df = 7).
c Contains S X St X T within P, R, V (df = 840) and higher
order interactions (df = 154).


Procedure 


The investigation employed two postural condi- 
tions (sitting, standing); two response modes (rod, 
lines); and three visual conditions (visual response
feedback present; visual response feedback absent,
i.e., with the triangular sheet in place, as in
Experiment I; all visual stimuli absent). These
variables were crossed in a 2 X 2 X 3 research design.

Subjects were randomly assigned to the 12 experi- 
mental conditions as they appeared for testing, and 
instructions paralleled those in Experiment I, al- 
though appropriate minor changes in wording were 
introduced as required by specific conditions. The 
15-stimulus series was administered twice as in the 
previous study, but in Experiment II all responses
were ascending judgments (descending judgments 
are not possible in the line-drawing mode). The 
sizes of the six stick stimuli were 3, 4.5, 6, 8.5, 12, 
and 17 inches. As in Experiment I the experimenter 
remained outside of the subject’s field of vision at 
all times. 
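
The crossed design described above can be sketched as follows. This is a simplified illustration: the subject labels and the seed are hypothetical, and the single shuffle stands in for the actual procedure, in which subjects were assigned as they appeared for testing.

```python
import random
from itertools import product

POSTURES = ("sitting", "standing")
RESPONSE_MODES = ("rod", "lines")
VISUAL_CONDITIONS = ("feedback present", "feedback absent", "stimuli absent")


def build_design(seed=0):
    """Assign 36 males and 36 females to the 12 cells, 3 + 3 per cell."""
    rng = random.Random(seed)  # hypothetical seed, for repeatability only
    cells = list(product(POSTURES, RESPONSE_MODES, VISUAL_CONDITIONS))
    males = [f"M{i:02d}" for i in range(1, 37)]
    females = [f"F{i:02d}" for i in range(1, 37)]
    rng.shuffle(males)
    rng.shuffle(females)
    design = {}
    for k, cell in enumerate(cells):
        # Three males and three females per experimental condition.
        design[cell] = males[3 * k:3 * k + 3] + females[3 * k:3 * k + 3]
    return design


design = build_design()
print(len(design), {len(group) for group in design.values()})  # 12 {6}
```

Crossing two postures, two response modes, and three visual conditions gives the 12 groups of six subjects mentioned under Subjects.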
Results


Significant differences were found between 
response modes, among visual conditions,
among stimuli, between trials, and in the 
Trials (T) x Stimuli (St) and Visual Condi- 
tions X Stimuli interactions (Table 2). 

The line-drawing response mode produced 
a significantly greater overall tendency to 
underestimate stimuli (mds = —8.79) than 
did the rod response mode (mds = — 4.79). 

The visual-feedback-present condition (mds 
= +0.88) produced significantly more verid- 
ical responses than either the visual-feedback- 
absent condition (mds = — 9.24) or the vis- 
ual-stimuli-absent condition (mds = — 12.01), 
both of which produced marked tendencies 
to underestimate stimulus sizes. 

The first trial produced greater underesti- 
mation (mds=—7.58) than the second 
(mds = —5.99); however, individual sub- 
ject consistency was high from trial to trial 
(r = .81).

The significant T X St interaction appeared 
to be due primarily to the fact that variability 
among mds values for non-body-part objects 
decreased markedly (F = 18.16), while vari- 
ability among mds values for body parts 
remained relatively stable (F = 1.19) from 
trial to trial. 

Under all visual conditions, the variabilities 
among non-body-part object mds values were 
smaller than the variabilities among body-part 
mds values. The discrepancies were most 
evident under visual-feedback-absent condi- 
tions (F = 17.91) and visual-stimuli-absent 
conditions (F = 8.51); they were least notice- 
able under visual-feedback-present conditions 
(F = 2.87).

Body-part stimuli, as a whole, were found 
to be neither more nor less sensitive than 
non-body-part objects to the effects of trials, 
response modes, or visual conditions.

No correlation existed between mds values and
median objective body-part sizes (tau = .06). (On a
subject-by-subject basis, the median tau was -.11.)
The correlation between stick size and mds values
was extremely high and negative, indicating steadily
increasing tendencies to underestimate non-body-part
stimuli as objective size increased (tau = -1.00).

Mean deviation score patterns. The overall
mds value pattern for the nine body parts in
Experiment II was significantly similar to the
overall pattern obtained in Experiment I
(tau = .83). Patterns obtained within Experiment II
were also generally consistent with each other.
The two conditions under which the subject could
not view his own responses correlated most highly
(tau = .78). Visual-feedback-present versus
visual-stimuli-absent conditions correlated next
most highly (tau = .61). Visual-feedback-present
versus visual-feedback-absent conditions correlated
positively but not quite significantly (tau = .50).

The similarity between the patterns ob- 
tained in the two experiments is evident when 
mds values are plotted for the various stimuli 
employed (Figure 1). Each body-part value 
in Figure 1 has been adjusted for objective 
stimulus size by applying a correction, based 
on the mds value for a non-body-part stimulus 
of comparable size. Applying this type of cor- 
rection increases the differences among body 
parts and tends to make the pattern stand 
out more clearly, but it does not alter the 
basic configuration of the data. 

On the basis of detailed statistical analyses 
and inspection of tables of mds values under 
the several experimental conditions, an over- 
all pattern of (uncorrected) mds expectancies 
was established for the various body parts. 
This pattern ranked seven of the body parts 
in the following order: head width, arm, waist, 
hips, leg, hand, foot. Tau values between this 
pattern and the patterns of each of the 120
subjects in both studies were then computed.
The median of this distribution was .52 (Q1
= .33, Q3 = .62). No evidence could be found
to suggest that these values were either
particularly high or low under any given set
of experimental conditions. In fact, the Q1,
Q2, and Q3 values for the two experiments,
considered separately, were identical.


Discussion 


The results of both investigations confirmed 
the suspicion that body-part size estimation 
responses are sensitive to variations in the 
measurement situation. In Experiment I, body 
parts were found to be more sensitive to
starting-position effects than non-body-part 
objects, a phenomenon previously noted by 
Wight and Moed (1962). In Experiment II 
the effects of trials, response modes, and 
visual conditions were significant, but they 
did not appear to operate differentially with 
respect to the two types of stimuli. Variations 
in subject posture (standing versus sitting) 
had no significant effect on judgments of 
either type of stimulus. 

When descending judgments were required, 
stimuli tended to be overestimated (Experi- 
ment I). Stimuli were underestimated most 
greatly when visual cues were entirely absent. 
Judgments tended to be most veridical when 
the subject could view both the environment 
and his own responses. (In Experiment II, on
which these last two statements are based,
ascending judgments only were provided; subjects
were unable to view their own body parts
under any conditions in either experiment.)
The line-drawing mode of response tended 
to produce greater underestimation than 
setting an opening on a rod. The first trial 
produced greater underestimation than a 
second trial with the same stimuli under 
identical conditions. 

Sex differences were not found; and, for the 
sake of expository simplicity, these data were 
not reported. Sex differences appeared in cer- 
tain previous investigations (Boraks, 1962; 
Shontz, 1963b), but dissimilarities among pro- 
cedures make direct comparison of results im- 
possible. Various sets of measurement condi- 
tions and body-part samples are probably not 
equally sensitive to group or individual differ- 
ences. By way of illustration, a study by Guess 
(1963) found that subjects with brain damage 
were inferior to non-brain-damaged subjects 
only in estimating the straight line distance 
between points on the body without somatic 
continuity (for example, from thigh to calf of 
leg, with knee bent). Both groups judged the 
sizes of naturally articulated body parts, 
such as those used in the present study, and 
of non-body-part objects with equal accuracy. 

A pattern of body-part size estimation 
tendencies was proposed that is relatively
independent of the objective sizes of the 
body parts judged. This pattern remained 
consistent under all measurement conditions, 
despite changes in overall response tendencies 
due to measurement conditions. Furthermore,
the same pattern was evident in the responses 
of individual subjects (though by no means 
so perfectly as to be considered universal). 
These findings indicate that an investigator 
cannot speak meaningfully about the absolute 
accuracy or veridicality of body-part size 
judgments. It is possible to establish either
that body-part sizes are overestimated or
that they are underestimated by selecting the
conditions under which responses are provided
by the subjects. The presence of a consistent
pattern of mean deviation score tendencies
for body parts suggests that future researches
might devote themselves to investigating such 
changes in patterning as may be induced by 
various conditions, for example, drugs, pain, 
affective experiences, etc. 

It is necessary to discuss briefly the nature 
of the body-part pattern identified in the
present research. Under all conditions, the
most consistent tendencies were for head
width and arm length to obtain the most
positive values (i.e., to be overestimated most
or underestimated least) and for hand and
foot length to obtain the most negative values.
Waist width, hip width, head length, and face
height tended to obtain intermediate values.
The configuration seems to suggest that deviation
scores possess cephalocaudal and proximal-distal
directionality. Both conclusions
would be consistent with the findings of 
Boraks (1962). Neither conclusion would be 
consistent with the findings of Shontz 
(1963b). 

It should be noted that body parts must be 
carefully and consistently defined if research
results are to be compared from one study
to the next. The present research shows that
mean deviation scores for head width, head 
length, and face (or head height) are not 
equivalent measures, despite the fact that 
these parts all lie in the same somatic region. 
Cephalocaudality of mean deviation scores 
can be said to be clearly present only when 
head width alone is used to represent the 
cephalic end of the continuum. Similarly, 
hand width, a body part used by Shontz in 
previous work, does not produce the same 
relative mds value as hand length, as defined 
in the present research. Therefore, the 
proximal-distal directionality of the pattern 
is also, to a large extent, a function of the
definition of the sample of body parts 
employed. 

Future researches, employing systematically 
varied samples of body parts, judged under 
otherwise constant measurement conditions, 
may suggest which aspects of body configuration
are most important in determining response
characteristics. Functional properties
may well prove to be more important than 
structures alone. For example, head width 
may be a more salient body unit than head 
length or height because of its continuity 
with the arrangement of the visual and audi- 
tory sense organs. Similarly, the importance 
of the reaching function of the arm may give
it figural properties that are not present in
more artificially defined body parts, such as
“knee width” or “circumference of the neck.”

It cannot be argued that the stimulus situations
for body-part and non-body-part objects
were identical in the present studies.
Verbal descriptions of body parts forced the 
subject to rely upon memory and upon im- 
mediately available cues from his own cloth- 
ing and from unspecifiable proprioceptive 
sources. Estimates of non-body-part objects 
were also made from memory, of course, 
since each stick was handed back to the 
experimenter before a response was made. 
The two memory processes may not be 
identical, however, because of the factor of 
differential subject familiarity with the two 
types of stimuli. A different procedure might 
reduce the importance of memory by identi- 
fying body parts with two-point tactile stimu- 
lation (held until the response is completed) 
and by permitting the subject to return to 
handling each stick as often as he likes while 
judging its size. Another approach might be 
to identify both kinds of stimuli verbally, 
using highly familiar non-body-part objects 
of standard size, for example, a foot ruler, 
a yardstick, the height of a dining room table 
from the floor. 

It should be made clear that the results 
of the present study do not imply failure of 
the subjects to discriminate among stimulus 
sizes. In Experiment I, the median tau values 
between response size, per se, and objective
stimulus sizes were 1.00 for non-body-part
objects and .72 for body parts. In Experiment
II, the comparable tau values were 1.00 and
.78. In every case, discrimination among body 
parts was less perfect than discrimination 
among non-body-part objects; but the differ- 
ences, though consistent, were not of great 
magnitude, and the ability of these subjects 
to discriminate among stimulus sizes was 
significantly high under all experimental 
conditions. 
The relationship between ego identity and maternal identification was studied
within the framework of Erikson’s psychosocial theory of ego development. 
Because maternal identification is an important childhood identification con- 
tributing to ego identity, it was hypothesized that it would be positively 
related to ego identity. This hypothesis was tested on 245 college women in 
late adolescence, and results showed that the relationship between ego identity 
and maternal identification was positive, as predicted. Higher ego-identity 
scores for sophomores than for freshmen were interpreted as evidence of a
more effective resolution of the identity crisis. Results support Erikson’s
theory that one of the early single identifications, that with mother, is sig- 
nificant for identity formation during adolescence. 


Central to Erikson’s (1950) comprehensive 
theory of the epigenesis of the ego is the 
construct of ego identity, the usefulness of 
which has been widely acknowledged. While 
it has enriched clinical practice and has begun 
to demonstrate heuristic value as theory, 
doubts have been expressed that its measure- 
ment could be achieved (Ruesch, 1961). In 
the present research, a new instrument de- 
signed to measure ego identity was used to 
test the hypothesis that ego identity and 
maternal identification in women are related. 

The study is confined to the adolescent 
period, the stage at which the conflict of 
identity versus role diffusion becomes intensified.
In the throes of this phase-specific conflict,
many adolescents, experiencing a keen
sense of futility and aimlessness, lose them- 
selves in “the crowd” to reduce the anxiety 
of what Erikson (1956) has called the 
“identity crisis.” In many of these students, 
clinical experience reveals poor mother- 
daughter relationships, which, in turn, sug- 
gest that faulty maternal identification is 
related to the syndrome of identity diffusion 
just described. 


Theoretical Framework 


Erikson’s (1956) psychosocial theory of
ego development encompasses the life span of
eight stages, from the oral period, to adolescence,
and eventually to maturity. Each stage,
involving as it does a decisive encounter with
the environment, embodies a conflict that
must be resolved for continued development
of the individual. The conflicts are not necessarily
restricted to any one stage, but tend to
predominate at particular periods.

1 Based on a doctoral dissertation at Fordham
University. The author wishes to express her
appreciation to Joseph F. Kubis for his advice and

Identity has been described by Erikson,
although not clearly defined in his treatment.
His description stresses the psychosocial
nature of ego identity, with the focus, not on
conflicts between psychic structures, but
rather on a conflict within the ego itself,
namely, of identity versus role diffusion.
Emphasis has shifted to the ego and the way
it is affected by society. Ego identity can
therefore be defined as the complex of
self-referent images which evolves through social
interactions, thereby delineating the self.
Identity formation encompasses both continuity
of self and identification with something
beyond self. This means the individual
must live up to his own expectations as well
as those of the society of his time. He is
constantly striving to maintain a sameness
of meaning for himself and to assure himself
that this constancy is recognized by others.
Identifications with significant persons of
his past contribute to an individual’s
ego-identity formation. At the time of adolescence,
childhood identifications are reintegrated
into the matrix of self-images which
then gradually assimilates new social and
occupational roles. Among the single early
identifications, that with the mother is most
important. As the ego resynthesizes all childhood
identifications, including maternal identification,
with recent libidinal changes,
emergent aptitudes, and current social and
occupational roles, a unique configuration
emerges. In the process, the ego works toward 
self-continuity, as well as constancy and 
sameness of meaning for himself and others. 
Erikson calls this sense of continuity and 
sameness which embraces early identification, 
ego identity. 


Research in Ego Identity 


While the concept of ego identity has ap- 
peared increasingly in the literature, research 
has been limited. In one of the first attempts 
to verify the construct operationally, G. W.
Bronson (1959) demonstrated the existence 
of “identity diffusion” as a dimension of 
identity structure. His study would seem to 
imply that adequate ego identity is character- 
ized by a self-concept evolved gradually from 
roots in earlier identifications and experienced 
as continuity in the personality. 

Using younger subjects, Howard (1960) 
found that girls with more identity diffusion 
experience a greater sense of isolation from 
one or both parents. No differences in identity 
diffusion were found among subgroups differ- 
ing in age, religion, or locale. 

Rassmussen (1961) studied ego identity in 
relation to character disorders, where psy- 
chosocial functioning is disturbed. He found 
that ego-identity scores differentiated the best 
adjusted group from the poorly adjusted men 
and from those with character disorders. 
Age was not significantly related to ego 
identity within the three groups. 


Identification 


In Freudian theory, identification plays a
fundamental role in personality development
(Fenichel, 1953). Osgood, Suci, and Tannenbaum
(1957) have related this view of
identification to recent developments in learning
theory by postulating that the process
of modeling in identification takes place
through meanings, as subsets of behavior
which the child learns. Lazowick (1955)
emphasized the distinction between imitation,
which refers to the learning of behavior, and
identification, which refers to the learning of
meanings. The extent of the relationship between
meanings of parent and meanings of
child indicates the degree to which identifica- 
tion has taken place. Wanda Bronson (1959) 
calls this relationship “ego identification,” 
implying a striving to be like the model in 
such a way that a blending of self with the 
qualities of the model brings about a real 
likeness but not an identity. 

Several studies have defined identification 
operationally as the assumed or real similarity 
between parent and child (Beier & Ratzeburg, 
1953; Gray, 1959; Helper, 1955; Sopchak, 
1952). Multiple administrations of the Minnesota
Multiphasic Personality Inventory
(MMPI), rating scales, and the semantic
differential were employed to determine
similarity. A consistent finding was that 
female patterns of identification with the 
mother or father were not as well defined as 
male patterns of parental identification. 

Erikson (1956) has explained how overidentification
with parents cuts off a “budding
identity” from its milieu, thus stifling the 
ego and restricting its emergent identity. 
In other words, such overidentification can 
only result in poor ego identity. Some in- 
vestigators (Helper, 1955; Sopchak, 1952) 
have suggested that even strong identification 
with the mother may be detrimental to the 
daughter’s adjustment. 
On the other hand, Erikson indicates that
where adequate feminine and masculine iden- 
tification take place, the personality develops 
into a healthy and harmonious blend of 
feminine and masculine qualities. Ordinarily 
these identifications are acquired through the 
mother and father. If, however, one of the 
identifications is suppressed, it remains un- 
synthesized by the ego, thus weakening the 
identity structure. Thus, where sex identity, 
central to one’s being (Snygg & Combs, 
1959), is denied its appropriate place in the 
complex of self-referent ideation, it is inevi- 
table that identity formation will be un- 
satisfactory. Implicit in the theory is the 
expectation that girls with inadequate ma- 
ternal identification will have poor ego 
identity. 

It is expected, then, that ego identity is
weak with poor maternal identification,
strong with good maternal identification, and
poor again with overidentification that
borders on the pathological. In a normal
group of girls, however, where such excessive 
overidentification may be minimal, one would 
generally anticipate a positive relationship be- 
tween ego identity and maternal identifica- 
tion. This hypothesis underlies the present 
research inasmuch as the sample selected, 
college freshmen and sophomores, represents 
a normal group of girls in which excessive 
overidentification with the mother was a 
relatively rare occurrence. 


PROCEDURE 
Subjects 


College students and their mothers were asked 
to participate in the study. Included in the sample 
were 130 freshmen and 115 sophomores, who ranged 
in age from 17 to 20 years. All were of middle-class 
socioeconomic background, whose parents were 
living at home. The students were in residence on 
the campus of a Catholic woman's college in the 
Midwest. 

Of the 329 mothers who were invited by mail to 
participate in the study, mothers of 143 freshmen 
and 131 sophomores returned their ratings on the 
semantic differential. Of this group, responses from
99 freshman and 83 sophomore mothers were in- 
cluded in the sample. The remainder were excluded 
either because their daughters were not participating,
because their own responses were incomplete, or 
because of clerical coding errors. 


Measuring Instruments 


Three test instruments were used: an Ego Identity 
Scale, a semantic differential, and a Rating Scale for 
Identity Traits. In addition, scores from the Scho- 
lastic Aptitude Test (SAT) of the College Entrance 
Examination Board (CEEB) were obtained from 
the college records. 

Ego Identity Scale. Since no personality inventory
available at the time appeared to be suitable for 
the measurement of ego identity, an Ego Identity 
Scale was constructed (Dignan, 1963). Its rationale 
was based on Erikson’s (1947, 1955) description of 
ego identity, and on several other discussions of 
identity (Academy of Religion and Mental Health, 
1960; G. W. Bronson, 1959; Lynd, 1958; Ras- 
smussen, 1961; Rubins, 1961; Ruesch, 1961; Strauss, 
1959; Symonds, 1951; Wheelis, 1958; White, 1952). 
After analyzing these reports on ego identity, the 
author derived several components of ego identity 
implicit in the various descriptions of the construct: 
Sense of Self, Uniqueness, Self-Acceptance, Role 
Expectations, Stability, Goal-Directedness, and Interpersonal
Relations (Dignan, 1963). Examining the
implications of these components of identity, the 
author then constructed 161 statements representa- 
tive of them. 

After a pilot study, some items were eliminated, 
many were reworded to avoid ambiguity, and a few
more were added. Content validity was determined
by submitting a final pool of 163 items to five 
judges who evaluated and selected items relevant to 
ego identity. One of the five was a woman, a 
clinical psychologist, and Dean of Women at a 
coeducational university. Of the four male judges, 
two were psychologists, working mainly with 
adolescents, and two were psychiatrists. Fifty of
the 163 items were considered indicative of aspects 
of ego identity in the judgment of 80-100% of the 
judges. These 50 items constituted the Ego Identity 
Scale, yielding a maximum score of 50. 

As indicated above, the construction of items was 
based on a description of ego identity found in the 
writings of Erikson and others who have critically 
examined and developed the construct. The several 
components which emerged as consistent descriptive 
indicators of ego identity are presented here in 
succinct form with illustrative items from the 
Ego Identity Scale. 


Sense of Self—an abiding and intimate experi- 
ence of self, an interior knowledge of what one is; 
a self-image central to one’s being (Academy of 
Religion and Mental Health, 1960; Erikson, 1956; 
Ruesch, 1961). 

At times I think I am a mystery even to myself. 
What I am now is pretty much what I am going 
to be. 

Uniqueness—a recognition of one’s separate 
identity, distinctness from others, independence 
(Academy of Religion and Mental Health, 1960; 
G. W. Bronson, 1959; Erikson, 1950; Symonds, 
1951; White, 1952). 

People seldom mistake me for another girl. 
I feel swallowed up by the crowd here at college.

Self-Acceptance—self-knowledge, self-appraisal, 
and subsequent self-acceptance or rejection of 
these self-images (G. W. Bronson, 1959; Rassmus- 
sen, 1961; Strauss, 1959; White, 1952). 
I like to picture myself as someone else.
I know I’m not perfect but I prefer to be as I am. 

Role Expectations—responsiveness to expecta- 
tions of “important others” in the achievement of 
adequate sex identity, appropriate social roles, and 
satisfactory vocational roles (Erikson, 1950; Gesell 
& Ilg, 1946; Snygg & Combs, 1959; Symonds, 
1951; White, 1952). 

I believe I see myself pretty much as others see me. 


It seems to me that most of the things girls do
are very dull.

Stability—preservation of the same meaning for
oneself and for others; implies a “rootedness,”
an enduring “I,” consistency (G. W. Bronson, 
1959; Erikson, 1947; Rubins, 1961; Ruesch, 1961; 
White, 1952). 

First I try to be like one person I know, then 
another. 

I don’t seem to be changing as much in college as I 
did in high school. 

Goal-Directedness—awareness of what one
stands for, where he is going, and self-assertion
to achieve goals (Academy of Religion and Mental
Health, 1960; Erikson, 1950; Symonds, 1951;
Wheelis, 1958; White, 1952).
I feel a deep need to live up to my ideals.

My problem is that I don’t really know what I
would like to become. 

Interpersonal Relations—sufficient delineation of 
self to permit involvement with others; intimacy, 
competition, love (Erikson, 1955; Lynd, 1958). 
Meeting new people is fun for me. 

It is easier to lay aside my principles than to
fight for them against opposition. 


The distributions of freshman and sophomore ego-identity
scores were basically normal. The means
and standard deviations for the Ego Identity Scale 
were 34.58 and 5.55 for 130 freshmen; 37.42 and 
4.73 for 115 sophomores. The difference between 
these means was significant well beyond the .01 level. 

The reliability of the Ego Identity Scale was 
determined in two ways. The odd-even correlation 
coefficient, corrected by the Spearman-Brown 
formula, was .74 for 130 freshmen and .64 for 115 
sophomores. The test-retest coefficients for a 1-week 
interval were .72 and .78 for smaller groups of 83 
freshmen and 96 sophomores, respectively. 
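The split-half figures above rest on the Spearman-Brown correction, which estimates full-length reliability from the correlation between two half-tests. The following is a minimal sketch of that standard formula; the function names are illustrative, and the half-test correlations it prints are back-computed from the reported corrected coefficients rather than values given in the report.

```python
def spearman_brown(r_half: float, factor: float = 2.0) -> float:
    """Predicted reliability when test length is multiplied by `factor`.

    With factor = 2 this is the usual split-half correction 2r / (1 + r).
    """
    return factor * r_half / (1.0 + (factor - 1.0) * r_half)


def implied_half_correlation(r_full: float) -> float:
    """Invert the two-half correction: r_half = r_full / (2 - r_full)."""
    return r_full / (2.0 - r_full)


if __name__ == "__main__":
    # Corrected odd-even coefficients reported in the text.
    for label, r_full in (("freshmen", 0.74), ("sophomores", 0.64)):
        r_half = implied_half_correlation(r_full)  # derived, not reported
        print(label, round(r_half, 3), round(spearman_brown(r_half), 2))
```

Under this formula, a corrected coefficient of .74 corresponds to an uncorrected odd-even correlation of roughly .59, and .64 to roughly .47.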

One estimate of validity of the Ego Identity Scale 
was obtained by correlating ego-identity scores with 
self-ratings on the Rating Scale for Identity Traits, 
administered to the same smaller group of 83 fresh- 
men and 96 sophomores. The correlation coefficient 
for 83 freshmen was .34, for 96 sophomores, .60, 
both significant beyond the .01 level. It was recog- 
nized, however, that part of the variance in these 
coefficients may be due to a response set inasmuch 
as both measures were obtained from self-report 
inventories. 

Data from the present study and from the in- 
vestigations of two other researchers who used the 
Ego Identity Scale contribute some beginnings 
toward construct validity. As expected, the differ- 
ence between freshman and sophomore ego-identity 
scores was significant and in the predicted direction.
Working with a similar age group, ranging from 17 
to 22 years, Thompson (1963) also reported that 
ego identity is a positively increasing function of 
age. Results of both studies are construed as a 
verification of White’s (1952) observation that the 
sense of identity becomes more pronounced with
age. Corroborative evidence for Erikson’s (1947)
statement that “In a general way it is plain that
everything that makes for a strong ego contributes 
to its identity [p. 379]” comes from Thompson 
(1963) and Cleare (1963), who, although utilizing 
different tests of ego strength, found significant posi- 
tive correlations with ego identity. Implicit in all 
discussions of deficient ego identity are the associated 
feelings of malaise, bewilderment, and dissatisfaction. 
The anticipated negative relation between ego
identity and anxiety was confirmed for both freshmen
and sophomores (Dignan, 1963). The alleged
importance of one’s commitment to goals and values
in determining ego identity was substantiated by
Thompson when she found that modification of an
individual’s value system entails some disintegrating
effects upon ego identity. However, with subsequent 
assimilation of a newly embraced value system, 
one’s sense of self is restored, and his ego-identity 
score increases. 

The Ego Identity Scale proved independent of 
intellectual performance. The correlation between the 
scale and CEEB scores for freshmen was .10, for 
sophomores, .01, neither of which reached the .05 
level of significance. This finding confirms a similar 
one by G. W. Bronson (1959) who studied identity 
diffusion in 46 collegians. 

Semantic differential. This instrument, devised by 
Osgood (1952) as a generalized technique for the 
measurement of meaning, was adopted by Lazowick 
(1955) to measure identification. In the present 
study, the same form, consisting of 10 concepts and 
9 scales of bipolar adjectives employed by Lazowick, 
was given to both students and mothers. The 
semantic differential yielded two scores, measuring 
Assumed Similarity and Real Similarity between 
mothers and daughters. The difference between the 
scale positions for each of 10 concepts as rated by 
the student herself and then as she “thinks mother 
would do it” provided a basis for measuring As- 
sumed Similarity to mother. The difference between 
a student’s ratings and her mother’s ratings on the 
10 concepts yielded a comparable measure of Real 
Similarity to mother. In both cases, the root of the 
summed squared differences was used as the measure 
of similarity. To facilitate the interpretation of the 
similarity scores, this difference statistic was scaled 
on the negative real line. With this scaling, a low 
score means low maternal identification, a high 
score, high identification. Both Assumed and Real 
Similarity have been reported in the literature as 
measures of maternal identification and were used 
in the present study. 
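The similarity score described above can be sketched as follows. This is an illustration under stated assumptions, not the scoring routine actually used: it assumes integer scale positions (7-point scales are typical for the semantic differential but are not specified here), it pools the squared differences over all concept-scale pairs, and it reads "scaled on the negative real line" as taking the negative of the distance so that larger scores mean greater similarity. The names `Ratings` and `similarity_score` are hypothetical.

```python
import math
from typing import Sequence

# ratings[concept][scale]: one scale position per bipolar adjective scale.
Ratings = Sequence[Sequence[int]]


def similarity_score(a: Ratings, b: Ratings) -> float:
    """Negative root of the summed squared differences between two profiles.

    0 means identical ratings; more negative means less similar.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("rating profiles must have the same shape")
    squared = sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    return -math.sqrt(squared)


if __name__ == "__main__":
    # 10 concepts x 9 scales, as in the form described in the text.
    daughter = [[4] * 9 for _ in range(10)]
    as_mother_would = [row[:] for row in daughter]
    as_mother_would[0] = [5] * 9  # one concept rated one step apart on every scale
    print(similarity_score(daughter, as_mother_would))  # Assumed Similarity
```

Passing the mother's own ratings as the second profile gives the Real Similarity score in the same way.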

Rating Scale for Identity Traits. This scale, employed
for purposes of validation, consists of 17
paired traits, each rated on a 6-point scale (Dignan, 
1963). The traits were evaluated by the same five 
clinicians who provided content validity for the 
Ego Identity Scale. 


Administration 


A procedure was employed to insure anonymity 
for all subjects. The students were tested in two 
sessions held a week apart. During the first, the 
Ego Identity Scale and the semantic differential 
were administered. The semantic differential was 
completed twice: initially the students rated the
concepts according to standard instructions; next 
they were instructed to rate the same 10 concepts as 
they believed their mothers would do it. During the 
second session, the Ego Identity Scale was readminis- 
tered, and students also rated themselves on the 
Rating Scale for Identity Traits. 

The semantic differential was also mailed to 329
mothers of freshmen and sophomores, with a covering
letter soliciting their cooperation and providing
instructions for returning the ratings by mail.


TABLE 1

MEANS AND STANDARD DEVIATIONS OF EGO IDENTITY
SCORES AT THREE LEVELS OF MATERNAL
IDENTIFICATION (ASSUMED SIMILARITY
TO MOTHER)

                               Ego identity
Level of                   M                      SD
identification a    Freshman  Sophomore    Freshman  Sophomore
Low                  33.80     36.65        5.21      4.98
Medium               34.10     37.55        3.90      4.46
High                 37.60     39.35        3.61      3.15

a Twenty scores randomly selected at each level for each class.


RESULTS 


The hypothesis that ego identity would 
vary with maternal identification was tested 
in a two-way analysis of variance design. One 
criterion of classification was level or degree 
of identification (low, medium, and high), the 
other was class designation (freshman, sopho- 
more). The high, medium, and low levels of 
maternal identification consisted of the upper, 
middle, and lower 20% of the identification 
scores. The analysis utilized both measures
of maternal identification, namely, Assumed
and Real Similarity to mother.

Assumed Similarity to Mother

Table 1 presents the means and standard
deviations for ego-identity scores at three
levels of maternal identification, that is, as
measured by assumed similarity. Since heterogeneity
of variance was not indicated, an
analysis of variance of the ego-identity scores
was completed, the results of which may be
found in Table 2. There is significant variation
(.01 level) among the three levels of
maternal identification. For both freshmen
and sophomores, ego-identity scores increase
from levels of low to high maternal identification.
Table 2 also reveals significant class
differences in ego-identity scores. Sophomores
obtained significantly higher ego-identity
scores than freshmen. The Level × Class interaction
is nonsignificant.


TABLE 2

ANALYSIS OF VARIANCE OF EGO IDENTITY SCORES FOR
THREE LEVELS OF MATERNAL IDENTIFICATION
(ASSUMED SIMILARITY TO MOTHER)

Source                         SS        df       MS         F
Level of identification      239.27       2     119.64      6.20*
Class                        216.02       1     216.02     11.19*
Interaction                   14.86       2       7.43       .39
Error                      2,199.85     114      19.30
Total                      2,670.00     119

* p < .01.
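Because the design is balanced (20 scores per cell), the F ratios in Table 2 can be approximately recovered from the cell means and standard deviations in Table 1. The sketch below does this under one assumption that the report does not state: that the tabled SDs were computed with divisor n, so that each cell contributes n × SD² to the error sum of squares. With that assumption the error term comes out close to the tabled 2,199.85; the function and variable names are illustrative.

```python
from typing import Dict, Tuple

# Cell means and SDs transcribed from Table 1: (freshman, sophomore) per level.
MEANS: Dict[str, Tuple[float, float]] = {
    "low": (33.80, 36.65),
    "medium": (34.10, 37.55),
    "high": (37.60, 39.35),
}
SDS: Dict[str, Tuple[float, float]] = {
    "low": (5.21, 4.98),
    "medium": (3.90, 4.46),
    "high": (3.61, 3.15),
}
N_PER_CELL = 20


def two_way_anova(means, sds, n):
    """Balanced two-way ANOVA from cell means, cell SDs, and cell size n."""
    levels = list(means)
    n_levels = len(levels)
    n_classes = len(means[levels[0]])
    cells = [m for lv in levels for m in means[lv]]
    grand = sum(cells) / len(cells)

    level_means = [sum(means[lv]) / n_classes for lv in levels]
    class_means = [sum(means[lv][j] for lv in levels) / n_levels
                   for j in range(n_classes)]

    ss_level = n * n_classes * sum((m - grand) ** 2 for m in level_means)
    ss_class = n * n_levels * sum((m - grand) ** 2 for m in class_means)
    ss_cells = n * sum((m - grand) ** 2 for m in cells)
    ss_inter = ss_cells - ss_level - ss_class

    # Assumption: the SDs use divisor n, so each cell contributes
    # n * SD**2 to the within-cell (error) sum of squares.
    ss_error = n * sum(sd ** 2 for lv in levels for sd in sds[lv])
    df_error = len(cells) * (n - 1)
    ms_error = ss_error / df_error

    return {
        "SS_level": ss_level,
        "SS_class": ss_class,
        "SS_interaction": ss_inter,
        "SS_error": ss_error,
        "F_level": ss_level / (n_levels - 1) / ms_error,
        "F_class": ss_class / (n_classes - 1) / ms_error,
        "F_interaction": ss_inter / ((n_levels - 1) * (n_classes - 1)) / ms_error,
    }


result = two_way_anova(MEANS, SDS, N_PER_CELL)
for name, value in result.items():
    print(f"{name}: {value:.2f}")
```

Small discrepancies from the tabled values are expected, since the inputs are rounded to two decimals.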

The relationship between ego identity and 
maternal identification is positive, as hy- 
pothesized; students who show high identifi- 
cation with their mothers obtained high ego- 
identity scores, and girls with low maternal 
identification were found to have low ego- 
identity scores. 


TABLE 3

MEANS AND STANDARD DEVIATIONS OF EGO IDENTITY
SCORES AT THREE LEVELS OF MATERNAL
IDENTIFICATION (REAL SIMILARITY
TO MOTHER)

                               Ego identity
Level of                   M                      SD
identification a    Freshman  Sophomore    Freshman  Sophomore
Low                  31.80     36.60        5.89      4.53
Medium               33.87     38.60        3.93      4.19
High                 35.53     38.00        4.49      2.88

a Fifteen scores selected randomly at each level for each class.


Real Similarity to Mother 


The means and standard deviations for ego- 
identity scores at three levels of maternal 
identification, as measured by Real Similarity 
to mother, are shown in Table 3. The data are 
based on a sample of 99 freshmen and 83 
sophomores, whose mothers completed and
returned the semantic differential. It is evident
from Table 3 that for freshmen, the
positive trend for ego-identity scores and
maternal identification (real similarity) cor- 
responds to that found for ego identity and 
assumed similarity. For sophomores, the trend 
is disrupted by a maximum ego-identity 
score at mid-level of maternal identification. 

However, the analysis of variance revealed 
that differences in ego-identity scores among 
the three levels of identification were not 
significant, as indicated in Table 4. Nevertheless,
class differences, as expected, are significant.

In comparing the degree of Assumed and
Real Similarity demonstrated by students, it 
was found that 81 of 99 freshmen showed 
greater Real than Assumed Similarity to their 
mothers, and 77 of 83 sophomores manifested 
a similar pattern. A chi-square analysis in- 
dicated that results for both classes are sig- 
nificant beyond the .001 level. 
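The report does not say which form of chi-square was computed. One plausible reading, sketched here purely as an assumption, is a one-sample test of the observed split (more Real than Assumed Similarity versus the reverse) against an even 50/50 split, with 1 degree of freedom. The function name is illustrative; the p value uses the identity that the upper tail of a chi-square with 1 df equals erfc(sqrt(x / 2)).

```python
import math
from typing import Tuple


def chi_square_even_split(k: int, n: int) -> Tuple[float, float]:
    """Chi-square (1 df) for k of n cases in one category vs. an even split."""
    expected = n / 2.0
    chi2 = ((k - expected) ** 2 + ((n - k) - expected) ** 2) / expected
    p = math.erfc(math.sqrt(chi2 / 2.0))  # upper-tail probability, 1 df
    return chi2, p


if __name__ == "__main__":
    for label, k, n in (("freshmen", 81, 99), ("sophomores", 77, 83)):
        chi2, p = chi_square_even_split(k, n)
        print(f"{label}: chi2 = {chi2:.2f}, p = {p:.2e}")
```

Under this reading both classes fall far beyond the .001 level, which agrees with the result stated in the text.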


DISCUSSION


As predicted, low identification (Assumed 
Similarity) with the mother was related to 
low ego identity, while high maternal identifi- 
cation was associated with strong ego identity. 
This is interpreted to mean that shared mean- 
ings between mother and daughter, indicative 
of maternal identification, contribute to sta- 
bilization of the daughter’s identity forma- 
tion. A common fund of meanings may reflect 
the daughter’s interiorization of the maternal 
image, which brings about a feeling of closeness
to her mother. This identification, reaching
back to childhood, safeguards the sense
of self-continuity during adolescence when 
many new and transient role identifications 
disturb the continuity and sameness of mean- 
ing for self and others. 

Moreover, since maternal identification is 
one of the important identifications resyn- 
thesized by the ego with other elements of 
personality at adolescence, it provides a long- 
standing identification with the feminine role. 
Perhaps it is this sex-role identification, estab- 
lished through the mother, which contributes 
another stabilizing element of personality 
during the period of role diffusion in adolescence.
Anchored in her feminine role, as she
has come to see it, the young girl can experiment
more freely with other roles, and at
some point make the initial social and vocational
commitments that begin to stabilize
her identity formation. Such initial role commitments
do not prevent students from embracing
other roles later on; they simply mean
that current stabilization of ego identity is 
facilitated by clarity of sex role. 

TABLE 4

ANALYSIS OF VARIANCE OF EGO IDENTITY SCORES FOR
THREE LEVELS OF MATERNAL IDENTIFICATION
(REAL SIMILARITY TO MOTHER)

Source                         SS        df       MS         F
Level of identification      110.06       2      55.03      2.64
Class                        360.00       1     360.00     17.29**
Interaction                   26.47       2      13.24       .64
Error                      1,749.07      84      20.82
Total                      2,245.60      89

** p < .001.


It is, of course, recognized that the group
studied is a special one in several respects.
The students are college girls, in a small
woman’s college, living away from home, girls
whose parents’ marriage is still intact. These 
circumstances do not afford the same op- 
portunities for experimentation with the 
wider variety of roles as those to which a 
working girl might have access, or those 
available to a student living at home and 
perhaps working to support her college educa- 
tion. Undoubtedly other populations of girls 
in late adolescence would provide fruitful 
areas for evaluating the impact of such crit- 
ical environmental variables on ego identity 
formation in young women. 

The better ego identity achieved by sopho- 
mores illustrates, in part, the resolution of the 
“identity crisis” described by Erikson (1956). 
During adolescence, stabilization of ego 
identity occurs gradually as the individual 
resynthesizes newly acquired social and vocational
roles with other elements of personality.
Several factors contribute to the more suc- 
cessful resolution of the identity crisis by 
sophomores. First, they have experienced con- 
tinuity of place to a greater extent than fresh- 
men, an important factor in identity forma- 
tion, according to Ruesch (1961). They feel 
at home in the college milieu. Second, they 
have identified with the social roles of young 
woman and college student longer than fresh- 
men who are still experimenting with varied 
social roles and expectations in collegiate life. 
In the third place, while freshmen continue 
to weigh vocational possibilities, sophomores 
usually have already committed themselves 
to major fields of study and fairly definite 
vocational goals. 

Research relevant to ego identity and ma- 
ternal identification is sparse. Howard (1960) 
reported that among high-school girls those
with poorer identity were more isolated from
one or both parents. This report did not
specify results for the same-sex or cross-sexed
parent. A similar finding emerged in the
present investigation; that is, when maternal 
identification, implying an emotional closeness 
to the mother, is low, so also is ego identity. 

In the present study, both Assumed and 
Real Similarity to mother were employed as 
measures of maternal identification. A con- 
sistent trend for ego-identity scores to in- 
crease with levels of maternal identification 
was found with both measures, although with 
the Real Similarity measure differences in 
ego-identity scores were not significant. These 
findings suggest that identity formation is 
more closely related to the girl’s subjective 
experience of mother-daughter similarity than 
to any objective likeness of thought between 
them. 

Contrary to Lazowick’s (1955) findings 
that college women showed greater Assumed 
than Real Similarity to mother, the present 
study indicates that by far the greater ma- 
jority of students manifest more Real than 
Assumed Similarity to mother. Thus the 
present findings fail to confirm the “masculine 
protest” reported by Lazowick. Differences 
in methodology and sampling may account for 
the contradictory results. Although both studies
used the same technique to assess Real
Similarity, Lazowick measured Assumed 
Similarity by students’ ratings on only 2 con- 
cepts, “Myself” and “Mother,” whereas, in 
the present investigation, all 10 concepts were 
rated by the students twice, once as they 
evaluated them and again as they thought 
their mothers would do so. The two samples 
also differ in size and background. Lazowick’s 
30 subjects included male and female students 
in a coeducational university, hence the size 
of the female sample was small. However, the 
present investigation used a sample of 245 
students in a woman’s college. Furthermore, 
scores for two college classes were analyzed 
separately and found to yield consistent re- 
sults. Not only does the present study fail, 
then, to corroborate Lazowick’s hypothesized 
“masculine protest” in college women; it indicates
that girls who identify well with their
mothers manifest better ego identity.
DENIAL OF AFFECT AND CONFORMITY ¹


DAVID WRENCH 


University of Oregon 


AND 


KIRK ENDICOTT 
Reed College 


In 2 replications of an experiment, Bales’ interaction process analysis was 
put to the new use of measuring individual differences in social perception. 
Ss who perceived a standardized social interaction as having less emotional 
significance conformed more to group pressure in a Crutchfield conformity 
situation. The result is seen as having significance both to the nature of 
conformity and the possible uses of interaction process analysis. 


Among the recent studies which examine the 
personality correlates of susceptibility to social 
pressure, there are a number which suggest that 
alienation of the individual from his own emo- 
tional reactions is an important factor. Hoffman 
(1953), who conceived of compulsive conformity 
as a reaction formation against hostile impulses 
towards parents, inferred the inability of high 
conformers to tolerate impulses from two pieces 
of evidence. These were their dislike of a larger 
number of TAT cards than low conformers, and 
their denial that their past experiences had influ- 
enced the stories they told to the TAT cards. 
He also found evidence from the content of 
responses to the cards that high conformers 
express hostility less directly than low conformers.
Similarly Tuddenham (1958), in summarizing
characteristics of high conformers as assessed
by a number of personality measures,
concluded that “A second group of variables 
negatively correlated with yielding seem to con- 
tain an element of psychological sensitivity and 
perceptiveness, both of self and others [p. 12].” 
Crutchfield (1955) found that high conformers 
were rated high on overcontrol of impulses, in- 
hibition, and lack of insight into their own 
motives and behavior. 

A study utilizing a more direct measure of 
emotional alienation, the anti-intraception items 
from the F Scale, was done by Linton and 
Graham (1959). Although they related these 
items to change in response to a persuasive 
message rather than conformity to group pressure,
their positive results suggest that a relationship
to conformity might be found also.
Recently, Breger² (1963) found that high conformers
were less likely than low conformers to
express hostility in an experimental situation.
This result can well be conceived of as one
aspect of a more generalized alienation from
emotional responses.

¹ This study has been supported by a Faculty
Summer Research Grant to the senior author from
the Office of Scientific and Scholarly Research of
the University of Oregon. We are indebted to Louis
Breger and Julie Sandall for aid in carrying out the
study. Endicott was a National Science Foundation
research participant at the time the study was
carried out.

The present study was designed as a more 
direct test of the dependence of conformity on 
the denial of affect, defined as the inability of 
an individual to perceive emotional meaning in 
social interaction. Because of the small sample 
involved in the study, a replication was carried 
out using similar procedures. 


PROCEDURE 
Subjects 


Data were gathered on 26 subjects of both sexes 
drawn from an introductory psychology class. Four 
subjects were eliminated prior to data analysis 
because prior knowledge of conformity experiments 
prevented their being adequately deceived to measure 
conformity.

In the replication of the study, data were gathered
on 27 subjects from an introductory sociology class 
and 5 were eliminated because of prior knowledge 
of conformity experiments. There were thus 22 cases 
analyzed in each replication. 


Denial of Affect 


The inability of individuals to perceive social inter- 
action as having emotional meaning was measured 
by a novel use of Bales’ (1950) interaction process 
analysis. While raters are generally trained in the
use of Bales’ categories and their ratings used as
an indication of the actual nature of social inter- 
action, in the present study the categories were 
used to record the differing perceptions which rela- 
tively untrained raters had of a standardized social 
interaction. 

The rationale for this procedure stems from a
comment by Bales (1950). In Interaction Process
Analysis he noted that “It appears that there may
be certain emotional biases which make it difficult
for certain persons to recognize or score certain
types of interaction [p. 88].” On the basis of previous
research on conformity, it appears that denial
of affect may be such a bias. If it is, it should be
reflected in an inability or unwillingness of some
partially trained raters to code affect-laden statements
in the appropriate interaction process analysis
categories.

² L. Breger, personal communication, 1963.

The measurement of the acceptance or denial of 
affect involved four steps. 


1. A script was prepared of the interaction to be
coded. The central figure was a foreign student, and
the interaction consisted of his discussing cultural 
differences with three American students. This situa- 
tion was chosen for its similarity to a group pressure 
situation, the assumption being made that the sub- 
jects would identify with the central character and 
differ in their interpretations of the statements by 
the group members. The script was broken down 
into 48 units, with almost all of the statements by 
the group members expressing either antagonism to 
or solidarity with the central character. 

2. A lecture was delivered to the subjects during 
their usual class period, explaining interaction process 
analysis and some of the research findings obtained 
with it. 

3. The subjects were then asked to code the inter- 
action performed for them, partly to familiarize 
themselves with the categories and partly to gather 
data on the reliability of untrained judges. The script 
was read through twice by the actors, first rapidly to 
give the subjects an idea of the total interaction, 
then more slowly to permit coding. In the initial 
study the part of the foreign student was actually 
played by a student from Scotland; when the study 
was repeated all of the parts were played by 
American students. 
4. Acceptance or denial of affect was scored from
the ratings made by the subjects. The “index of
acceptance of affect” is the number of statements
by group members which the subject coded in
Categories 1, 2, 11, or 12 in the interaction process
analysis. These categories, “shows solidarity,” “shows
tension release,” “shows tension,” and “shows
antagonism,” have the greatest emotional significance.
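The scoring rule in Step 4 amounts to a simple count. A minimal sketch follows, assuming each subject's record is a list of category numbers (1 to 12), one per group-member statement in the script; the function name and input layout are illustrative and are not described in the report.

```python
from typing import Iterable

# Interaction process analysis categories treated as affect-laden in Step 4:
# 1 shows solidarity, 2 shows tension release, 11 shows tension,
# 12 shows antagonism.
AFFECT_CATEGORIES = frozenset({1, 2, 11, 12})


def acceptance_of_affect(codes: Iterable[int]) -> int:
    """Count the group-member statements coded in an affect-laden category."""
    total = 0
    for code in codes:
        if not 1 <= code <= 12:
            raise ValueError(f"not an interaction process category: {code}")
        if code in AFFECT_CATEGORIES:
            total += 1
    return total


if __name__ == "__main__":
    # Hypothetical codes one subject assigned to seven group-member statements.
    print(acceptance_of_affect([1, 5, 12, 7, 11, 2, 6]))
```

A lower count indicates greater denial of affect, since fewer statements were placed in the emotionally significant categories.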


Conformity


Conformity was measured using a somewhat 
simplified version of Crutchfield’s (1955) conformity 
apparatus. The subjects were seated around a table 
and separated from each other by plywood partitions.
Each subject confronted a double throw
switch to be used in indicating his agreement or dis- 
agreement with statements read by the experimenter, 
and five pairs of lights purporting to record the 
judgments made by himself and the other subjects. 


Each subject was led to believe that he was in the
last position on each trial, while the “judgments”
of the other subjects were actually manipulated by
the experimenter from a control panel with five
double throw switches. The subjects were thus subjected
to the same apparent social pressure, and
the experimenter recorded their judgments, which 
were actually made simultaneously, from the lights 
at the control panel. Subjects were run in groups 
of four or five. 

The critical items used were 11 information items 
on which pretesting by Breger² revealed that from
72% to 96% of college students gave the correct 
answer under nonpressure conditions. Over all of the 
11 items the percentage of correct response from a 
group of 50 pretest subjects was 86.4%. The critical 
items were embedded in a list of 45 information and 
opinion statements read to the subjects by an 
experimenter. 

In order to reduce the suspiciousness which might 
result from obtaining responses of the subjects to 
the critical items under nonpressure as well as 
pressure conditions, responses were only obtained 
under pressure conditions. Some inaccuracy is thus 
involved in taking as an index of conformity the 
number of critical items on which the subject gave 
an incorrect response, without measuring the number 
of incorrect responses which he would have given 
under nonpressure conditions. 


RESULTS AND CONCLUSIONS 


In the first experiment, a correlation of -.285

was found between the index of acceptance of 
affect and the index of conformity. Although 
this correlation is in the predicted direction, it 
is only suggestive, reaching the .10 level by a 
one-tailed t test (N = 22). The study was
therefore replicated. 
In the replication, a correlation of -.358 was
found between the two indices, reaching the .06
level by a one-tailed test (N = 22). The two
significance tests were combined by using
Stouffer’s technique as reported by Mosteller and
Bush (1954, pp. 328-331). The results of the
two replications combined reach the .025 level
by a one-tailed test. 
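Stouffer's technique converts each one-tailed p value to a standard normal deviate, sums the deviates, and divides by the square root of the number of tests. The sketch below applies the unweighted form to the two reported levels (.10 and .06); it uses the rounded levels from the text rather than exact p values, so the combined figure is approximate, and the function name is illustrative.

```python
from math import sqrt
from statistics import NormalDist
from typing import Sequence, Tuple


def stouffer(p_values: Sequence[float]) -> Tuple[float, float]:
    """Combine independent one-tailed p values (unweighted Stouffer method).

    Returns the combined z and its one-tailed p value.
    """
    std = NormalDist()  # standard normal distribution
    z_scores = [std.inv_cdf(1.0 - p) for p in p_values]
    z = sum(z_scores) / sqrt(len(z_scores))
    return z, 1.0 - std.cdf(z)


if __name__ == "__main__":
    z, p = stouffer([0.10, 0.06])
    print(f"combined z = {z:.3f}, one-tailed p = {p:.4f}")
```

With these inputs the combined z is about 2.0, which corresponds to a one-tailed p slightly below .025, in line with the combined result reported above.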

An alternate way of analyzing the data is by 
simply pooling the results of the two replications. 
To the extent that there is a replication effect, 
this procedure is statistically less powerful. When 
the data are analyzed in this manner, the correla- 
tion for the two groups combined (N = 44) is 
-.324 and is significant at the .025 level by a 
one-tailed test. It was therefore concluded that 
the denial of affect is significantly related to 
conformity to group pressure. 
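
As a rough check on the reported levels, the t statistic for a correlation can be computed from r and N with the standard formula t = r * sqrt((N - 2) / (1 - r^2)), with N - 2 degrees of freedom. The sketch below uses only the correlations and sample sizes given in the text; the function name is an illustrative assumption.

```python
from math import sqrt


def t_from_r(r, n):
    """t statistic (df = n - 2) for testing a Pearson correlation against zero."""
    return r * sqrt((n - 2) / (1.0 - r * r))


# Correlations and sample sizes as reported in the text.
t_first = t_from_r(-0.285, 22)        # first experiment
t_replication = t_from_r(-0.358, 22)  # replication
t_pooled = t_from_r(-0.324, 44)       # the two groups pooled

for label, t in [("first", t_first),
                 ("replication", t_replication),
                 ("pooled", t_pooled)]:
    print(f"{label}: t = {t:.2f}")
```

Compared with tabled one-tailed critical values (about 1.325 at the .10 level and 1.725 at the .05 level for 20 df, and about 2.02 at the .025 level for 42 df), these statistics agree with the levels stated: the first correlation just reaches .10, the replication falls just short of .05, and the pooled correlation passes .025.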

The implications of this finding seem to the 
authors to be twofold. It clearly lends support 
to the view, suggested by previous research, that 
alienation of the individual from his own emo- 
tional responses is a factor in conforming be- 
havior. The discovery of how important a factor 
it is must await the development of measurement 
techniques more refined than those employed 
in the present study. 

Furthermore, the finding may have some 
methodological significance. Although the rela- 
tionship found in the present study is small, it 
suggests that with further refinement of tech- 
niques interaction process analysis may be profit- 
ably put to a new use, the measurement of 
individual differences in perception. 
ABILITY EVALUATION, POSTDECISION DISSONANCE, AND 
CO-WORKER ATTRACTIVENESS 1 


DENNIS T. WILSON 


University of Florida 


It was hypothesized that a person’s attraction to an evaluator of his ability 
would depend upon the consonance between a decision regarding attempting 
a task and the appraisal which he receives. In a laboratory experiment, the 
members of 2-person teams were motivated to work for rewards by attempting 
to pass tests. Experimental Ss were induced to make a personal decision to 
avoid 1 test on the basis of an ambiguous pretest failure. Control Ss, given the 
same pretest results, were arbitrarily denied the opportunity to take the test. 
Then each S received either a favorable or unfavorable test ability evaluation 
from his co-worker. As predicted, the unfavorable evaluator was more attractive 
to experimental Ss than control Ss. 


Several investigators (Aronson & Carlsmith, 
1962; Bergin, 1960; Bramel, 1962) provide evidence 
that strong dissonance can be produced by 
giving a person an evaluation of his personality 
or performance which is inconsistent with his 
self-concept. As Festinger (1957) describes dissonance 
as aversive, it follows that a person 
should tend to dislike and avoid an evaluator 
who gives him an appraisal which is dissonant 
with a self-evaluation. Accordingly, a person 
should dislike an evaluator of his ability whose 
assessment is dissonant with a self-appraisal of 
competence. 


1 The article is based upon a doctoral dissertation 
(Wilson, 1962) submitted to the University of Minnesota. 
The author wishes to express his gratitude 
to Ben Willerman, Dana Bramel, and Harold H. 
Kelley for their valuable advice and criticism. 


unfavorable. Each control subject received only 
the high or the low deviant appraisal. The 
experimental subjects exceeded the control subjects 
in their rejection of both deviant assessments, 
and also tended to be less attracted to 
the deviant assessors. There are at least two 
possible explanations of these results. The deviant 
appraisal may have been rejected by experimental 
subjects because it aroused dissonance or because 
it was perceived as more extreme, when contrasted 
with moderate assessments. 

Stronger evidence is provided by an experiment 
by Deutsch and Solomon (1959). Some 
girls, who worked on a group task, were led to 
believe that they had done poorly; other girls 
were led to believe that they had done well. 
Subsequently, each girl received a fictitious comment 
from a teammate, which either derogated 
or congratulated her performance. The girls who 
received evaluations dissonant with their perceived 
performance rated their evaluators more 
unfavorably. In this study, the true effect of 
dissonance may have been dwarfed, as the aptitude 
evaluations were confounded with acceptance 
or rejection of the subjects. That is, the 
derogatory comment tended to be endorsed by 
poor performers, even though the evaluation 
implied rejection as a teammate. 

The experiments cited illustrate that a performer 
may reject evaluators whose assessments 
are inconsistent with his knowledge of his performance 
or with an awareness of others’ evaluations 
of him. Ability evaluators may also be 
rejected when their appraisals are inconsistent 
with a self-evaluation implicit in a decision about 
attempting to achieve. For example, suppose that 
a person confronts a decision to attempt mastery 
of a lucrative but hazardous test situation. If the 
person avoided the test because he decided that 
he was inept, he should tend to dislike someone 
who affirmed his competence. Since the person 
decided to abandon his chance for reward, he 
should prefer someone who attested to his 
incompetence. Thus, a performer’s attraction 
to an evaluator should depend upon the consonance 
between his decision and the appraisal 
he receives. 

As a test of the hypothesis, the following 
experiment was conducted. Experimental subjects 
were induced to decide that they could not pass 
a test and to avoid taking it. Control subjects 
were arbitrarily denied the opportunity to take 
this test. Subsequently, half of the subjects in 
each condition received a co-worker’s appraisal 
of them which affirmed their competence; the 
other half received an evaluation which attested 
their incompetence. If attraction to the assessor 
depends on the postdecision dissonance effect of 
his comment, the competence evaluation should 
produce less attraction after a personal decision 
to avoid the test than after a forced decision. 
And the incompetence evaluation should produce 
more attraction following a personal decision 
than after a chance decision. 
METHOD 
Subjects 


The subjects were 84 male introductory psychology 
students at the University of Minnesota, who 
reported to the experiment in pairs. 


Personal Decision (PD) Condition 


Task and incentive instructions. The experimenter 
began each session by leading two previous strangers 
to believe that they were a team competing against 
all other teams in the experiment. The subjects were 
then told that their team could earn $10 and 8 
examination points by performing well on a series 
of six tests. In turn, each member would take a 
battery of three tests, as in a relay race. The team 
would earn 100 game points for each test that a 
member passed, but would lose 100 points for each 
test that a member failed. Their team would earn 
money if it finished in the top 20% of all teams, 
and would earn examination points only by earning 
a total of 300 points. 

The experimenter explained that a member could 
bypass any test which he thought he could not 
pass. When a member decided to avoid a test, the 
team would simply lose one opportunity to earn 
points. To provide some basis for decision, the 
testing member would be given pretests, which would 
yield only “rough indications of ability” to pass 
the tests proper. The experimenter explained further 
that the testing member would be informed of his 
pretest grade before making a decision to take each 
test, but he would not be told his exact pretest 
score. 

The subjects were then instructed that while one 
member was working on his pretests and tests, the 
other member was to review a copy of the same 
testing material. The nontesting member would be 
informed of the testing member's pretest grades and 
unmarked answers. While a test-taking decision was 
being made, the nontesting member would be allowed 
to write a comment to his partner, which would 
be delivered after the decision. 

Procedure. When the experimenter finished reading 
the instructions, he placed each team member in a 
separate room. Then each subject was told that 
he would be the first to take his tests. Subsequently, 
each subject was given the pretest for Test 1, which 
was the only test material actually administered. 
The pretest concerned a hypothetical ability called 
“probability inference.” Each test item required the 
subject to choose the most probable reason why 
a man would act in the manner described. Several 
minutes after finishing the pretest, each subject was 
informed that he had failed the pretest and was 
given 5 minutes to decide whether he should take 
the first test. 

The subject was confronted with a dilemma. The 
instructions implied that if he decided to take the 
test and failed, the total possible team score would 
be 400 points. As 300 points were needed to earn 
the 8 exam points, a failure would jeopardize the 
subject’s chances of earning credit for the 2-hour 
experiment. But if he avoided the test and could 
pass it, he would needlessly hinder his chances of 
earning money. 
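
The arithmetic behind the dilemma can be laid out in a short sketch. It assumes only the scoring rules stated in the instructions (six tests, 100 points gained for a pass, 100 points lost for a failure, nothing gained or lost for an avoided test, and 300 points required for examination credit); the function and constant names are illustrative and are not part of the original procedure.

```python
POINTS_PER_TEST = 100
TOTAL_TESTS = 6
EXAM_CREDIT_THRESHOLD = 300


def max_team_score(first_test_outcome):
    """Best team total still attainable after Test 1 is passed, failed, or avoided."""
    points_from_first = {
        "pass": POINTS_PER_TEST,
        "fail": -POINTS_PER_TEST,
        "avoid": 0,
    }[first_test_outcome]
    remaining_tests = TOTAL_TESTS - 1
    # Assume every remaining test is taken and passed.
    return points_from_first + remaining_tests * POINTS_PER_TEST


for outcome in ("pass", "fail", "avoid"):
    best = max_team_score(outcome)
    margin = best - EXAM_CREDIT_THRESHOLD
    print(f"{outcome}: best possible total = {best}, margin over threshold = {margin}")
```

A failure on Test 1 leaves a ceiling of 400 points, only 100 above the 300 needed for credit, whereas avoiding the test leaves a ceiling of 500 but forgoes the 100 points that a pass would have earned toward the money prize.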

Ability evaluations by the co-worker. After a sub- 
ject had made his decision, he received a bogus 
comment, supposedly written by his partner. At 
random, one subject on each team received an 
evaluation affirming his ability to pass Test 1, and 
the other subject received an evaluation attesting 
his inability to pass the test. 

The competence evaluation read: 


Even though you failed the pretest, I don’t 
think you failed by much. If you give it a real 
try, I think you would have a pretty good chance 
of passing test 1. 


The incompetence evaluation read: 


To tell the truth, I don’t think you did very 
well on the pretest. Too few of your answers were 
right. I don’t think you have too good a chance of 
passing test 1. 


Ratings of the co-worker. After reading the note, 
each subject completed a rating form which sup- 
posedly allowed the experimenter to find out how 
the teams were working together. The assessment 
form was composed of 21 7-step semantic differential 
ratings. The ratings included several accuracy and 
emotional support assessments of the co-worker’s 
evaluation and personality. The form also contained 
ratings of attraction to the partner, preference for 
another partner, and desire for comments after subsequent 
pretests. In addition, the form included a 
rating of subject’s chances of passing Test 1. 


Chance Decision (CD) Condition 


The instructions and procedures for the control 
subjects were equivalent to those for the experimental 
subjects, with one exception. Control subjects 
were not allowed to make a personal decision 
to take the tests after receiving their pretest grades. 
Instead, they were told that the decisions would be 
made for them by a “chance” procedure having odds 
which favored their taking each test. All control 
subjects were told that they had failed the pretest 
for Test 1, and that “chance” decided that they 
would not take the first test. As in the PD condi- 
tion, half of the subjects received the incompe- 
tence evaluation, and the other half received the 
competence evaluation. 


RESULTS 


Eighteen subjects were successfully assigned 
to each condition in the 2 X 2 design. Only those 
subjects who decided to avoid the test completed 
the experiment. Four subjects decided to take 
the test and eight subjects did not believe the 
information that they were given. The data 
analysis does not include these subjects. 

Table 1 shows the mean ratings by condition. 
There is strong evidence that PD subjects who 
received the incompetence evaluation were left 
with less postdecision dissonance than those who 


TABLE 1 
MEAN RATINGS, SIGMAS, AND COMPARISONS BY CONDITION 

                     Comment          Personality      Desire 
                     evaluation       evaluation       for       Attraction       Self- 
Condition            Acc.    Supp.    Acc.    Supp.    comment   Abs.    Rel.     evaluation 

No. ratings          …       …        …       …        1         1       1        1 
PD 
  Competent (A) 
    M                4.07    2.85     4.03    2.50     2.00      2.72    2.39     4.44 
    Sigma            …       …        …       …        …         …       …        … 
  Incompetent (B) 
    M                1.17    2.50     2.18    2.30     1.28      2.17    1.39     5.67 
    Sigma             .33     .60      .52     .74      .57      1.32     .61     1.50 
CD 
  Competent (C) 
    M                3.13    2.20     2.59    1.89     1.44      1.72    1.39     3.94 
    Sigma            …       …        …       …        …         …       …        … 
  Incompetent (D) 
    M                2.98    3.85     4.12    4.06     2.44      3.44    2.11     4.44 
    Sigma             .34     .61     1.39     .98     1.65       .95    1.67     1.20 
Comparisons 
  A versus B         8.53**  1.33     5.89**  …        …         …       …        2.66** 
  A versus C         2.52**  2.27*    4.68**  …        …         …       …        1.08 
  B versus D         6.83**  8.17**   5.56**  …        2.83*     …       …        2.67** 
  C versus D         …       …        …       …        …         …       4.12**   1.08 
  A-B versus C-D     …       5.43**   …       …        …         …       …        1.20 


Note. Favorable ratings have low scores. Acc. = accuracy; Supp. = emotional support; 
Abs. = absolute; Rel. = relative. 
* p < .05, one-tailed test. 
** p < .01, one-tailed test. 
received the competence evaluation. For the ter- 
minal self-evaluations of PD subjects who were 
given the incompetence evaluation were reliably 
lower than the self-evaluations of those who were 
given the competence evaluation. And there was 
no reliable difference in the self-evaluations 
of the CD subjects who received high versus low 
assessments. 

Appraisal of the evaluator did depend upon 
the dissonance effect of the comment. Table 1 
shows that evaluation favorability and source of 
decision interact according to prediction for the 
accuracy and emotional support ratings of the co- 
worker’s comment and personality. The compe- 
tence evaluation led to higher ratings following 
the PD than following the CD. With three 
exceptions, all components in each interaction 
effect reveal significant differences in congru- 
ence with prediction. The PD subjects receiving 
the competence and incompetence evaluations do 
not differ reliably in their ratings of emotional 
support. However, as the CD subjects who re- 
ceived the incompetence evaluation considered 
the co-worker much less supporting than those 
who received the competence evaluation, dis- 
sonance did appear to influence the support 
ratings. The PD subjects may have considered 
the incompetence evaluation to offer emotional 
support in maintaining a conservative orientation 
when faced with their decisions. 

Dissonance also appeared to affect the desire 
to receive further comments from the co-worker. 
On the desire for comments rating, the inter- 
action effect and three of four component ef- 
fects are significant in keeping with prediction. 
The only unreliable rating difference is between 
the PD subjects and CD subjects who received 
the competence evaluation. 

On the critical ratings of attraction to the 
partner, the predicted interaction and all expected 
component differences are confirmed, except one. 
Although the trend is in the predicted direction, 
the attraction ratings of PD subjects receiving 
incompetence and competence evaluations do not 
differ reliably. But on the ratings of preference 
for the same co-worker, the interaction effect 
and all component effects are reliable in accord- 
ance with prediction. The results give strong 
support to the hypothesis that appraisal of an 
ability evaluator and attraction to him depend 
on the consonance between his evaluation and the 
performer’s self-evaluation. 


DISCUSSION 


The present findings are consistent with those 
of Aronson and Carlsmith (1962). These investi- 
gators found that when a person’s performance 
is dissonant with his expectations, he will change 
his performance so as to restore consonance 
with his self-evaluation of ability. Thus, per- 
formers who had scored poorly in the past 
changed answers that gave them an isolated high 
performance. 

The results of the present study, together with 
Aronson and Carlsmith’s findings, cast serious 
doubt on Homans’ (1961) contention that favor- 
able evaluations of performance are generalized 
reinforcers. If Homans were right, a performer 
should increase his tendency to make responses 
leading to positive evaluations, under all conditions. 
Aronson and Carlsmith’s findings negate this 
expectation. 

Further, Homans’ assertion implies that a per- 
former should always be more attracted to an 
evaluator who gives him a favorable evaluation 
of his performance. In the present study, when a 
performer had decided to avoid a test, he preferred 
an evaluator who gave him an unfavorable 
assessment of his pretest performance. So, 
instead of supporting Homans’ hypothesis, the 
results support the general hypothesis of Secord 
and Backman (1961). A person’s attraction to 
another depends upon the consistency between 
his self-evaluation and the manner in which 
another responds to him. 
FIELD DEPENDENCE AND EXTERNAL DIRECTEDNESS 


NORMA KONSTADT 
State University of New York, Downstate Medical Center, Brooklyn 
This experiment compared the performance of a routine clerical task by field- 
dependent and field-independent children as a function of changes in the 
human environment. A total of 38 children (20 boys and 18 girls), ½ of whom 
were field dependent and ½ field independent, received letter-cancellation tests 
alternately administered by an “approving” and a “disapproving” examiner. It 
had been hypothesized that field-dependent children would demonstrate a 
greater attentiveness to the attitudes of those around them (particularly to an 
authority figure) and therefore their performance of any kind of task would 
reflect the interjection of the experimental conditions to a significantly greater 
degree. A 3-way analysis of variance indicated a significant interaction be- 
tween perceptual style and the experimental conditions. A significantly poorer 
performance by field-dependent children under the disapproval condition was 
accompanied by a greater observed frequency of reference to the human 
environment. 


Witkin, Dyk, Faterson, Goodenough, and Karp 
(1962) propose “psychological differentiation” as 
a dimension of human functioning which cuts 
across performance in areas traditionally classi- 
fied as perception, intellectual functioning, social 
relationships, and personality. These authors sug- 
gest that in the realm of personality the “sense of 
separate identity” parallels the construct of per- 
ceptual field dependence in that each represents a 
different facet of the differentiation dimension. 
According to these formulations, greater atten- 
tiveness to the human environment observed 
among field-dependent individuals is occasioned 
by their continuing need for definition. Specifically, 
such people are concerned primarily with 
the impression they make—particularly upon 
those in authority—and are therefore constantly 
attuned to signs of approval and disapproval on 
which to pattern their behavior. 

Evidence of a relationship between individual 
variation in attending to the environment and 
field dependence comes from Crutchfield, Wood- 
worth, and Albrecht (1958) and DeVaris (1955 1). 
In both studies, results can be interpreted as 
reflecting a greater attentiveness to superficial 
identifying features which is implicit in the 
perceptual style of those with a global field 
approach. Furthermore, Beller’s (1958) study of 
nursery-school children suggests that the dy- 
namics of field dependence may be akin to the 
operation of dependency in the affective sense. 

While these experiments serve to suggest a 
structure of the sort proposed by Witkin et al., 
they do not demonstrate a direct relationship between 
the theorized process of articulation and 
the observed differences of attentiveness as a 
function of one’s perceptual style. While their 
results allow the inference of a greater reliance 
upon external cues by field-dependent individuals, 
they do not establish the actual motivation for 
such external directedness. The present experimenters 
reasoned that if people with a global 
field approach seek, and indeed require, evidence 
of approval in order to function effectively, their 
performance of any task should reflect experimentally 
induced changes in the human environment. 
Thus, we compared the effects of controlled, 
predetermined approval and disapproval 
factors upon the performance of a routine clerical 
task by field-dependent and field-independent 
children. 


1 Unpublished study entitled “Field Dependence 
and Self-Recognition.” 


We hypothesized that performance of such a 
task by field-dependent children would reflect the 
experimental interjection of approval and disapproval 
to a significantly greater degree than 
would the performance of the same tasks by 
field-independent children. It was also expected 
that field-dependent children would be more pronounced 
in their reference to the human environment 
during the test as measured by the number 
of times that each child actually looked at the 
people around him. 


METHOD 
Subjects 


Twenty boys and 18 girls were preselected from a 
group of 90 randomly chosen fourth graders who 
had received the Children’s Embedded Figures Test 
(CEFT); they represented equally the extremes of 
performance on that test, 10 boys and 9 girls who 
were field dependent and a like number classified 
as field independent. The high-scoring group, with 
a mean of 17.86 defined our field-independent 
subjects, while the low-scorers comprised our 
field-dependent group with a mean of 6.52. 
This preselection of subjects afforded maximal 
discriminability between groups. 

The subjects were selected and assigned to testing 
groups by someone not involved in the conduct of 
the experiment, enabling the experimenters to remain 
naive to the initial designation of subjects as either 
field dependent or field independent. 


Materials 


Children’s Embedded Figures Test (CEFT).2 The 
CEFT was used as the criterion for selection of 
field-dependent and field-independent subjects. It is 
a modification of the Children’s Embedded Figures 
Test (CHEF) developed by Goodenough and Eagle 
(1963). 

Letter Cancellation Test, Forms 1 and 2: The 
experimental measure. This task involves the location 
and cancellation of the letters a, t, and c each time 
they appear in a matrix of randomly ordered letters. 
Subjects received Forms 1 and 2 in varying order 
under each experimental condition. Both forms con- 
sisted of identical sets of letters; Form 2 was the 
reverse order of Form 1. 


Experimental Conditions: Disapproval and 
Approval 


The condition was defined by comments made 
by the experimenter at specified, frequent intervals 
during the test session. These comments represented 
positive and negative instances of the following five 
general statements which purportedly concerned the 
group of subjects, their general ability and intelligence, 
their understanding of the task, their neatness, 
speed, and cooperativeness. Thus, to convey approval, 
the examiner might state: “This is a very bright 
group, you are going to do well on this test,” or 
“You certainly have caught on faster than most 
children, I can see you will do well.” To convey disapproval, 
statements would be made to the effect 
that, “This doesn’t seem like the kind of group that 
can do well, but you might as well try as long as 
you’re here,” or “You’re not as fast as our other 
groups, don’t you want to cooperate?” 

An average of 15 such statements were made 
during each test session. 


2 The CEFT is available (free) from the publisher: 
Cognitive Tests, P.O. Box 4, Vanderveer Station, 
Brooklyn 10, New York. 
[...] is available gratis from the author. 
Procedure 


Subjects were tested in groups of five, made up 
of varying proportions of high- and low-CEFT 
scorers. Each group was tested once during the 
morning and once during the afternoon, receiving 
one condition per session. One experimenter assumed 
the role of an approving examiner for one test, and 
the other a disapproving examiner for the second 
test session. The experimenters alternated in the roles 
of approving and disapproving examiner. One half 
the subjects received the disapproval condition first; 
half received the approval condition first. 

A 10-minute time limit was set for each test 
session. Completion time was recorded for those 
subjects who finished in less than the allotted time. 

During the conduct of the experiment, the ex- 
aminer who was not immediately involved in the 
testing served as an observer and recorded the 
following information on a tally sheet: 


1. The performance of the current examiner: the 
number of approving (or disapproving) statements 
made by the examiner. (This allowed for subsequent 
determination of the degree to which the desired 
stimulus situation was interjected into the testing 
situation.) 

2. The subject’s behavior during testing: the num- 
ber of times each subject looked away from his 
paper and at another person in the room. 


At the end of the experiment, the performance of 
the examiners was compared and found to be ap- 
proximately equal in terms of frequency of making 
approving and disapproving comments. 

Test scores were derived on the basis of the num- 
ber of correct cancellations and the completion time; 
the score is the correct number of cancellations per 
minute. The number of incorrect cancellations was 
subtracted from the total of correct cancellations 
and then divided by the time. This deduction had a 
negligible effect on a subject’s score, except in the 
few instances where a subject was very careless and 
made numerous incorrect cancellations. 
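
The scoring rule just described amounts to a simple rate, sketched below. The function name and the sample figures are hypothetical illustrations and are not data from the experiment.

```python
def cancellation_score(correct, incorrect, minutes):
    """Net correct cancellations per minute: (correct - incorrect) / time."""
    if minutes <= 0:
        raise ValueError("completion time must be positive")
    return (correct - incorrect) / minutes


# Hypothetical subjects (not data from the study): one who used the full
# 10-minute limit and one who finished early with the same raw counts.
full_time_score = cancellation_score(correct=110, incorrect=2, minutes=10)
early_finish_score = cancellation_score(correct=110, incorrect=2, minutes=8)
print(full_time_score, early_finish_score)
```

Because the deduction for incorrect cancellations is applied before dividing by time, a few errors change the score very little, while finishing early raises it for the same number of cancellations.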

For purposes of statistical analysis, subjects were 
organized into four groups according to whether 
they were field dependent or field independent, and 
whether they had received the disapproval or the 
approval condition first. 


RESULTS AND DISCUSSION 


The differential responsiveness of field-depend- 
ent and field-independent subjects to the experi- 
mental conditions of approval and disapproval, 
as reflected in their performance of a routine 
clerical task, was the primary question in the 
present study. The mean number of correct 
cancellations per minute for each group and 
condition is reported in Table 1. 

A generally poorer performance by field- 
dependent subjects under the disapproval condi- 
tion is immediately apparent from the table. 
The distinction becomes even more acute when 
TABLE 1 


MEAN NUMBER OF CORRECT CANCELLATIONS PER 
MINUTE FOR EACH GROUP AND CONDITION 


                        Disapproval    Approval 
Disapproval first 
    High CEFT              10.39        11.83 
    Low CEFT                8.74        11.55 
Approval first 
    High CEFT              11.29         8.71 
    Low CEFT                8.18        11.31 

it is observed that the general tendency to im- 
prove (with practice) on the second test session 
is actually reversed in the instances where field- 
dependent children received disapproval after 
approval. A three-way analysis of variance was 
performed to evaluate the significance of these 
effects. The analysis indicated that the following 
effects contributed significantly to the variation 
among the groups: Perceptual Group (field 
dependent-independent) X Condition interaction, 
F (df=1/34) = 41.342, indicating that the 
greater disparity in performance observed among 
field-dependent subjects between the experimental 
conditions is statistically significant, that is, they 
were more seriously hindered in their perform- 
ance when the examiner expressed disapproval; 
the experimental conditions of approval and dis- 
approval, F (df = 1/34) = 16.375, reflecting the 
tendency of all subjects to be hindered by ex- 
aminer disapproval; Conditions X Order inter- 
action, F (df =1/34)= 12.967, indicating a 
specific interaction between these factors; and 
the three-way interaction between field-depend- 
ence/independence, the order of presentation of 
the conditions, and the conditions themselves, 
F (df = 1/34) = 12.428. The F ratios of these 
effects are all significant beyond the .01 level. 
In contrast, neither the perceptual group (field 
dependent-independent) nor the order in which 
subjects received the conditions, per se, proved 
to be a significant factor in performance, and 
their F ratios ranged about the magnitude of 1. 
It can thus be assumed that there was nothing 
inherent in the cancellation test which, in the 
absence of the experimental condition, would 
cause field-dependent subjects to perform differently 
from field-independent subjects. 

From these results, it is reasoned that subjects 
with a global field approach exhibited a 
greater sensitivity to the human environment as 
reflected by their pronounced reaction to experi- 
mentally introduced changes therein. Coupled 
with the analysis of variance findings, the ob- 
served mean differences (Table 1) take on a 
statistical significance. The initial hypothesis that 
field-dependent children pay more attention to 
the human environment is supported by the data 
of the present experiment.3 
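
The pattern in the cell means of Table 1 can be made explicit with a short sketch that computes, for each group, the change from the first to the second session and the difference between the approval and disapproval conditions. It works only from the published cell means, so it illustrates the direction of the effects and does not reproduce the analysis of variance; the dictionary layout and function names are assumptions made for the illustration.

```python
# Cell means from Table 1 (correct cancellations per minute).
# High CEFT = field independent; low CEFT = field dependent.
TABLE_1_MEANS = {
    ("disapproval first", "high CEFT"): {"disapproval": 10.39, "approval": 11.83},
    ("disapproval first", "low CEFT"): {"disapproval": 8.74, "approval": 11.55},
    ("approval first", "high CEFT"): {"disapproval": 11.29, "approval": 8.71},
    ("approval first", "low CEFT"): {"disapproval": 8.18, "approval": 11.31},
}


def second_session_gain(order, cell):
    """Second-session mean minus first-session mean for one cell."""
    if order == "disapproval first":
        first, second = "disapproval", "approval"
    else:
        first, second = "approval", "disapproval"
    return round(cell[second] - cell[first], 2)


def approval_advantage(cell):
    """Approval mean minus disapproval mean for one cell."""
    return round(cell["approval"] - cell["disapproval"], 2)


gains = {key: second_session_gain(key[0], cell) for key, cell in TABLE_1_MEANS.items()}
advantages = {key: approval_advantage(cell) for key, cell in TABLE_1_MEANS.items()}

for key in TABLE_1_MEANS:
    print(key, "gain on second session:", gains[key],
          "| approval minus disapproval:", advantages[key])
```

Three of the four cells show a gain on the second session; the exception is the low-CEFT group that received disapproval after approval, whose mean fell by about 3 cancellations per minute.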

In line with the second hypothesis concerning 
the parallel between the formulations of Witkin 
et al. (1962) on perceptual field dependence and 
Beller’s (1958) observations on affective de- 
pendency, comparisons were made of the number 
of times members of each group gazed at some 
other person in the room. An analysis of variance 
was performed upon the recorded tallies of the 
number of “looks” per subject. Here, the analysis 
revealed a single significant effect. The order in 
which the conditions (disapproval and approval) 
were presented, differentially affected field- 
dependent and field-independent subjects; F (df 
= 1/34; CEFT Score X Order)= 4.58, p<.05. 
Field-dependent subjects tended most to gaze 
at the people about them under the disapproval 
condition when it was given after approval. 

With regard to the general impact of disap- 
proval and approval upon each group, the com- 
bined tallies (Perceptual Group X Condition) 
failed to show a significant effect in the analysis. 
However, examination of the separate (condition) 
means for each perceptual group suggests that 
field-dependent subjects gazed at people more 
frequently than field-independent subjects under 
the disapproval condition. The mean number of 
“looks” for the former group was 4.42 as compared 
with a mean of 2.78 for the latter; t = 
2.05, p < .05. A similar tabulation for the approval 
condition yielded no significant differences; 
the mean for the former group was 3.42, while 
that for the latter group was 3.47. 

These data on the visible signs of dependency 
showed a significant difference in the frequency 
with which field-dependent children gazed at the 
human environment under the disapproval con- 
dition. We interpret this as an attempt by them 
to monitor their behavior in terms of external 
cues when the emotional climate became un- 
pleasant. 

The parallel that can be drawn between the 
cancellation test data and the recorded instances 
of observable dependent behavior is also of interest. 
It was under the disapproval condition that 
both measures reflected the differential responsiveness 
of the field-dependent group. It appears 
that disapproval hindered these children in their 
performance and suggests that they required a 
favorable emotional climate to function effectively. 
In contrast, the interjection of disapproval 
had no such effect on the behavior of field-independent 
children who were hypothesized to 
require less external definition. 


3 Further support for this hypothesis comes from 
an unpublished study by Stark, Parker, and Iverson 
cited by Witkin, Dyk, Faterson, Goodenough, and 
Karp (1962). Using “ego-enhancing” and “ego-threatening” 
instructions to introduce a relatively 
simple task, these authors found that subjects with 
a global field approach tended to be influenced by 
the examiner’s attitude toward them, while subjects 
with an analytical approach were unaffected by the 
instructions. 

The major finding of this study, namely, that 
disapproval had a differential effect upon field- 
dependent and field-independent subjects, sup- 
ports the formulations made by Witkin et al. 
distinguishing between individuals who rely upon 
an internal frame of reference and those who 
seek external definition of a situation. In ad-
dition, it provides more specific information on
the kind of situations which are most likely to 
foster the need for external direction. 

By demonstrating that subjects who were 
classified on the basis of a perceptual task 
(CEFT) will differ in their performance of a 
routine clerical task as a function of the emo-
tional environment, we have provided some evi-
dence for the hypothesized operation of a
“differentiation” dimension across behavioral 
modalities. 

TEST ANXIETY, SEX, AND EXPERIMENTAL CONDITIONS IN
RELATION TO ANAGRAM SOLUTION ¹


DOUGLAS G. RUSSELL AND IRWIN G. SARASON
University of Washington 


The performance of 48 males and 48 females, half with high and half with low
scores on a test-anxiety scale, was studied using a difficult anagram solution
task. There were 3 experimental variables: (a) differential motivating in-
structions, (b) verbalization vs. nonverbalization by S during performance,
and (c) order of presentation of anagrams. Low test-anxious Ss performed at
a higher level than did high test-anxious Ss. This difference was particularly
strong in the case of females. The variable of verbalization vs. nonverbaliza-
tion interacted significantly with sex and test anxiety. The results were dis-
cussed in terms of anxiety conceived as a class of interfering responses and the
desirability of incorporating organismic variables such as test anxiety and sex
in research on problem solving.


¹ This research was supported in part by grants
(M3889) from the National Institute of Mental
Health, United States Public Health Service, and
the University of Washington Graduate School Re-
search Fund.


The present experiment was concerned with
the effects of four variables on the speed of solu-
tion of anagrams. These were subjects’ test-
anxiety scores, sex, and two experimental con-
ditions. One of these conditions was differential
motivating instructions and the other was whether
or not subjects verbalized during performance.
Half of the subjects were encouraged to verbalize
while working on anagrams. The remaining sub-
jects were not encouraged to do this.

This experiment grew out of a previous one by 
Crager (1959). Crager, using a procedure com- 
parable to ours, found that under a verbalizing 
condition, high and low test-anxious males did 
not differ significantly in performance. How-
ever, he found that for females, high test-anxious
subjects were markedly inferior in performance
to low test-anxious subjects. Since this Test
Anxiety × Sex interaction might be due to some aspect
of the verbalizing condition, it was felt necessary
to determine whether or not this interaction
would also be obtained for a situation in which
subjects are not asked to verbalize.

Of relevance, also, to the present study, was 
an anagram experiment by Sarason (1961), who 
found that subjects differing in test anxiety re-
sponded differently to preliminary motivating in-
structions. Since this result was obtained using
group administration, it seemed of interest to see 
if the finding could be replicated in the case of 
individual administrations of the anagram task. 


METHOD 
Experimental Design 


A five-factor analysis of variance design was em-
ployed. The variables were: high and low scores on
the Test Anxiety Scale (TAS; Sarason, 1958;
Sarason & Ganzer, 1962), male and female subjects,
high motivational and neutral instructions, verbaliz-
ing and nonverbalizing conditions, and three orders of
presentation of the anagrams.


Subjects 


The subjects were drawn from a pool of 239 
students in introductory psychology courses at the 
University of Washington to whom the TAS had 
been administered. On the basis of the TAS, four 
groups were selected: 24 high TAS males, 24 high 
TAS females, 24 low TAS males, and 24 low TAS 
females. The subjects were then asked to participate
in the experiment; all of them volunteered. The sub-
jects were randomly assigned to the experimental
conditions by someone other than the experimenter,
so that the experimenter had no knowledge of the
TAS scores of any of the subjects. The 48 high
TAS subjects had a mean score of 10.23, with an
SD of 2.39. The 48 low TAS subjects had a mean
score of .84, with an SD of .65.


Apparatus and Task 


The 10 anagrams used were the same as those 
used by Sargent (1940) and Crager (1959). The 
order of presentation of the anagrams employed by 
Crager was one of the three orders used in the 
present study. Two others were drawn by chance.

Each of the subjects sat at a table and was
separated from the experimenter by a screen. The
screen had a hole in it, but was so constructed that
the experimenter could slide out each anagram and
recover it after solution, and the subject saw only
the experimenter’s hand. During the solution process
the subject could not see the experimenter, but the
experimenter could watch the subject’s hands. Each
anagram consisted of six or seven green plastic, 
$-inch squares, with black letters on them, from a 
regular anagram game set. The subject could thus 
move the letters about at will. 


Procedure

The following preliminary instructions were used:


This research is concerned with problem solving.
The problems I will give you to solve are called
anagrams. An anagram is a word with the letters
all mixed up, and your task is to make a meaning-
ful word. Only one word can be made from each
anagram and they are all common words. Here is
an example [RTSEO is placed before the subject].
If these are rearranged [the experimenter re-
arranges them], they spell the word STORE [the
experimenter withdraws the letters].


The nonverbalizing subjects were then told: 


What I am trying to find out is how you solve 
anagrams. I am trying to find out the mental 
process you are using all the time you are work- 
ing on the anagrams. To do this I will be able to 
watch your hands as you move the letters about 
as you solve the anagrams. 


The verbalizing subjects were told: 


What I am trying to find out is how you solve 
anagrams. I am trying to find out the mental 
process you are using all the time you are working 
on the anagrams. The only way I can find this out 
is for you to tell me, so I want you to do some- 
thing a little different from what you are used to. 
I want you to talk out loud ALL THE TIME you 
are working on each anagram. Do not be con- 
cerned about what you say. The important thing 
to do is to keep talking out loud. Tell me what 
you are doing, how you are trying to solve the 
anagram, and anything else that comes to mind 
all the time you are working. REMEMBER TO 
KEEP TALKING. This is the only way I can 
get this information. 


After the preliminary instructions and the proper 
verbalization instructions, the subjects were given 
the motivational instructions. This applied to half of
the subjects only, since the other half received no 
motivating instructions at all. High motivating 
instructions were: 


The anagrams I am going to give you are a
measure of general intelligence and the ability to
think in abstract terms. College freshmen have
been found to solve these particular anagrams
quickly and with little difficulty, so you should
have little trouble. I purposely selected easy ones
for college students so I could get at the mental
process you use.


At this point all subjects were told: 


When you know the solution and have spelled it 
out with the letters, pronounce it for me. 


The nonverbalizing subjects were then told:


Here is the first anagram. Start working.


The verbalizing subjects were told:


Here is the first anagram. Start working and
REMEMBER TO KEEP TALKING ALL THE
TIME YOU ARE WORKING. Tell me what you
are doing, how you are trying to solve the
anagram, and anything else that comes to mind
all the time you are working. Start talking.


After each anagram was solved the verbalizers
were asked:


Would you now summarize how you solved this
anagram beginning with when you started working
on it and also tell me any reactions you had to it?


When the summary was finished, the experimenter
removed the word and pushed the next anagram
out saying:


Here is the next one, start talking.


When the nonverbalizer finished each anagram, it
was removed, and the experimenter pushed the next
one in front of him saying:


Here is the next one, start working.


If the verbalizing subject did not talk for 1 
minute of the first anagram, he was told: 


Please keep talking during your solution. [No 
further reminders were given.] 


If, after 4 minutes, the subject had not solved 
the anagram, the experimenter told him the first 
letter of the word; at 5 minutes, the second letter; 
at 6 minutes, the third, etc., until the solution was 
reached. The criterion for solution was correct 
recognition of the word as shown by spelling and 
pronouncing it correctly. Errors of spelling were
ignored when it was apparent that the subject knew 
what the word was. 
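
The cue schedule just described can be sketched as a small function. This is only an illustration: the function name, the cap at the word's length, and the treatment of a solution at exactly 4 minutes as already having received the first cue are assumptions, not details given in the report.

```python
def cues_given(solution_seconds, word_length):
    """Number of letter cues received by the time the anagram is solved.

    Assumed reading of the procedure: the first letter is given at
    4 minutes, one further letter at each following minute, and the
    experimenter cannot give more cues than the word has letters.
    """
    first_cue_at = 4 * 60   # seconds before the first letter is given
    cue_interval = 60       # seconds between successive letters
    if solution_seconds < first_cue_at:
        return 0
    cues = 1 + int((solution_seconds - first_cue_at) // cue_interval)
    return min(cues, word_length)


# A subject who solves a six-letter anagram after 5 min 30 s
# would have been told the first and second letters.
print(cues_given(330, 6))
```

Because each additional minute of working time adds one cue under this reading, a high correlation between solution time and number of cues is to be expected.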


Scoring 


The subjects’ protocols were scored for both the 
total number of seconds needed to solve the 10 
anagrams, and for the number of cues given. The 
time was measured from the presentation of the ana-
gram until the subjects recognized the word and
spelled it out. The time for the summaries, or for 
changing anagrams, was not counted. 


RESULTS


Bartlett’s test of homogeneity of variance was 
computed for the time of solution scores in 
seconds and the chi-square was not significant 
(p=.609); therefore, homogeneity of variance 
was assumed to be present. The product-moment 
correlation coefficient between the time of solu-
tion scores and the number of cues scores was
.93, and so it was felt necessary to make only
the analysis for the time of solution scores. 

An analysis of variance was performed on the 
subjects’ time of solution scores. The mean time 
of solution for the 96 subjects was 1873 seconds. 
The variable of TAS scores was significant at
the .056 level. High TAS subjects had a mean
solution time of 1988 seconds; low TAS subjects
had a mean solution time of 1758 seconds. The TAS
× Sex interaction was significant at the .025
level. High and low TAS males had mean solu-
tion times of 1874 seconds and 1893 seconds,
respectively; high and low TAS females had
means of 2102 seconds and 1624 seconds, respec-
tively. The verbalization variable per se did not
affect performance. However, the TAS × Sex
× Verbalization interaction was significant at the
.025 level. Table 1 presents the means and stand-
ard deviations of the groups involved in this
interaction. This interaction was due, to a con-
siderable degree, to an interaction between
TAS and Sex under the verbalizing condition
(p < .01), but not under the nonverbalizing
condition. This interaction stemmed from the
superiority at the .05 level of high TAS male
verbalizers (M = 1760) to high TAS female
verbalizers (M = 2273), the superiority at the
.05 level of low TAS female verbalizers
(M = 1538) to low TAS male verbalizers
(M = 2115), and the superiority at the .01 level
of low TAS female verbalizers to high TAS
female verbalizers.


TABLE 1

MEAN SOLUTION TIMES (IN SECONDS) AS A FUNCTION OF
TAS, SEX, AND THE VERBALIZATION CONDITION

             Verbalizers          Nonverbalizers
Group        Males    Females     Males    Females
High TAS     1760     2273        1989     1932
Low TAS      2115     1538        1672     1710

Looking at this interaction by levels of TAS, 
it is seen that for low TAS subjects, male
verbalizers seemed to be the slowest solvers of
anagrams. For high TAS subjects, female ver-
balizers turned in the poorest performance. The
Sex × Verbalization × Order interaction, which
was significant at the .015 level, was due to the
superiority at the .03 level of male nonverbalizers 
for Order 1 (M = 1460, SD = 520) to male non- 
verbalizers for Order 2 (M = 2126, SD = 669). 
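
As a check on the arithmetic, the marginal means quoted in the text can be recovered from the eight cell means of Table 1. The sketch below assumes equal numbers of subjects per cell, which the balanced design suggests but the report does not state outright, so simple unweighted averages are used. The names are illustrative only.

```python
# Cell means (seconds) from Table 1, keyed by (TAS level, sex, condition).
cell_means = {
    ("high", "male", "verbal"): 1760,
    ("high", "female", "verbal"): 2273,
    ("high", "male", "nonverbal"): 1989,
    ("high", "female", "nonverbal"): 1932,
    ("low", "male", "verbal"): 2115,
    ("low", "female", "verbal"): 1538,
    ("low", "male", "nonverbal"): 1672,
    ("low", "female", "nonverbal"): 1710,
}


def marginal_mean(tas=None, sex=None):
    """Unweighted mean of the cells matching the given TAS level and/or sex."""
    values = [
        mean
        for (cell_tas, cell_sex, _), mean in cell_means.items()
        if (tas is None or cell_tas == tas) and (sex is None or cell_sex == sex)
    ]
    return sum(values) / len(values)


# Means as quoted in the text (seconds).
reported = {
    ("high", None): 1988,
    ("low", None): 1758,
    ("high", "male"): 1874,
    ("low", "male"): 1893,
    ("high", "female"): 2102,
    ("low", "female"): 1624,
    (None, None): 1873,
}

for (tas, sex), quoted in reported.items():
    print(tas, sex, marginal_mean(tas, sex), quoted)
```

Under this assumption every recomputed mean agrees with the quoted figure to within one second.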


DISCUSSION


Our results and Crager’s (1959) both show
that high TAS females perform more poorly than
low TAS women or men whether high or low
in TAS. Comparing the results for our verbalizers
with Crager’s verbalizers, there is agreement that
high TAS females perform at a lower level than
do low TAS females. This suggests that the
verbalizing condition may have exacerbated the
tendency of high TAS females to emit inter-
fering responses on a difficult task (Sarason,
1960; Sarason & Palola, 1960). We found, but
Crager did not, that low TAS males performed
at a lower level than high TAS males. This
result does not appear to be compatible with a
response interference interpretation of anxiety
(Sarason, 1960; Sarason & Palola, 1960).

On the basis of Sarason’s (1961) study, one 
might have anticipated an interaction between 
the TAS and instruction variables. While there 
are many possible explanations for the lack of 
significance of this interaction in the present 
study, perhaps the most prudent conclusion is 
that investigators should exert considerable 
caution in generalizing over tasks and situations 
about the interactive effects of anxiety and 
experimental conditions. On the other hand, the 
present significant findings involving TAS and 
sex would suggest the desirability of attending 
more in future research to the roles of these 
organismic characteristics in the study of 
problem solving. 

3 unsuccessful attempts were made to replicate the Eriksen-Kuethe “paradigm
of repression.” This paradigm consisted of repetitive administrations of a
15-item word-association list with Ss being shocked for associations to
5 critical stimuli which were determined randomly for each S. The contingency
for shock avoidance was changing initial associations to critical stimuli on 
subsequent trials. An additional experiment was undertaken in which familiarity 
of the critical stimuli was systematically varied to determine the effects on 
response-time functions and postexperimental verbalizations. Both response- 
time and postexperimental verbalizations were affected by familiarity in a 
direction consistent with the interpretation that familiarity had been con- 
founded with verbalization and response time in the original study. 


Several years ago, Eriksen and Kuethe (1956) 
reported the successful avoidance conditioning of 
verbal responses without awareness. The experi- 
mental design consisted of repeated administra- 
tions of a 15-item word-association list. Associa- 
tions to five of the words (critical stimuli) were 
followed by shock on the first trial and were
shocked each time they appeared as associations
to these stimuli on subsequent trials. Critical
stimuli were selected randomly for each subject.
The subjects were instructed that they would
be shocked for slow responses and for another
reason which they could discover on their own.
The only contingency actually leading to shock
was repetition of first trial associations. After
the conditioning trials, subjects were interviewed to
determine their level of awareness.

[…] Mental Health, United States Public Health Service.

On the basis of the interviews, approximately
one-third of the subjects were classified as having 
a high level of awareness and approximately one- 
third were classified as having a low level of 
awareness. High-awareness subjects were able to 
state the “real” basis of the shock, reported 
deliberately withholding punished associations 
and replacing them with other responses, and 
identified at least three of the critical stimuli. 
Low-awareness subjects were unable to describe 
any basis for the shock other than the mis- 
information contained in the instructions, re- 
ported not deliberately withholding associations, 
and were unable to identify more than one of the 
critical stimuli. 

High- and low-awareness subjects learned to 
avoid the shock equally well. However, response 
times of high-awareness subjects increased on 
early trials, while those of low-awareness subjects 
decreased as a function of trials. The interview 
and response-time data were interpreted as in- 
dicating conscious suppression of punished re- 
sponses in high-awareness subjects and repres- 
sion of punished responses in low-awareness 
subjects. 

A series of replication studies were undertaken 
by the present investigators. All subjects in these 
experiments were males drawn from introductory 
psychology classes at Syracuse University. In the 
first experiment (N = 28), the Eriksen-Kuethe 
procedures were duplicated as closely as pos- 
sible. In the second experiment (N = 43) the 
postexperimental interview was elaborated. In the 
third (N = 10), instructions referring to a second
basis for shock were omitted. 

From the total of 81 subjects in the three
experiments, 4 in the second and 4 in the third
could be considered as low awareness by the
Eriksen-Kuethe criteria. None of these subjects
reached the learning criterion in 10 trials, how-
ever. While many of the subjects who learned
did not satisfy the high-awareness criteria, they
were all able to describe strategies which were
effective for shock avoidance. It seemed reason-
able to consider these as “correlated” hypotheses
(Adams, 1957). Three experiments were subse-
quently reported which also failed to obtain a
sufficient number of low-awareness subjects for
meaningful comparative purposes (Jantz, 1959;
Kuethe & Wunderlich, 1959; Teece, Commins, &
Smith, 1960).

Eriksen and Kuethe’s interpretation of the
relationship between awareness and response time
is open to question because the particular words
of the list which were critical were not the same
for all subjects, and because the stimulus words
differed with respect to at least one dimension,
familiarity, known to be related to response time
(Hall & Ugelow, 1957). The random selection
of critical stimuli for each subject would very
likely produce different combinations of critical
words for different subjects, thus providing the
opportunity for varying reinforcement of the
speed set and of the tendency to report the speed
contingency in the postexperimental interview.
Also, subjects who receive positive reinforcement
for the speed set should tend to increase their
speed of response more than subjects who receive
negative reinforcement for speed. Since verbaliza-
tion of speed as a basis for shock apparently
differed between Eriksen and Kuethe’s high- and
low-awareness groups, it might be inferred that
the groups differed in the extent to which they
were actually shocked for fast and slow re-
sponses. On the first trial, low-awareness sub-
jects responded relatively slower while high-
awareness subjects responded relatively faster to
critical stimuli.


TABLE 1
STIMULUS WORDS AND FAMILIARITY RATINGS

High               Medium              Low
GARMENT (40)       TYPHOON (1)         PERCEPT (1)
WAGON (50)         RURAL (20)          SPOTTY
HOT (100+)         UNIT (29)           NAPHTHA
KITCHEN (100+)     HUNGER (37)         RAMPART (4)
TALL (100+)        STRAIGHT (100+)     DISCREET (5)

Note.—The familiarity values refer to the number of oc-
currences per million words of printed text (Thorndike &
Lorge, 1944).

In the following experiment, the attempt was 
made to manipulate shock for fast and slow
responses by controlling for the familiarity of 
the critical stimuli. 


METHOD 

Stimulus Words 

The 15 stimulus words were selected from the
Thorndike and Lorge (1944) list, with 5 high-, 5
medium-, and 5 low-familiar words chosen, on the
basis of preliminary testing, to elicit consistently short,
medium, and long response times, respectively, in a
standard word-association situation. Nine random
orders of the list were constructed. The stimulus
words and familiarity are presented in Table 1.


Procedure 
The subjects were assigned randomly to two
groups. The critical stimuli were the five high-
familiar words for one group, the five low-familiar
words for the other. The subjects were told that
they would be shocked if an association to a stimulus
word were too slow for t[…]
practice and for another reason that the experimenter
could not divulge in advance but which might be
discovered during the session. All subjects were run
to a criterion of two consecutive shockless trials
or to a total of nine trials; all were run a minimum
of four trials.
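
The stopping rule for the conditioning session can be written out as a short sketch. The reading adopted here is an assumption: a subject who reaches two consecutive shockless trials before Trial 4 is still run through Trial 4 and then stopped. The report does not say whether the criterion had to be met again after the minimum. The function name and input format are illustrative.

```python
def trials_run(shocked, min_trials=4, max_trials=9):
    """Return the number of trials a subject would be run.

    shocked[i] is True if any shock was delivered on trial i + 1.
    Assumed rule: stop once two consecutive shockless trials have
    occurred, but never before min_trials and never after max_trials.
    """
    for n in range(2, max_trials + 1):
        if n > len(shocked):
            raise ValueError("not enough trial records to decide")
        if not shocked[n - 1] and not shocked[n - 2]:
            return max(n, min_trials)
    return max_trials


# Shocked on Trials 1-3, shockless on Trials 4 and 5: session ends after Trial 5.
print(trials_run([True, True, True, False, False]))
```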

Immediately following the conditioning trials 
subjects were asked the following questions: What 
seemed to be the reason for the shock? What effect 
did your speed have on whether or not you’d be 
shocked? If you were going to do this experiment 
all over again and knowing what you know about 
it now, how would you go about avoiding the 
shock? There were 22 male subjects in each group, 
drawn from the introductory psychology course at 
Syracuse University. There were two experimenters, 
one of whom read the stimulus words, manually 
operated a Standard Electric timer and recorded 
response times, while the other administered shock 
and recorded responses. The shock was administered 
to the right ankle through an inductorium activated 
by a 3-volt dc source. The intensity of the shock
was set just below the level described by the subject 
as intolerable. 


RESULTS AND DISCUSSION 


Of the 22 subjects with low-familiar critical 
stimuli, 15 reported at least some belief in the 
speed contingency in the postexperimental inter-
view compared to 6 of the 22 subjects with high-
familiar critical stimuli (χ² = 5.83, p < .02). All
subjects reported “correct” or correlated hy- 
potheses and none was judged to have a low 
level of awareness. 
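
The reported chi-square can be reproduced from the two frequencies given above (15 of 22 versus 6 of 22). The sketch below assumes a 2 × 2 test with Yates' continuity correction; the report does not name the correction, but the corrected statistic matches the printed value while the uncorrected one does not. Function names are illustrative.

```python
import math


def chi_square_2x2(a, b, c, d, yates=True):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] (1 degree of freedom)."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Yates' continuity correction: shrink |ad - bc| by n / 2.
        diff = max(diff - n / 2, 0.0)
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * diff ** 2 / denom


def p_value_df1(chi2):
    """Upper-tail probability of a chi-square variate with 1 df."""
    return math.erfc(math.sqrt(chi2 / 2))


# Rows: low-familiar and high-familiar groups.
# Columns: reported belief in the speed contingency, did not report it.
corrected = chi_square_2x2(15, 7, 6, 16)
uncorrected = chi_square_2x2(15, 7, 6, 16, yates=False)
print(round(corrected, 2), round(uncorrected, 2), p_value_df1(corrected))
```

The corrected statistic rounds to 5.83, with a probability between .01 and .02, consistent with the values in the text.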

The median response time to the critical 
stimuli was calculated for each subject for each
trial. The means of these medians, together with 
the comparable Eriksen-Kuethe data are pre- 
sented in Figure 1. The decreasing response times 
to the low-familiar stimuli and the increasing 
times to the high-familiar stimuli correspond to 
the response time functions of the Eriksen-
Kuethe low- and high-awareness groups, respectively.
However, the position of the response-time
curves based on familiarity are displaced toward
the extremes relative to the Eriksen-Kuethe
awareness curves. This difference in position of
the curves appears to be due to the relatively
longer and shorter response times associated
with the extremes of stimulus familiarity used
in the present experiment, as well as the probably
greater homogeneity of the groups with respect
to stimulus familiarity.

A mixed analysis of variance design performed
on the response-time data yielded significant
effects due to familiarity (F = 4.567, df = 1/[…],
p < .05), and to the interaction Familiarity
× Trials (F = 6.150, df = 3/126, p < .001). This
analysis indicates that the differences between
the curves of the high- and low-familiar critical
stimuli are reliable.

First trial associations to critical stimuli were
repeated significantly less on succeeding trials
than associations to noncritical stimuli and there
was significantly greater variability on succeeding
trials to critical than to noncritical stimuli. There
were no significant relationships between famili-
arity of the critical stimuli and the following
measures: the number of first trial associations
repeated on succeeding trials, number of different
associations given on subsequent trials, or the
number of trials to criterion.

Eriksen and Kuethe found that subjects clas-
sified as high awareness differed from subjects
classified as low awareness in their response
latencies to critical stimuli. One of the inter-
view variables differentiating low- from high-
awareness subjects was retention of the instruc-
tional speed set. The present results indicate that
retention of the speed set and response time
are both related to properties of the critical
stimuli. Since the critical stimuli were not con-
trolled in the Eriksen and Kuethe study, it is
possible that the relationship between these two
dependent variables reflected the relationship of
each to the uncontrolled stimulus variable. Of
course, familiarity of critical stimuli is only one
of several possible uncontrolled variables which
could have been associated with the awareness
groups.

[…]ness have been compared on many other vari-
ables and in several instances given the […]
of experimental groups with awareness […]
treated as an independent and antecedent vari-
able. When awareness is inferred from […]
variables, it is important to insure that […]
in the response are not a function of stimulus
variables. Unfortunately such variations are
not usually recoverable since they are seldom
reported.

TEST ANXIETY AND EXPERIMENTAL CONDITIONS ¹

[…] during learning. […] and error scores, showed […]
variables manipulated. It was concluded that more […]
to conceptualizing the psychological experiment as a social situation.


Previous research in the area of anxiety and
serial learning suggests that subjects differing in
test anxiety respond differentially to experimental
instructions given to them prior to performance
(Sarason, 1960). The performance of high test-
anxious individuals seems to be more disrupted
by high motivational or personally threatening
conditions than is the performance of other indi-
viduals in the test-anxiety score distribution. Un-
der neutral or reassuring conditions, differences in
performance between subjects differing in test 
anxiety have been found in several instances to 
be negligible or reversed, in the sense that, in 
one study, reassurance had a greater facilitative 
effect on high than low test-anxious subjects. 

In the present experiment, in addition to study-
ing performance as a function of instructions
given prior to performance, a second experimental
variable was also introduced during perform-
ance. This variable was defined by brief com-
ments made by the experimenter at certain points
in learning. Half of the subjects were told on four
occasions that they were doing a good job in
learning the material presented to them; the re-
maining subjects were urged to improve their
performance and were given the impression that
their performance was not adequate. On the basis
of theoretical considerations and previous re-
search, we hypothesized that the learning of high
test-anxious subjects would be facilitated by the
first of these two conditions and hindered by the
second. The reverse of this was antici-
pated for low test-anxiety scorers.


¹ This research was supported by Grant […]
from the National Institute of Mental Health, United
States Public Health Service.

The experimental design of this study involved
five variables. Two of these, test anxiety and sex,
were organismic factors associated with the sub-
jects. Another variable was that of the experi-
menter. There were two manipulated experi-
mental variables, preliminary instructions and
the experimenter’s comments to the subject dur-
ing performance. One feature of the treatment
of the data was that in its analysis attention was 
paid not only to the number of correct responses, 
but also to other categories of subjects’ perform- 
ance. Specifically, we were interested in compar- 
ing the various groups in the study with respect 
to anticipatory errors, i.e., errors made by the 
subject in which items in the serial learning list 
were misplaced by him; intrusive errors, i.e., 
errors in which the subject responded with items 
not on the list; and nonresponses. With respect 
to the last named variable it seemed possible that 
the inhibiting effects of anxiety might keep high 
test-anxious subjects from making guesses early 
in performance, and, thus, retard their perform- 
ance. 


METHOD 
Subjects 


The subjects were 144 high-school sophomores se- 
lected from a sample of 379 students. There were 72 
males and 72 females. They were divided into high 
(upper 25%), middle (middle 30%), and low (lower 
31%) test-anxiety groups on the basis of their scores 
on the Test Anxiety Scale (TAS; Sarason, 1958b; 
Sarason & Ganzer, 1962). The TAS had been ad- 
ministered prior to, and independent of, the subject’s 
participation in the serial learning experiment. 

High TAS subjects had scores ranging from 10 to 
14; middle TAS scorers were either 6 or 7; and low 
scorers ranged from 0 to 4. The mean scores were
11.71, 6.63, and 1.92 for the high, middle, and low
TAS groups, respectively.
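
The score ranges just given can be turned into a small grouping rule. This sketch treats the observed ranges as if they were the selection cutoffs, which is an assumption, since the report describes the groups by percentage of the distribution. Scores of 5, 8, and 9 fall between the stated ranges and are left unassigned. The function name is illustrative.

```python
def tas_group(score):
    """Assign a TAS score to a group using the score ranges stated in the text.

    Returns None for scores that fall between the stated ranges
    (5, 8, 9) or outside them altogether.
    """
    if 10 <= score <= 14:
        return "high"
    if score in (6, 7):
        return "middle"
    if 0 <= score <= 4:
        return "low"
    return None


print([tas_group(s) for s in (2, 5, 6, 9, 12)])
```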


Experimenters 


The experimenters were second-year psychology 
graduate students. One experimenter was male and
the other was female. 


Materials 


The subjects learned the verbal material presented 
to them by means of the serial anticipation method. 
The two lists administered to each subject each consisted
of 17 dissyllable words of high difficulty level as defined
by Noble’s (1952) m values. Each subject received 16
trials on each of the lists. The first of the two lists
had a mean m value of 1.55 with a range of .99 to
2.57. The second list had a mean m value of 1.61
with a range of 1.04 to 2.28.


Procedure 


After receiving the necessary preliminary instruc-
tions for performance on the serial learning task, the
subjects were divided into three instructional groups. 
A control or neutral group received no further in- 
structions prior to performance. One experimental 
group was administered high achievement-orienting 
instructions which were essentially the same as those 
used by Sarason (1956) in a previous study. These
instructions stated that achievement on the learning
task was a measure of intelligence. The third instruc-
tional condition, which we shall refer to as a low
achievement-orienting one, was similar to instruc-
tions used in a previous study (Sarason, 1958a).
These instructions seemed to be reassuring in na-
ture, and informed the subject that the task on which
he was performing was a difficult one and that, there-
fore, progress on it would be slow. Consequently, the
subject was told, there should be no concern about
performance level in working on the list.

The other experimental variable, that of the ex- 
perimenter’s comments during performance, was de- 
fined by two conditions which we have labeled; 
“good” and “try harder.” The events which occurred: 
for subjects run under the good condition were as 
follows: y 

1. In the intertrial interval following the first cor- 
rect response, the experimenter commented to the 
subject: “You got that time. That’s a good 
start.” È 

2. Following Trial 1 on which the subject im- 
proved his performance by at least one correct re- 
sponse, the experimenter commented: “You're doing 
fine.” 

3. The next increase in the subject’s score was fol- 
lowed by: “Very, very good.” 

4. The next increase was followed by the experi- 
menter’s comment: “You are doing a very excellent 
job.” 

5. At the conclusion of the last trial, Trial 16, the 
experimenter commented: “You performed at a very 
high level.” 

The procedure concerning the try-harder condition
was as follows:

1. In the intertrial interval following Trial 2, the
experimenter commented: “Do you think you could
try to do a little better?”

2. At the end of Trial 4, the experimenter commented:
“Try to show more improvement.”

3. The comment at the end of Trial 7 was: “Try
a little harder.”

4. At the end of Trial 10, the experimenter commented:
“Make more of an effort to get them right if you can.”

After performing for 16 trials on the first list, the
subjects were then administered 16 trials on the
second list. For each subject, administration of the
second list was preceded by a paraphrasing of the
same instructions which had preceded the first list.
The experimenter then stated: “Now let’s go on to
the next part. This will work in the same way as the
first part.” During the subject’s performance on the
second list the experimenter made neither good nor
try-harder comments.


RESULTS 


The results were analyzed in terms of 3 X 3 X 2 X 2 X 2 X 16
analyses of variance with the 16 trials as a repeated
measurements variable. The data for the two lists were analyzed
separately.

TABLE 1
ANALYSIS OF VARIANCE OF NUMBER OF CORRECT RESPONSES ON LIST 1

Source                              df       MS  F            Source                      df       MS  F
Anxiety (A)                          2   492.79  11.01***     Trials (T)                  15   745.67  278.34***
Motivating instructions (I)          2    11.03               A X T                       30    10.13  [illegible]
Sex of subject (S)                   1   113.78               I X T                       30     2.04
Experimenter (E)                     1   309.17  6.91*        S X T                       15     4.01
Conditions (good-try harder) (C)     1   432.29  9.66*        E X T                       15    25.40  9.48***
A X I                                4   108.53               C X T                       15    13.02  4.36***
A X S                                2    25.75               A X I X T                   60     2.26
A X E                                2    18.37               A X S X T                   30     2.10
A X C                                2    56.17               A X E X T                   30     2.24
I X S                                2    67.40               A X C X T                   30     2.91
I X E                                2    23.28               I X S X T                   30     2.75
I X C                                2    53.70               I X E X T                   30     3.47
S X E                                1     2.01               I X C X T                   30     3.16
S X C                                1     0.02               S X E X T                   15     1.81
E X C                                1   147.02               S X C X T                   15     1.74
A X I X S                            4    77.21               E X C X T                   15     6.39  2.39[?]
A X I X E                            4    69.25               A X I X S X T               60     3.39
A X I X C                            4    57.20               A X I X E X T               60     2.43
A X S X E                            2    12.00               A X I X C X T               60     2.36
A X S X C                            2   118.60               A X S X E X T               30     1.29
A X E X C                            2    70.24               A X S X C X T               30     4.09  1.53[?]
I X S X E                            2    59.29               A X E X C X T               30     1.80
I X S X C                            2    90.75               I X S X E X T               30     2.00
I X E X C                            2    21.88               I X S X C X T               30     3.80
S X E X C                            1     4.52               I X E X C X T               30     0.95
A X I X S X E                        4    42.20               S X E X C X T               15     1.74
A X I X S X C                        4    30.04               A X I X S X E X T           60     2.13
A X I X E X C                        4    14.00               A X I X S X C X T           60     2.29
A X S X E X C                        2    70.07               A X I X E X C X T           60     1.13
I X S X E X C                        2    10.97               A X S X E X C X T           30     2.42
A X I X S X E X C                    4    27.77               I X S X E X C X T           30     2.44
Error (a)                           72    44.16               A X I X S X E X C X T       60     2.68
                                                              Error (b)                 1080     2.68

* p < .05.
*** p < .005.
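The degrees of freedom in an analysis of this kind follow directly from the number of levels of each factor. The sketch below is a minimal check, not the authors' computation: it assumes two subjects per cell (a figure inferred from the 72 error degrees of freedom used in the F tests, not stated in this passage), and the variable names are illustrative.

```python
from itertools import combinations
from math import prod

# Levels of each between-subjects factor, labeled as in the table.
between = {"A": 3, "I": 3, "S": 2, "E": 2, "C": 2}
n_trials = 16           # repeated-measurements factor (T)
subjects_per_cell = 2   # ASSUMPTION: inferred, not given in the text


def effect_df(factors):
    """df of a main effect or interaction: product of (levels - 1)."""
    return prod(between[f] - 1 for f in factors)


n_cells = prod(between.values())                 # number of design cells
error_a_df = n_cells * (subjects_per_cell - 1)   # between-subjects error
error_b_df = error_a_df * (n_trials - 1)         # within-subjects error

# Every between-subjects effect, plus its interaction with trials.
df_table = {"T": n_trials - 1}
for k in range(1, len(between) + 1):
    for combo in combinations(between, k):
        name = " X ".join(combo)
        df_table[name] = effect_df(combo)
        df_table[name + " X T"] = effect_df(combo) * (n_trials - 1)

if __name__ == "__main__":
    print("cells:", n_cells)
    print("Error (a) df:", error_a_df, "Error (b) df:", error_b_df)
    for source in ("A X I", "A X T", "A X I X S X E X C X T"):
        print(source, df_table[source])
```

Under that assumption the error terms come out at 72 and 1080 degrees of freedom, which are the denominators quoted with the F ratios in the text.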
Number of Correct Responses


Table 1 presents a summary of the results of the analysis of
variance performed on the number of correct responses on List 1.
The F for the TAS variable, significant at the .005 level, was
due to the superiority of the low TAS group (M = 75.54) to the
middle (M = 55.93) and high TAS groups (M = 51.44). This
superiority manifested itself, also, in the TAS X Trials
interaction which indicated a steeper learning curve for low
than for middle and high TAS subjects. The interrelationships of
TAS and sex were demonstrated by several findings. It was found
that for females under the good condition, low TAS subjects
(M = 88.42) were significantly (p < .05) superior to high TAS
(M = 57.92) and middle TAS (M = 46.83) scorers. Female low TAS
subjects under the good condition (M = 88.42) were also
significantly superior to comparable subjects under the
try-harder condition (M = 54.91). Male high TAS scorers under
the good condition (M = 63.50) performed at a higher level than
did high TAS males under the try-harder condition (M = 40.34,
p < .05). No comparable differences were obtained for high TAS
females. [...] try-harder condition were found to be superior to
mid[...]. The [...]tion was [...] middle TAS subjects under
try-harder (M = 52.42) than for female middle TAS subjects under
good (M = 46.83, p < .05). Comparisons among all other groups
[...] for subjects run [...] harder condition.

The two experimenters had significantly different effects on
the subjects. The subjects run by the male performed
(M = 66.83) at a higher level and learned at a faster rate than
did subjects run by the female experimenter (M = 55.10). This
difference was due mainly to the subjects’ performance under
the good condition. Under it, the subjects run by the male
experimenter (M = 77.81) performed at a higher level than did
all other groups, including the subjects run by the female
experimenter under the good condition (M = 58.00). Overall, the
good condition (M = 67.90) led to a higher level of performance
and faster learning than did the try-harder condition
(M = 54.05).
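Because the experimenter and comment factors are crossed, the means reported in this paragraph constrain one another. The sketch below is a minimal illustration, assuming equal numbers of subjects in each experimenter-by-condition cell (an assumption; cell sizes are not given in this passage). It checks the good-condition marginal against its two cells and derives the two try-harder cell means, which the text does not report. The function and variable names are illustrative.

```python
# Means reported in the text (number of correct responses, List 1).
male_overall = 66.83    # subjects run by the male experimenter
good_male = 77.81       # male experimenter, good condition
good_female = 58.00     # female experimenter, good condition
good_overall = 67.90    # good condition, both experimenters
try_overall = 54.05     # try-harder condition, both experimenters


def other_cell(marginal, known_cell):
    """With equal cell sizes a marginal mean is the average of its
    two cells, so the unknown cell is 2 * marginal - known cell."""
    return 2 * marginal - known_cell


# Consistency check: the good marginal should average its two cells.
good_from_cells = (good_male + good_female) / 2

# Implied (not reported) try-harder cell means under the assumption.
try_male = other_cell(male_overall, good_male)
try_female = other_cell(try_overall, try_male)

if __name__ == "__main__":
    print(f"good marginal from cells: {good_from_cells:.3f}")
    print(f"implied try-harder mean, male experimenter: {try_male:.2f}")
    print(f"implied try-harder mean, female experimenter: {try_female:.2f}")
```

The derived values are only as good as the equal-cell-size assumption; with unequal cells the marginals would be weighted averages instead.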

For List 2, the TAS, experimenter, and good versus try-harder
conditions, again, had significant effects on learning. Table 2
summarizes the analysis of variance performed on the number of
correct responses for List 2. The major differences between
Tables 1 and 2 relate to the variable of instructions. Whereas
instructions were not involved in any of the List 1 significant
Fs, they were a significant factor in several List 2
interactions. The TAS X Instructions X Experimenter interaction
was due primarily to the significant superiority of the low TAS
(high and low achievement-oriented subjects run by the male
experimenter) to all other groups, under high and low
achievement-orienting conditions. The Instructions X Sex
X Experimenter interaction was due to the superiority of male
subjects run by the male experimenter under the
achievement-orienting condition (M = 115.68) to male subjects
run by the female experimenter under the same condition
(M = 69.76). However, the comparable female groups did not
differ in level of performance. The Instructions X Sex
X Conditions interaction was due to the significant superiority
of males under the neutral instructions and good condition
(M = 115.68) to comparable males under the try-harder condition
(M = 62.72). The analogous result for females did not approach
significance. A significant interaction involving trials,
Instructions X Conditions X Trials, resulted from faster
learning for groups under the neutral instructions and good
condition than under the neutral instructions and the
try-harder condition. No comparable differences were obtained
under the other two instructional conditions.


TABLE 2
ANALYSIS OF VARIANCE OF NUMBER OF CORRECT RESPONSES ON LIST 2

Source                              df       MS  F            Source                      df       MS  F
Anxiety (A)                          2   555.01  8.33***      Trials (T)                  15  1284.95  418.96[?]
Motivating instructions (I)          2    18.22               A X T                       30    10.67  3.48***
Sex of subject (S)                   1     0.77               I X T                       30     2.51
Experimenter (E)                     1   927.71  13.92*       S X T                       15     2.20
Conditions (C)                       1   434.03  6.51**       E X T                       15    22.26  7.26***
A X I                                4   151.82               C X T                       15     6.85  2.23[?]
A X S                                2     1.90               A X I X T                   60     3.90
A X E                                2    71.30               A X S X T                   30     1.89
A X C                                2   135.84               A X E X T                   30     2.90
I X S                                2    62.68               A X C X T                   30     4.79  1.56[?]
I X E                                2     4.85               I X S X T                   30     2.21
I X C                                2   135.50               I X E X T                   30     1.17
S X E                                1     0.29               I X C X T                   30     5.55  1.81[?]
S X C                                1    21.78               S X E X T                   15     2.27
E X C                                1    70.84               S X C X T                   15     1.66
A X I X S                            4   112.56               E X C X T                   15     3.06
A X I X E                            4   197.66  2.97*        A X I X S X T               60     4.16  1.36[?]
A X I X C                            4    49.73               A X I X E X T               60     6.62  2.16[?]
A X S X E                            2   110.69               A X I X C X T               60     2.03
A X S X C                            2   212.96  3.20*        A X S X E X T               30     3.47
A X E X C                            2    22.47               A X S X C X T               30     6.09  1.99**
I X S X E                            2   318.95  4.79**       A X E X C X T               30     2.05
I X S X C                            2   276.36  4.15**       I X S X E X T               30     6.73  2.19***
I X E X C                            2    58.77               I X S X C X T               30     7.68  2.50[?]
S X E X C                            1    51.36               I X E X C X T               30     3.03
A X I X S X E                        4    21.16               S X E X C X T               15     1.94
A X I X S X C                        4    49.56               A X I X S X E X T           60     1.96
A X I X E X C                        4    51.70               A X I X S X C X T           60     3.10
A X S X E X C                        2   187.89               A X I X E X C X T           60     4.76  1.55***
I X S X E X C                        2     0.76               A X S X E X C X T           30     4.15
A X I X S X E X C                    4    40.27               I X S X E X C X T           30     1.36
Error (a)                           72    66.65               A X I X S X E X C X T       60     2.30
                                                              Error (b)                 1080     3.07

* p < .05.
** p < .025.

The TAS X Conditions X Trials interaction, 
significant for List 2, but not for List 1, was due 
to the faster learning of high and low TAS sub- 
jects under the good than under the try-harder 
condition. No comparable difference was obtained 
for middle TAS subjects. 


Number of On-the-List Errors


On List 1, there was a significant interaction between TAS and
Instructions (F = 3.34, df = 4/72, p < .025). This effect
resulted from the relatively large number of on-the-list errors
made by high TAS-neutral subjects (M = 21.50) and the
relatively small number of such errors made by high TAS-low
achievement-oriented subjects (M = 7.44). An obtained
Instructions X Sex interaction (F = 3.50, df = 2/72, p < .05)
was due to a greater number of on-the-list errors for male
(M = 19.38) than female subjects (M = 10.34) under the high
achievement-orienting instructions than under the other
instructional conditions. An interaction involving TAS, Sex,
and Experimenter (F = 4.69, df = 2/72, p < .025) was due
primarily to the relatively fewer on-the-list errors made by
high (M = 6.58) and low (M = 10.83) TAS female subjects run by
the female experimenter than by the other groups run by both
experimenters. There was a significant Instructions X Sex
X Conditions interaction (F = 6.47, df = 2/72, p < .005). Under
the try-harder condition, males under the neutral instructions
gave the fewest on-the-list errors (M = 7.25); for females,
under the same condition, the high achievement-orienting
instructions were associated with the fewest errors (M = 8.58).
For males, under the neutral instructions and good condition,
there were more such errors than was the case under the other
two instructional conditions. There were no significant
differences among the three instructional female groups given
the good condition.


There were several interactions involving trials: Sex X Trials
(F = 2.61, df = 15/1080, p < .005); Experimenter X Trials
(F = 3.34, df = 15/1080, p < .005); Sex X Experimenter X Trials
(F = 2.54, df = 15/1080, p < .005); Instructions X Sex
X Conditions X Trials (F = 1.46, df = 30/1080, p < .05). The
major differences relating to the trials effects were: the
greater tendency of males than females early in learning to
make on-the-list errors, the greater tendency of the male
experimenter’s subjects to make errors in the middle learning
trials than was true for the female experimenter’s subjects,
and the greater tendency of female subjects run by the male
experimenter to show a steeper increase in on-the-list errors
than all other groups.

The results of the List 2 analysis of errors on the list were
similar to those of List 1. However, on List 2, it was found
that females run by the male experimenter made a greater total
number of errors than did all other subjects (F = 7.63,
df = 1/72, p < .01). High TAS subjects under the low
achievement-orienting instructions and low TAS subjects under
the neutral instructions showed the smallest increase over
trials in errors on the list of all groups. High TAS-neutral
subjects showed the greatest increase in errors for Trials 7-12
of all groups (F = 1.41, df = 60/1080, p < .025). In addition,
there was a significant Instructions X Sex X Trials interaction
(F = 1.68, df = 30/1080, p < .025).


Number of Off-the-List Errors 


The most striking finding in the List 1 analysis 
of this class of errors was a significant difference 
due to the experimenters (F=31.47, df= 1/72, 
p<.005). The subjects run by the male experi- 
menter gave significantly more errors off the list 
(M =8.67) than did those run by the female 
experimenter (M = 1.33). 

All of the other significant List 1 results 
involved the repeated measurements variable. 
There was a significant increase in off-the-list
errors over trials (F = 5.15, df = 15/1080, p
< .005). There was, also, an interaction between
the experimenter factor and trials (F = 2.84, 
df = 15/1080, p < .005). This resulted from a 
very small increase over trials of off-the-list 
errors for the female and an increase (followed 
by an asymptote) in these errors for the male 
experimenter’s subjects over the first six learning 
trials. 

There was a significant Instructions X Reinforcement X Trials
interaction (F = 1.66, df = 30/1080, p < .025). This was
attributable primarily to the performance of the high
achievement-oriented-try-harder group. This group showed a
steeper increase in these errors early in learning than did the
other experimental groups.

[...] interactions: TAS X Instructions X Sex X Trials
(F = 1.39, df = 60/1080, p < .025) and TAS X Sex
X Experimenter X Trials ([...] p < .025).

As was the case for List 1, subjects run by the male
experimenter gave markedly more errors off the list on List 2
than did the female experimenter’s subjects (F = 30.65,
df = 1/72, p < .005). Also, the results for Trials (F = 5.42,
df = 15/1080, p < .005) and Experimenter
X Trials (F = 2.34, df = 15/1080, p < .005)
were similar to those obtained for List 1. The 
TAS X Sex X Trials interaction was significant 
for List 2 (F =1.58, df = 30/1080, p< .025), 
and was due to relatively greater increases in 
off-the-list errors for female than male middle 
and low TAS subjects, but just the reverse for 
high TAS subjects. There were two higher order 
interactions which attained significance in the 
List 2 off-the-list errors: Instructions X Sex 
X Conditions X Trials (F = 1.88, df = 30/1080, 
p<.005) and Sex X Experimenter X Conditions 
X Trials (F = 1.68, df = 15/1080, p < .05). 


Nonresponses 


The subject’s score on this variable was the 
number of times he made no attempt to antici- 
pate a dissyllable. These are, then, errors of 
omission. For List 1, a significant F for TAS 
reflected significantly more nonresponses made 
by the high (M=201.17) and middle TAS 
(M = 195.52) groups than the low TAS group 
(M =175.96, p<.005). The subjects run by 
the female experimenter made more nonresponses 
(M = 202.05) than did those run by the male 
experimenter (M = 179.12, p < .005), and those 
administered the try-harder condition (M 
= 196.94) made more errors of omission than 
did those subjects administered the good condi- 
tion (M = 184.22, p < .025). 

Significant at the .025 level was the TAS 
X Instructions interaction. Under the neutral
condition, there were no significant differences 
among the three anxiety groups; under the low 
and high achievement-orienting conditions, low 
TAS subjects made fewer errors of omission 
than did middle and high TAS subjects. These 
two low TAS groups made fewer errors than 
all other groups combined. The high TAS-neutral 
group made fewer nonresponses than the other 
high TAS groups; the low TAS-neutral group 
made more nonresponses than did the other low 
TAS groups. The errors for the middle TAS sub- 
jects under the three instructional conditions 
did not differ significantly. 

The Instructions X Sex X Conditions interac- 
tion (p<.025) was due primarily to fewer 
nonresponses by male groups administered the 
good condition than other neutral groups, and 
fewer nonresponses by male groups under the 
achievement-orienting conditions than the com- 
parable female subjects. 

The F for Trials (p < .005) resulted, expectedly, from a
decrease in errors of omission over trials. The F for TAS
X Trials (p < .005) was due to the more rapid decline in
nonresponses over trials for low as compared with the middle
and high TAS subjects. Analysis of the [...] underlying the
Experimenter X Trials interaction (p < .005) indicated a faster
decline in nonresponses for subjects run by the male
experimenter than those run by the female experimenter. The
Conditions X Trials F (p < .005) resulted from the more rapid
decline of errors of omission by the good than by the
try-harder group. A significant higher order interaction
involving instructions, sex, conditions, and trials was also
obtained (p < .025).

An analysis of variance performed on List 2 errors of omission
indicated that the bases [...] the significant TAS,
Experimenter, TAS X Instructions, Instructions X Sex
X Experimenter, Trials, TAS X Trials, and Experimenter X Trials
[...] than for all other high TAS groups. Just the opposite was
the case for low TAS subjects. Under [...] low TAS subjects. A
significant TAS X Sex X Experimenter interaction (p < .05)
resulted from fewer errors for subjects run by the male
experimenter than by the female experimenter except for middle
TAS females and low TAS males.

There were two higher order interactions involving trials: TAS
X Instructions X Experimenter X Conditions X Trials (p < .005)
and TAS X Sex X Experimenter X Conditions X Trials (p < .025).


DISCUSSION 


The results of the present experiment support previous evidence
concerning the important influences of characteristics
associated with the subject and characteristics associated with
the situation in which performance occurs on serial learning.
However, although strong support is provided by the present
findings for the interrelationships among organismic and
situational variables in learning, the form of them for the
present sample of high-school students was not the same as has
been previously reported for most of the experimental work on
anxiety, which has involved college students as subjects
(Sarason, 1956, 1960; Sarason & Palola, 1960). Previous
research supports the hypothesis that high-anxious subjects
perform at a lower level than do low-anxious subjects under
high achievement-orienting conditions, but that under low
achievement-orienting or neutral conditions this difference in
performance does not occur. In the
present study, the low TAS group showed an 
overall superiority when compared with high 
and middle TAS subjects. Furthermore, both the 
high and low achievement-orienting instructions 
had facilitative effects on the performance of 
low TAS subjects and led to higher levels of 
performance than for other groups. In accord 
with previous findings was the lack of differences 
among the three TAS groups under the neutral 
instructional condition. When the results were 
analyzed in terms of nonresponses it was found 
that while high TAS subjects made fewer errors 
of omission under the neutral than under the 
other two instructional conditions, just the 
opposite result obtained for low TAS subjects. 
Most of the significant effects involving in- 
structions occurred for List 2. It should be 
recalled that the instructions were administered 
twice, before performance on each list. On the 
other hand, the good and try-harder conditions 
were employed only for List 1, but their effects 
were discernible on List 2 as well as List 1. In 
view of the strong differences obtained between 
these conditions, it seems somewhat surprising 
that the serial learning literature contains so 
little work on the effects of the human rein- 
forcer on acquisition. Of particular interest was 
the fact that, in addition to an overall superior- 
ity of subjects under the good condition to those 
under try-harder, the experimenter and the experimenter-comment
variables significantly interacted. The subjects given the good
comments by the male experimenter performed at a higher level
than did all other groups in the experiment. Studies of
reinforcement and punishment in the verbal conditioning area
provide good evidence that changes in operant verbal responses
are influenced differentially by reinforcement and
characteristics associated with experimenters (Krasner, 1958).
Does it not seem worthwhile to inquire into the same influences
in learning and performance? On the basis of our findings
concerning the good versus try-harder variable, we feel that it
would be potentially very fruitful to investigate
systematically the interpersonal reinforcement of behavior in
intellectual performance situations. Winkel and Sarason (1964),
Archer,
Cejka, and Thompson (1961), and Hetherington 
and Ross (1963) in recent studies have provided 
evidence which supports our affirmative answer 
to this question.

The present results also suggest that analysis
in terms of correct responses alone does not
provide as comprehensive a reflection of the
subject’s behavior as is true for the several
analyses performed in our experiment. It would 
appear that instructions may facilitate guessing 
on the part of subjects, but not necessarily facili- 
tate performance level. It would seem worth- 
while in future research in verbal and other 
learning situations to study experimentally vari- 
ables which might influence the subject’s 
willingness to guess and to make errors.


EFFECTS OF INDUCED MUSCULAR TENSION ON PERFORMANCE
AND THE INVERTED U FUNCTION 1


CHARLES G. WOOD, Jr. AND JACK E. HOKANSON
Florida State University 


It was hypothesized that heart rate, as a function of increasing levels of 
induced muscular tension, would increase linearly throughout the range of 
tension, whereas, performance on a simple intellectual task, would progressively 
increase up to a moderate level of tension, beyond which it would progressively 
decline. A group of 100 undergraduate students enrolled in an introductory 
psychology course, equated for basal level heart rate, were divided into 5 
groups. Each group was subjected to 1 of 5 levels of induced muscular tension. 
Heart rate and performance on the intellectual task were measured before and
after the induced tension, and the computed differences recorded as the 
measure of change. These differences, analyzed by means of an analysis of 
variance, revealed significant differences for both heart rate and performance. 
In addition, trend tests were performed to see if these differences followed the 
predicted trends. It was found that heart rate displayed a significant linear 
trend across conditions, whereas performance scores manifested the predicted
significant quadratic component.


There have been numerous studies reported in 
the literature since the early 1930s concerned 
with the effects of induced muscular tension on 
various types of performance. Many of these 
studies had their common impetus in a study 
by Bills (1927). In this study he had experi- 
mental subjects perform several types of mental 
tasks while squeezing a dynamometer in each 
hand. The subjects in the control group performed on the same
tasks under relaxed conditions. Bills then analyzed the results
on the basis of three criteria of efficiency: speed, amount of
recall, and effects of practice. The results revealed that for
all criteria, performance under the conditions of tension was
superior to performance under relaxed conditions. In addition,
the increase in efficiency under tension was enhanced with
practice, where speed was the criterion, but remained constant
for the other two criteria. Also, with speed as the criterion,
there
appeared to be an improvement in efficiency 
under conditions of tension as the subject grew 
more fatigued while, again, for the other criteria, 
the amount of efficiency remained constant. 

There followed other studies reporting results 
that were consistent with those found by Bills 
(Courts, 1939; Freeman, 1938; Stauffacher, 
1937; Stroud, 1931). In an effort to resolve a 
conflict in findings by Freeman (1933) and a 
study by Jacobson (1929), Stauffacher performed an experiment
to determine whether varying the amount of induced muscular
tension would have differential effects upon learning. He took
40 subjects and determined each individual’s maximum strength
by having them pull with all of their strength on a spring
dynamometer for a 1-minute interval. Then, taking proportions
of this maximum pull (0, [?], [?], [?]) he had all subjects
learn a list of 15 3-letter nonsense syllables. Thus, the four
proportions of the original tension constituted the four
experimental conditions of the experiment. Since all subjects
performed under all four conditions, he used four lists of
syllables which were equated and which controlled for effects
of learning from one condition to the next. Stauffacher
reported that for all three degrees of tension ([?], [?], [?])
there was a facilitating effect on learning when compared to
the amount of learning observed under the fourth condition of
no tension. He pointed out that only one condition ([?]) was
significantly above normal but the curve obtained was
suggestive of an inverted U function. Stauffacher also
extrapolated data from the studies by Freeman and Jacobson
which he superimposed upon his original curve. When the two
tails of the curve were extended in the direction of the
extremes (that is, complete relaxation and intense tension) it
was that of the inverted U. Later Courts replicated these
findings and increased the tension beyond that used in
Stauffacher’s study. He found a detrimental effect on learning
at the extreme levels of tension as Stauffacher had predicted.

1 This research was supported by a grant from the
Research Council of the Florida State University.

Results such as these have been recently summarized by Malmo
(1959) and related to a general theory of physiological
activation. The hypothesis advanced states generally that as
“arousal” increases from low to moderate levels, there is a
concomitant improvement in intellectual performance; but that
as arousal is increased from moderate to high levels, there is
a decrease in intellectual efficiency. Methodologically, Malmo
also suggested that physiological indicants of arousal be
measured concurrently with performance data in order to
adequately test this hypothesis.

In an attempt to relate intellectual perform- 
ance with an indicant of arousal, Burgess and 
Hokanson (1964) and Hokanson and Burgess 
(1964) used subjects who had characteristically 
low or high heart rates. Following a frustration 
procedure, which elevated heart rate equally for 
both groups, the low heart rate subjects showed 
a significant improvement on Digit Symbol 
scores, while those with high heart rates had a 
relative decrement in scores. These results were 
interpreted as being consistent with Malmo’s 
inverted U hypothesis. 

The studies reviewed thus far have had 
methodological limitations which curtail their 
adequacy in testing Malmo’s hypothesis. The
Hokanson and Burgess studies used extreme
groups and omitted the middle range of heart
rates, thus making an interpretation of the data
as a continuous, inverted U function highly
tenuous. Along different lines, the early studies
reviewed, along with a more recent study by 
Lovaas (1960) did not use any independent 
measure of physiological arousal and thereby 
failed to present data necessary for a complete 
evaluation of the hypothesis. 

The present study is an attempt to utilize the 
essentials of the Stauffacher design, with induced 
muscular tension as the independent variable, 
but with two dependent variables: Digit Symbol performance and
heart rate. In addition, only subjects who fall in the middle
range of resting heart rates are utilized, thereby reducing the
possibility of obtaining performance differences because
subjects were initially drawn from different populations.

The specific hypotheses to be tested in this experiment are:
that performance on a simple learning task (a digit-symbol
task) will increase as a function of the degree of induced
muscular tension up to an optimal point and then progressively
fall off as the degree of tension is increased further, and
that heart rate will continue to increase in a relatively
linear fashion throughout the range of induced tension.
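The two hypotheses correspond to different polynomial trend components across the five tension levels: a linear component for heart rate and a quadratic (inverted U) component for performance. The sketch below shows how such components can be computed with the standard orthogonal polynomial coefficients for five levels. It assumes the levels are equally spaced, and the group means are hypothetical values chosen only to illustrate the two shapes; they are not data from the study.

```python
# Orthogonal polynomial coefficients for five equally spaced levels.
LINEAR = [-2, -1, 0, 1, 2]
QUADRATIC = [2, -1, -2, -1, 2]


def contrast_ss(means, coeffs, n_per_group):
    """Sum of squares for a contrast on group means:
    n * (sum of c_i * mean_i) ** 2 / (sum of c_i ** 2)."""
    psi = sum(c * m for c, m in zip(coeffs, means))
    return n_per_group * psi ** 2 / sum(c * c for c in coeffs)


# HYPOTHETICAL mean change scores, ordered from no tension to
# maximum tension. These are illustrative, not the study's data.
heart_rate_change = [2.0, 4.0, 6.0, 8.0, 10.0]    # steadily rising
performance_change = [1.0, 3.0, 4.0, 3.0, 1.0]    # inverted U

n = 20  # subjects per group, as stated for this experiment

hr_linear = contrast_ss(heart_rate_change, LINEAR, n)
hr_quadratic = contrast_ss(heart_rate_change, QUADRATIC, n)
perf_linear = contrast_ss(performance_change, LINEAR, n)
perf_quadratic = contrast_ss(performance_change, QUADRATIC, n)

if __name__ == "__main__":
    print("heart rate:  linear", hr_linear, "quadratic", hr_quadratic)
    print("performance: linear", perf_linear,
          "quadratic", round(perf_quadratic, 2))
```

With these illustrative means, the steadily rising series loads entirely on the linear component and the symmetric inverted U loads entirely on the quadratic component, which is the pattern the hypotheses predict.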
METHOD
Subjects 


The subjects used in this experiment consisted of 
100 undergraduate students enrolled in introductory 
psychology courses at Florida State University. 
These subjects were selected from a total of 173 
subjects on the basis of their basal heart rate. Only 
those subjects whose basal heart rate was within 
the normal range of 72-92 beats per minute were 
retained, thus necessitating the elimination of 73
subjects. This procedure was followed in an effort
to control for the possible effects that individual
differences in heart rate might have on a given
response pattern. The subjects were then assigned
to one of five condition groups. These groups were
differentiated on the basis of the proportion of
induced muscular tension. These were: maximum pull,
[?] of maximum pull, [?] of maximum pull, [?] of
maximum pull, and no induced tension.

Each group consisted of 20 subjects, 8 males and 
12 females. 


Apparatus 


A method similar to that described by Stauffacher
(1937) was employed to induce muscular tension.
Two pulleys were mounted on the back edge of a 
table about shoulder-width apart. A rope, suspended 
through either one or the other pulley (depending 
on whether the subject was right- or left-handed), 
was attached at one end to a stirruplike handle and 
at the other, to weights. Seated in front of the table 
the subject was able to lift the weights off of the 
floor and support them at his side with a minimum 
of distraction. 

In order to determine a subject’s maximum
strength, an identical stirruplike handle was
attached, by means of a rope, to one of two
produce-type scales. These were mounted shoulder-width
apart on a solid piece of wood measuring 4 X 30
inches. This assembly, as well as the pulley assembly, 
was secured to the table by means of two large 
C clamps. Thus, the removal and exchange of the 
two assemblies during the course of the experiment 
was greatly facilitated. 

The learning task consisted of two equivalent 
forms of a modified digit-symbol task. This task 
was chosen for its demonstrated reliability as well as 
its simplicity and ease of administration. 

Heart rate was recorded by means of a Beck-Lee 
electrocardiograph equipped with flexible leads which 
enabled the subject to perform the tension-inducing 
and learning tasks unimpaired. Cardiac pulses were 
recorded as a prominent vertical pen displacement 
on EKG recording paper. The chart drive on this 
apparatus is exceptionally reliable and fast (25 milli- 
meters per second). In addition, the line divisions 
on the recording paper are in 4-second spaces, thus
making the counting of heart beats relatively simple
during each recording interval. Heart rate was
recorded periodically during an initial 5-minute rest
period and after the induced muscular tension, for
30-second intervals. Turning the EKG recorder on
and off did not interfere with the subjects’
performance in that a single, silent switch was used.
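Counting beats over a 30-second recording interval and screening on the 72-92 beats-per-minute range reduces to simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and treating both ends of the range as acceptable is an assumption, since the text does not say how boundary values were handled.

```python
def beats_per_minute(beats_counted, interval_seconds=30):
    """Convert a beat count over a recording interval to beats/minute."""
    return beats_counted * 60 / interval_seconds


def in_acceptable_range(bpm, low=72, high=92):
    """True if a basal heart rate falls within the retained range.
    ASSUMPTION: both endpoints are counted as acceptable."""
    return low <= bpm <= high


if __name__ == "__main__":
    for beats in (34, 40, 48):
        bpm = beats_per_minute(beats)
        print(beats, "beats in 30 s ->", bpm, "bpm,",
              "retained" if in_acceptable_range(bpm) else "released")
```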


Procedure 


Upon entering the experimental room, the subject 
was seated at the table and asked a number of 
questions pertaining to a personal data sheet. The 
subject was then told that before the main part of 
the experiment began, it was necessary to determine 
his maximum strength. To minimize interference 
with the subject’s subsequent handwriting due to 
fatigue, he was instructed to pull back only with 
the hand opposite the one with which he ordinarily 
wrote. The reading on the scale at the end of 1
minute was taken as the subject’s maximum pull.
The subject was then told to relax while the
experimenter removed the scale assembly and replaced it
with the pulley assembly.

The nature of the experiment was briefly explained 
as one dealing with the relationship between the 
circulatory system and learning. This was followed 
by a brief description of the procedure as consisting 
of the adaptation period, the first digit-symbol task, 
the physical task, and the second digit-symbol task. 

Following this introduction, the electrocardiogram 
leads were attached to the subject’s right arm, left 
arm, and right leg and he was instructed to rest 
for 5 minutes in order that his heart rate could 
reach a basal level. During this period, heart rate 
was recorded three times at approximately 2-minute 
intervals, with the last recording taken as the 
measure of basal heart rate. If the subject’s heart 
rate was within the acceptable range, the remainder 
of the experiment was instituted. Those whose heart 
rate was above or below the acceptable range were 
released. 

Immediately after the 5-minute rest period, the 
subject was given the first digit-symbol task and 
told to match the digits as indicated in the key. 
There were eight digit-symbol pairs on the task. 
The number of digits correctly matched at the end 
of 1 minute was recorded as the pretension measure 
of performance. This was in turn followed by having 
the subject pull back the weights and support 
them at his side for a 1-minute period. The amount 
of weight was determined by the proportion of 
his maximum pull corresponding to the condition 
to which he had been assigned. Heart rate was 


TABLE 1 
HEART RATE DIFFERENCE SCORES 

Source               df        SS         MS         F 

Rows                 19     1101.44 
Induced tension      (4)   (1954.24)    488.56     11.27* 
  Linear              1     1848.32    1848.32     42.65* 
  Quadratic           1        2.06       2.06 
  Cubic               1       54.08      54.08 
  Quartic             1       49.78      49.78 
Error                76     3293.76      43.34 

Total                99     6349.44 

* p < .01. 
TABLE 2 
PERFORMANCE DIFFERENCE SCORES 

Source               df        SS         MS         F 

Rows                 19      191.31 
Induced tension      (4)    (122.06)     30.52      4.5417* 
  Linear              1        6.48       6.48 
  Quadratic           1      108.15     108.15     16.0907* 
  Cubic               1        0.00       0.00 
  Quartic             1        7.43       7.43 
Error                76      510.34       6.72 

Total                99      823.71 

* p < .01. 


again recorded and the difference between this and 
the initial recording constituted the amount of 
arousal associated with that condition of induced 
muscular tension. The subject was then immediately 
given the second form of the digit-symbol task. The 
number of digits correctly matched at the end of 
1 minute was recorded as the posttension measure of 
performance. The difference between this score and 
the pretension measure was computed and this value 
recorded as the change in performance associated 
with that level of induced tension. 
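
The scoring just described reduces to three subtractions and one multiplication per subject. The sketch below is only an illustration: the record layout, the field names, and every number in the example are hypothetical, and the proportion assigned to a condition is passed in as a parameter rather than taken from the study.

```python
from dataclasses import dataclass


@dataclass
class SubjectRecord:
    max_pull: float         # scale reading from the maximum-strength trial
    proportion: float       # fraction of maximum pull for the assigned condition
    pre_heart_rate: float   # basal rate, beats per minute
    post_heart_rate: float  # rate recorded after the induced tension
    pre_digits: int         # digits correctly matched in 1 minute, first form
    post_digits: int        # digits correctly matched in 1 minute, second form

    def assigned_weight(self):
        """Weight the subject supports during the tension period."""
        return self.proportion * self.max_pull

    def arousal_change(self):
        """Heart rate difference score (post minus pre)."""
        return self.post_heart_rate - self.pre_heart_rate

    def performance_change(self):
        """Performance difference score (post minus pre)."""
        return self.post_digits - self.pre_digits


# Hypothetical subject; none of these numbers come from the study.
subject = SubjectRecord(max_pull=60.0, proportion=0.5,
                        pre_heart_rate=82.0, post_heart_rate=88.0,
                        pre_digits=25, post_digits=29)
```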


RESULTS 


An examination of the pretension performance 
scores for each level of induced tension (0, 1/4, 
1/3, 1/2, and maximum) reveals means of 25.25, 25.40, 
24.75, 23.80, and 24.80, respectively. Similarly, 
the mean pre-heart-rates were, in order, 82.20, 
82.20, 82.20, 83.80, and 82.20 beats per minute. 
An Fmax test performed on both the pre-heart- 
rate and the preperformance scores indicated 
that there were no differences in the variability 
of the subjects in the five groups. Thus, the 
condition of homogeneity of variance was met. 
In addition, an analysis of variance of pre-heart-rate 
and preperformance scores revealed no significant 
differences between the means of the 
five groups. 

The mean difference values (difference between 
pre- and postmeasures) for heart rate were 
-0.8, 5.0, 5.2, 9.0, and 12.4. In the same manner, 
the mean difference scores for performance were 
1.75, 3.40, 4.95, 3.80, and 2.45. The difference 
scores for both heart rate and performance were 
analyzed separately by means of an analysis of 
variance. As Tables 1 and 2 indicate, both 
revealed significant condition effects. In addition, 
the sum of squares for conditions was further 
divided into four trend comparisons using 
orthogonal polynomials. Thus, in Table 1 the 
term for linearity shows that heart rate difference 
scores follow a significant linear trend. 
Likewise, the quadratic component of the performance 
difference scores is shown to be significant 
(Table 2). The curves of mean difference 
scores for heart rate and performance are 
depicted in Figure 1. 
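
The trend partition reported in Tables 1 and 2 can be checked from the group means alone. The sketch below assumes the standard orthogonal polynomial coefficients for five equally spaced levels and 20 subjects per group; the group size is not stated in this passage and is inferred here from the total of 99 degrees of freedom across five groups. The function and variable names are illustrative only.

```python
# Orthogonal polynomial coefficients for five equally spaced levels.
COEFFS = {
    "linear":    (-2, -1, 0, 1, 2),
    "quadratic": (2, -1, -2, -1, 2),
    "cubic":     (-1, 2, 0, -2, 1),
    "quartic":   (1, -4, 6, -4, 1),
}

N_PER_GROUP = 20  # assumption: inferred from total df = 99 over five groups


def trend_ss(means, n=N_PER_GROUP):
    """Return {trend name: sum of squares} for five group means."""
    result = {}
    for name, coeffs in COEFFS.items():
        contrast = sum(c * m for c, m in zip(coeffs, means))
        result[name] = n * contrast ** 2 / sum(c ** 2 for c in coeffs)
    return result


# Mean difference scores as reported in the text.
heart_rate_means = [-0.8, 5.0, 5.2, 9.0, 12.4]
performance_means = [1.75, 3.40, 4.95, 3.80, 2.45]

hr_ss = trend_ss(heart_rate_means)
perf_ss = trend_ss(performance_means)

# F ratios against the error mean squares reported in Tables 1 and 2.
hr_f_linear = hr_ss["linear"] / 43.34
perf_f_quadratic = perf_ss["quadratic"] / 6.72
```

Under these assumptions the four heart rate components come out at about 1848.32, 2.06, 54.08, and 49.78, which sum to the 1954.24 shown for induced tension in Table 1. The performance components agree with Table 2 to within the rounding of the published means.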

The curves in Figure 1 provide a clearer 
picture of the relationship between heart rate 
and performance as a function of increasing 
levels of induced muscular tension. It can be 
seen that performance increases monotonically up 
to an optimal level of tension (in the vicinity 
of 4 of maximum tension), after which it begins 
to fall off. The resulting curvilinear trend was 
seen to be significant at the .01 level in Table 2. 
Heart rate, by contrast, is seen to increase in 
a linear fashion throughout the range of induced 
tension. The test for linearity in Table 1 reveals 
this to be significant at the .01 level. 


Discussion 


The results of this experiment support the 
hypotheses presented in the introduction. These 
hypotheses, derived primarily from Malmo’s 
inverted U function model, stated that perform- 
ance on a simple learning task would be facili- 
tated with increasing levels of induced tension, 
up to some maximum point whereupon it would 
begin to fall off with further increases in tension. 
Heart rate on the other hand, as the criterion 
of arousal, was predicted to increase in a rela- 
tively linear fashion throughout the range of 
induced tension. A look at the results presented 
in Tables 1 and 2 and Figure 1 shows that both 
of these predictions were correct. 

The results of this study are very similar to 
those obtained by Stauffacher (1937) and later, 
by Courts (1939), insofar as the relationship 
between induced muscular tension and performance 
is concerned. This is evident in looking at 
the performance curve in Figure 1, which takes 
the general shape of an inverted U and, as such, 
closely resembles the curve obtained in the 
earlier studies. In addition, the results are con- 
sistent with the more recent studies taking into 
account individual differences among subjects 
(Burgess & Hokanson, 1964; Hokanson & Bur- 
gess, 1964; Lovaas, 1960). Of possibly greater 
interest here is the demonstrated analogy to 
Malmo’s model of arousal and the predictions 
therefrom; that is, using heart rate as the cri- 
terion of arousal, the results support the arousal- 
performance relationship as predicted from the 
model. 

The authors recognize that the use of a single 
autonomic measure as an indicant of the general 
physiological status of any single subject may be 
questionable (Lacey & Lacey, 1958). The cur- 
rent group data however fits the Laceys’ concept 
of “group indicant concordance,” and as such 
portrays a general relationship with the stimulus 
parameters involved. In this light there are sev- 
eral methodological features of interest which 
tend to minimize group variability in the current 
design: the use of subjects with heart rates in 
the average range at resting level may tend to 
screen out persons with deviant cardiovascular 
responsiveness; the use of a standardized physical 
task, which in effect is tailored to each subject’s 
muscular strength, very likely reduces the vari- 
ability in autonomic responding found in studies 
utilizing interpersonal forms of stress. 
PREDICTING THE SUCCESS OF PEACE CORPS VOLUNTEERS 
IN NIGERIA¹ 


WALTER MISCHEL 
Stanford University 


The outcomes of assessment in 1 of the 1st Peace Corps projects are described. 
Self-report measures and various ratings during training in the U.S. were obtained 
on a group of 41 Peace Corps volunteers (PCVs) assigned to teaching 
posts in Nigeria. The criterion consisted of staff ratings of the volunteers in 
Nigeria. Scores on self-report measures (F, Ego Strength, and Manifest 
Anxiety scales) were related significantly to performance abroad. In contrast, 
global ratings (by training faculty, assessment board, and interviewer), as 
well as grades and peer ratings, were not significantly related to the criterion. 
Outcomes were examined also for PCVs with extreme scores on single and combined 
self-report measures. Results indicate that the self-report measures employed 
may have utility for screening and selection decisions. Implications for 
future assessment are discussed. 


This paper reports on certain aspects of assess- 
ment in one of the first Peace Corps projects. 
The measures selected were largely determined 
by practical considerations. There was little time 
for assessment, and testing had to be inserted 
into already overcrowded training schedules. 
Although the lessons of past large scale assess- 
ment projects (for example, Stern, Stein, & 
Bloom, 1956) were available, for the most part 
they could not be applied. No volunteers were 
abroad yet, and the details of the volunteers’ 
assignments and probable situations abroad 
were not known clearly. In brief, the definitions 
of the criterion performance remained ambigu- 
ous and global. Further, because of training 


1A modified version of this paper was read at 
the conference “Peace Corps and the Behavioral 
Sciences,” Department of State, Washington, D. C., 
March 5, 1963. Grateful acknowledgment is due 
to Alex Sergienko who was responsible for collecting 
data (with the exception of the self-report question- 
naires and objective tests) during training and to 
David Seeley who collected the criterion ratings in 
Nigeria. This paper is the responsibility of the author 
and should not be construed as an official Peace 
Corps document. 


policies and overcrowded time schedules, it was 
impossible to observe and rate volunteers in 
training situations sampling the kinds of behav- 
iors required abroad, for example, teaching. 
These factors imposed severe limitations upon 
assessment possibilities, but since the assessment 
situation was so novel and important the opportunity 
to attempt a prediction study, albeit based 
on limited data, had to be accepted. 

This study describes the use of some structured 
self-report measures for predicting the perform- 
ance of volunteers in the Harvard-Nigeria Peace 
Corps project. The paper also presents results 
from the use of grades during training, peer 
ratings, interviews, faculty ratings, and final 
assessment-board ratings, all collected independ- 
ently by the training staff during the Harvard 
training period. 

Findings from previous assessment projects led 
to the following expectations. First, it was antici- 
pated that self-report measures would be of value 
for broad screening-out purposes, with the 
poorest scores predicting relatively unsatisfactory 
performance. Second, it was expected that 
academic records (grades) obtained during training, 
and peer ratings would be moderately related 
to criterion performance. In view of their 
past performance, the writer had little optimism 
with respect to the “global” judgment techniques, 
namely, the ratings from unstructured interviews, 
faculty judges, and the assessment conference. 
However, due to the “face validity” of these 
latter measures they were included by the train- 
ing staff and provided an opportunity for com- 
paring the predictive values of a variety of 
techniques. 


METHOD 
Subjects 


When tested this group consisted of 28 male and 
13 female Peace Corps Volunteers (PCVs) trained 
for approximately 2 months at Harvard University 
during the summer of 1961 in preparation for teach- 
ing assignments in the schools of western Nigeria. 
The candidates in training were preselected from 
a much larger national pool of applicants. Prior to 
inclusion in the training program, candidates were 
screened in Washington, D. C., by an assessment 
staff which based the screening decisions primarily 
on the PCV application questionnaire, letters of 
recommendation, the applicant’s previous academic 
and work record, and performance on the Peace 
Corps entrance examinations. 

From this group of 41, 4 were screened out at the 
end of the Harvard training program. After a further 
training period at the University of Ibadan (Nigeria) 
the PCVs took their teaching assignments in different 
schools throughout the western region of Nigeria. 
At the time of the criterion ratings (April 1962), 35 
PCVs remained in Nigeria. 


Predictor Data: Self-Reports 


The self-report data were collected by the writer 
in late August of 1961, that is, towards the end 
of the Harvard training program. The tests were 
administered to all PCVs in a group session at 
Harvard. It was made clear to the PCVs that these 
tests were used for research purposes only and it 
was stressed that the results would not affect any 
Peace Corps decisions that might be made about 
individual respondents. (This was indeed the case.) 
The tests were administered in the standard ways 
customary for each instrument. The test battery 
consisted of several paper-and-pencil measures. These 
included the 24-item version of the California F 
Scale composed of 12 items from the original F Scale 
(Adorno, Frenkel-Brunswik, Levinson, & Sanford, 
1950) and 12 items dealing with “traditional family 
ideology” (Levinson & Huffman, 1955), the Ego 
Strength (Es) scale (Barron, 1953), the Manifest 
Anxiety (MA) scale (Taylor, 1953), and an abbreviated 
intelligence test.² Items from the Es and MA 
were interspersed within the same format. 


2A group-administered adaptation of three verbal 
subtests (Similarities, Information, Vocabulary) from 
the Wechsler Adult Intelligence Scale (WAIS) was 
also administered. The correlation between total 
The above measures were selected because they 
can be administered and scored quickly, simply, and 
objectively, and a considerable research literature 
exists for each of them. Most important, it was 
known that the criterion ratings would be fairly 
global and the assessors had insufficient information 
concerning the details of the criterion to make 
specific predictions. Therefore, the predictions of 
relative success (or relative failure) had to be based 
on fairly general considerations, in accord with the 
broad goals stated by the Peace Corps. These goals 
included the PCV’s appreciation of the new culture 
and the development of mutual understanding, as 
well as competent job performance. Finally, the same 
scales were included in several Peace Corps projects 
and thus make cross-project comparisons ultimately 
possible. 


Predictor Data: Measures from the Training 
Staff 


The second set of available data came from the 
training staff. These data were collected independ- 
ently by the training staff at Harvard during the 
summer of 1961 and included the PCVs’ academic 
grades during training, faculty evaluations of the 
PCVs, peer ratings, interview ratings, and assignment 
to a final category by the assessment review board 
at the end of training. 

The index of academic performance during train- 
ing was the PCV’s grade record. Specifically, the 
datum for each PCV was his mean grade during 
training. 

The peer ratings, obtained near the end of the 
training period, required each PCV to list the seven 
persons from the group with whom he ideally would 
most like to share his Nigerian school assignment. 
More specifically, each PCV indicated the one person 
(A) with whom he would ideally like to be assigned 
as a school colleague in Nigeria; then the PCV was 
asked to list two additional others (B); followed 
by four additional others (C). Crude “weighted 
peer ratings” were computed in which each A nomi- 
nation was assigned a value of 3, each B nomination 
a value of 2, and each C nomination a value of 1. 
For each PCV the sum of these scores was computed. 
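
The weighting scheme above can be sketched as a simple tally. The weights (3, 2, 1) and the number of nominations per tier (1, 2, 4) come from the text; the ballot format, the function name, and the three shortened example ballots are hypothetical.

```python
from collections import Counter

WEIGHTS = {"A": 3, "B": 2, "C": 1}  # values stated in the text
SLOTS = {"A": 1, "B": 2, "C": 4}    # nominations allowed per tier


def weighted_peer_ratings(ballots):
    """ballots: {rater: {"A": [names], "B": [names], "C": [names]}}.

    Returns a Counter mapping each nominated PCV to the sum of the
    weights of all nominations received.
    """
    totals = Counter()
    for rater, tiers in ballots.items():
        for tier, names in tiers.items():
            if len(names) > SLOTS[tier]:
                raise ValueError(f"{rater} listed too many {tier} nominations")
            for name in names:
                totals[name] += WEIGHTS[tier]
    return totals


# Hypothetical ballots from three volunteers (lists shortened for brevity).
ballots = {
    "pcv1": {"A": ["pcv2"], "B": ["pcv3"], "C": []},
    "pcv2": {"A": ["pcv3"], "B": ["pcv1"], "C": []},
    "pcv3": {"A": ["pcv2"], "B": [], "C": ["pcv1"]},
}
scores = weighted_peer_ratings(ballots)
```

A PCV who receives no nominations simply has a score of zero, which is how a Counter reports a missing key.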

There were 10 faculty judges, but not each 
faculty person rated each PCV. Only faculty members 
of the training staff who felt that they knew 
the PCV sufficiently made these ratings and an 
average of six faculty ratings was obtained for each 
PCV. Each rater assigned a score on a global rating 
scale from 1 to 5 (ranging from “outstanding” to 
“must not go”) and the data are based on the mean 
rating for each PCV. 

Each PCV was individually interviewed for about 
30 minutes. The interview was essentially unstructured 
and “informal,” “open-ended,” and ranged 


scores on these three subtests and total criterion 
so low (r = .002) that no subtler analyses 
. This finding must be interpreted 


cautiously since the administration of the WAIS was 
over many topics including “motivation” for joining 
the Peace Corps. It was conducted by an educator 
with counseling experience who served in a counsel- 
ing and liaison role and who remained essentially 
“in residence” with the PCVs during the training 
program at Harvard. The interviewer recorded his 
impressions on a 5-point rating scale, reflecting his 
judgment of the PCV’s probability for successful 
teaching performance in Nigeria. These ratings appear 
to have been based on a global impression and 
were probably influenced by observations outside 
the interview as well as by the content of the 
interview itself. 

At the end of training each PCV was recom- 
mended for one of five categories by an assessment 
board subcommittee, consisting of three staff persons, 
all of whom had occasion to observe the PCVs 
during training. These recommendations were made 
by the group on the basis of their discussion and 
review of the data obtained during training (faculty 
evaluations, academic records, peer ratings, and 
interview). The categories were: I, outstanding 
(N = 4); II, very good (N = 17); III, satisfactory 
(N = 11); IV, some doubt (N = 7); V, not recommended 
(N = 2). These recommendations were discussed 
in detail by a larger final assessment review 
board, representing a variety of disciplines and 
coming with diverse assessment experiences. The final 
review board discussed each PCV individually and 
had available a descriptive summary prepared by 
the subcommittee for each PCV. The subcommittee 
recommendations were accepted, in essence, except 
that two PCVs from Category IV were moved into 
Category V. Thus, of the 41 candidates, 4 were 
asked not to continue at the end of training. Two 
kinds of data analyses were computed for these cate- 
gory assignments: correlations between the category 
assignment and the criterion scores, and chi-square 
tests comparing combined Category I and II PCVs 
with Category III and IV PCVs on performance 
above or below the criterion median. 

The training staff had no knowledge of the PCV’s 
scores on the objectively scored tests administered 
by the writer. Thus, results from the measures 
described above under Predictor Data: Self-Reports 
did not affect selection decisions, whereas the data 
obtained from the techniques described under Pre- 
dictor Data: Measures from the Training Staff did 
affect such decisions at the end of training. 

There may have been important differences in the 
PCVs’ test-taking “set” for the measures described 
above in the self-report data as opposed to those 
described in the training staff data. The measures 
under the former were administered by an examiner 
unfamiliar to the PCVs and with the implication 
that the results would not affect decisions about the 
candidates. In contrast, the measures under the 
latter were administered by persons associated with 
the training program and with the implication that 
the outcomes could affect decisions about the candi- 
dates. Hence, it is to be expected that responses 
to the latter measures might be much more subject 
to defensive distortions, attempts to “fake good,” etc. 
Predictions 


It was predicted that strong “authoritarianism” as 
measured by high scores on the F Scale, and low 
“ego strength” as measured by low scores on the 
Es scale, would be indices of poor performance in 
Nigeria. That is, it was expected that the strong 
presence of “authoritarianism” associated with high 
F Scale scores and the “low ego strength” associated 
with poor Es scores would impede the PCV’s effectiveness 
in a very different, foreign environment, 
under strenuous personal and interpersonal circum- 
stances. With respect to the MA scale, it was antici- 
pated that extremely high scores would be an index 
of debilitating “anxiety” and that PCVs with such 
extreme scores would not function as effectively as 
PCVs with more moderate scores. 

Especially since this was a prescreened group, 
strong correlations between the predictors and cri- 
terion performance were not expected. Rather, it 
was anticipated, and explicitly predicted, that the 
greatest value of these self-report scales would be 
for the prediction of relatively poor performance 
on the part of PCVs with relatively extreme “poor” 
scores on the self-report measures. The a priori 
cutoff point arbitrarily selected for predicting poor 
performance was the “poorest” 25% of the distribu- 
tion of scores on each scale. Thus it was predicted 
that PCVs with scores in the highest 25% on the 
F or MA scales, or in the lowest 25% of the dis- 
tribution on the Es scale, would be the “doubtful” 
candidates. These predictions were formally recorded 
in September 1961, and were not made available 
to the training or field staff. 

As regards the measures collected by the training 
staff, it was anticipated that performance during 
the training periods, as measured by grades and 
peer ratings, would relate positively to performance 
in Nigeria. In view of the fact that the unstructured 
or “global” interview has had a persistent history 
of unsatisfactory predictive validity in previous assessment 
projects, no positive relationship was anticipated. 
Similarly, the global category assignments by 
the assessment board and the global ratings of 
faculty judges were not expected to yield useful 
predictions. 


Criterion Ratings 


Ratings of the relative success of each PCV were 
obtained in April 1962. These are the criterion ratings, 
which were made independently by the field staff in 
Nigeria and without any knowledge of the predictor 
data collected earlier. The criterion ratings were 
made on six dimensions, each of which was described 
in considerable detail by the field staff. The dimensions 
were: teaching effectiveness; person-to-person 
contacts (interpersonal behavior with Nigerians); 
appreciation of Nigerian culture; representation of 
American culture by (a) personal behavior and (b) 
by interpreting American culture to Nigerians; and 
adjustment to the assignment. Teaching effectiveness 
was given extra weight and rated on a 50-point 
scale, whereas the other five ratings were made on 
10-point scales. The total criterion points (sum of 
all ratings) ranged from 54 to 87, with numerous 
tied scores. The ratings were submitted by the field 
staff supervisor and represent his collation and pool- 
ing of judgments as described below. The ratings of 
different judges were not individually reported and 
therefore the reliability of the criterion ratings 
cannot be evaluated. In effect, the criterion data 
must be treated as the ratings of a single judge, 
that is, the field supervisor. 
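
The composition of the total criterion score can be sketched as follows. The 50-point and 10-point scale lengths come from the text; the lower bound of zero on each scale, the function name, and the example ratings are assumptions made only for illustration.

```python
TEACHING_MAX = 50   # teaching effectiveness, 50-point scale (from the text)
OTHER_MAX = 10      # each of the remaining five dimensions (from the text)
N_OTHER = 5

MAX_TOTAL = TEACHING_MAX + N_OTHER * OTHER_MAX


def total_criterion_points(teaching, others):
    """Sum one teaching rating and five 10-point ratings.

    Assumes every scale starts at 0, which the source does not state.
    """
    if not 0 <= teaching <= TEACHING_MAX:
        raise ValueError("teaching rating out of range")
    if len(others) != N_OTHER:
        raise ValueError("expected five non-teaching ratings")
    if any(not 0 <= x <= OTHER_MAX for x in others):
        raise ValueError("non-teaching rating out of range")
    return teaching + sum(others)


# Hypothetical volunteer; these ratings are not from the study.
example_total = total_criterion_points(40, [8, 7, 9, 6, 8])
teaching_share = TEACHING_MAX / MAX_TOTAL
```

Under these assumptions the maximum possible total is 100, so teaching effectiveness accounts for half of the attainable points.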

The following quotation, excerpted from the May 
31, 1962, interim report by the field supervisor for 
the Harvard-Ibadan Peace Corps Training project 
highlights the serious limitations of the ratings. 
According to this report, the volunteers were rated: 


on the various elements of the desired PCV be- 
havior... . It does not purport to be accurate. 
The ratings are in many instances based on very 
scant evidence and in some cases amount to 
nothing more than guesses. Much of what the 
volunteers have been doing since arriving at their 
posts has gone unobserved by any Peace Corps or 
training staff, and the judgments recorded here 
have been constructed out of the fragments of 
evidence we do have and the second-hand opinions 
of school principals and others who have had 
more opportunity to observe the volunteers on the 
job. [Nevertheless:] If . . . [the poorest scores] 
could have been predicted ahead of time, it would 
have been better not to send these volunteers. 


RESULTS 


Correlations between the six criterion subscales 
and the total criterion scores ranged from T 
(person-to-person contacts) to .85 (teaching ef- 
fectiveness and adjustment to assignment). In 
view of these reasonably high correlations, as well 
as the global nature of the criterion ratings them- 
selves, all data analyses are based on the total 
criterion scores. 

Table 1 presents the intercorrelations between 
the predictors for the total sample of PCVs for 
whom predictor data were available (N = 41). 
Note that the correlations between each self-report 
measure and all other measures reach statistical 
significance in only one instance, namely, 
the negative relationship (r = -.58) between Es 
and MA. Scores on the self-report measures are 
not related to scores on the measures collected 
by the training staff. In contrast, note with 
respect to the latter data that all the intercorre- 
lations with only two exceptions are statistically 
significant, although there is a wide range of 
magnitude. The only nonsignificant correlations 
are between academic record during training and 
the interview and between the former and peer 
ratings. 

The highest correlations are between assess- 
ment-board categories and faculty ratings 
(r=.82) and between assessment-board cate- 
gories and the interview (r=.72). Since the 
assessment-board category ratings were made at 
the end of training, with all data from the 
training-staff predictors available, it is of interest 
to see that when the assessors made their final 
ratings they were apparently most influenced by 
the results from the faculty evaluations and the 
interview. This is consistent with the frequently 
noted tendencies of judges to rely most heavily 
on data from other judges who are most closely 
their peers (rather than the ratings of the PCV 
peers or on the academic grades of the PCVs 
during training). The negative consequences of 
this will become apparent when the correlations 
between the predictors and the criterion are 
examined. In other words, the final global 
decisions made about PCVs at the end of training 
were most strongly related to faculty ratings and 
the interview (although the correlations with 
peer ratings and academic grades were also sig- 
nificant). The correlations between the self- 
reports and data collected by the training staff 
in no case approach statistical significance. In 
sum, there is no significant relationship between 
scores on the self-reports and the measures ob- 


TABLE 1 
INTERCORRELATIONS BETWEEN PREDICTORS (N = 41) 


Self-reports e Facul 
ee cit Faculty | Interview 
F Es MA 
Es 
HA 58" 
cademic grades 09 —.03 
oE 12 2n he _35* 
he ee a 48 25 35 | .o1* 
a view .14 : "55 34* “ga** Dad 
Assessment board 04 22 3 G 
TABLE 2 
CORRELATIONS BETWEEN PREDICTORS AND TOTAL CRITERION POINTS (N = 35) 

                                        Predictors 

                 Self-reports          Academic    Peer                             Assessment 
              F       Es      MA       grades     ratings    Faculty   Interview      board 

Criterion   -.45**   .34*   -.34*       .24        .29        .27        .13           .20 

* p < .05. 
** p < .01. 


tained by the training staff, although there are 
reasonably high correlations between most of 
the latter measures. 

Table 2 shows the correlations between predictors 
and criterion ratings. The correlations are 
based on the 35 PCVs in Nigeria at the time 
of the criterion ratings.³ Inspection of Table 2 
reveals that scores on each of the three self-report 
measures are significantly related in the 
expected direction to criterion ratings.⁴ In contrast, 
for the data gathered by the training staff, 
none of the correlations reaches significance. The 
lowest correlation is for the interview; the highest 
correlations are for peer ratings (r = .29), 
faculty ratings (r = .27), and academic grades 
(r = .24), but all of these fall short of acceptable 
significance levels, with r = .33 required for 
p < .05.⁵ 
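
The threshold of r = .33 quoted above follows from the usual t test of a correlation with N - 2 degrees of freedom. The sketch below assumes a two-tailed test and takes the critical t for 33 df (2.0345) from standard tables rather than computing it; the function names and the dictionary layout are illustrative, and the correlations are those reported in Table 2.

```python
import math

N = 35
DF = N - 2
T_CRIT_05 = 2.0345  # assumption: two-tailed .05 critical t for 33 df, from tables


def t_from_r(r, df=DF):
    """t statistic for testing a Pearson correlation against zero."""
    return r * math.sqrt(df / (1.0 - r * r))


def r_from_t(t, df=DF):
    """Correlation that corresponds to a given t with df degrees of freedom."""
    return t / math.sqrt(t * t + df)


critical_r = r_from_t(T_CRIT_05)

# Correlations with the total criterion score, as given in Table 2.
table2 = {
    "F": -0.45, "Es": 0.34, "MA": -0.34,
    "Academic grades": 0.24, "Peer ratings": 0.29,
    "Faculty ratings": 0.27, "Interview": 0.13, "Assessment board": 0.20,
}
significant = {name for name, r in table2.items()
               if abs(t_from_r(r)) > T_CRIT_05}
```

On these assumptions only the three self-report scales clear the threshold, which agrees with the pattern described in the text.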


3 All tests of the relationships between predictors 
and the criterion were based only on the 35 PCVs 
remaining in the field at the time of the criterion 
rating. Since decisions about the 4 PCVs screened 
out at the end of training were based on some of 
the predictor measures these cases cannot be in- 
cluded in the analyses dealing with the interrelation- 
ships between predictors and outcomes. Of the 2 
PCVs who were “dropouts” in Nigeria, 1 scored 
in the poorest quartile of the F Scale and the other 
in the poorest quartile of the Es scale. 

4 Inspection of the relationships between predictors 
and criterion ratings for males and females sepa- 
rately revealed no strong sex differences and the data 
for males and females were combined in all statistical 
tests. 

5Since the assessment-board ratings resulted in 
categories varying markedly in size of N, a con- 
tingency table was tabulated in addition to the 
Pearson correlation between categories and criterion 
ratings. This consisted of comparing the outcomes 
(above or below the criterion median) for PCVs 
with “high” as opposed to “low” assessment-board 
ratings, with these ratings dichotomized as closely 
as possible into two groups of approximately equal 
size. Of the PCVs with “high” assessment-board rat- 
ings, 11 were above the criterion and 9 were below 
it; of the PCVs with “low” ratings, 7 were above and 
7 were below the criterion median. 
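
The contingency comparison in this footnote can be sketched with the ordinary Pearson chi-square for a 2 x 2 table. The version below applies no continuity correction, which is an assumption, since the source does not say which form was used. The four counts are entered exactly as printed in the footnote; the function name is illustrative.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom


# Rows: "high" and "low" assessment-board ratings.
# Columns: above and below the criterion median.
chi2 = chi_square_2x2(11, 9, 7, 7)

CRITICAL_05 = 3.841  # chi-square critical value for 1 df at the .05 level
is_significant = chi2 > CRITICAL_05
```

The statistic is far below the critical value, so the dichotomized assessment-board ratings do not separate the two criterion groups.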


Examination of the “hits” and “misses” obtained 
if cutoff points on the predictor variables 
are used to make screening decisions provides an 
alternative analysis which is more interesting 
than the correlation patterns. Recall that with 
respect to the self-report measures it was pre- 
dicted that a cutoff point, delineating the poorest 
25% of the distribution, would provide an 
effective index of poor or doubtful criterion 
performance. To differentiate PCVs in terms of 
good or adequate as opposed to poor or doubtful 
performance the distribution of criterion points 
was dichotomized at the median. This crude dichotomy 
of criterion performance seems justified 
in view of the grossness of the criterion measure 
and provides at least a rough estimate of relatively 
good as opposed to relatively poorer 
performance in the field. 

Table 3 presents the outcomes if cutoff points 
are used to delineate the poorest 25%, 33%, and 
50% of the score distribution on each self- 
report (that is, high scores on F and MA, low 
scores on Es). The table shows the number of 
PCVs who would have been “erroneously 
screened out” (that is, performance above the 
criterion median) and the number “accurately 


TABLE 3 
N ERRONEOUSLY AND ACCURATELY SCREENED OUT IF 
CUTOFF POINTS ARE USED TO SCREEN OUT 
PCVs WITH “POOR” SCORES ON EACH 
SELF-REPORT MEASURE 

                                            Self-reports 
Outcome                       Cutoff a      F     Es    MA 

N erroneously screened out      25%         1      1     1 
(above criterion median,        33%         1      3     6 
N = 18)                         50%         2      7 
N accurately screened out       25%         8      7 
(below criterion median,        33%        10      8    10 
N = 17)                         50%        14 

a “Poorest” 25%, 33%, and 50% of scores on each self-report 
measure. 

TABLE 4 
CRITERION PERFORMANCE OF PCVs WITH “POOR” OR “GOOD” SCORES ON TWO 
AND ON ALL THREE SELF-REPORT MEASURES 

                                         Self-report scores 

                          F and Es      F and MA      Es and MA     F, Es, and MA 
                         Poor  Good    Poor  Good    Poor  Good     Poor  Good 

Above median (N = 18)      0     8       0     7       4     7        0     6 
Below median (N = 17)      9     1       7     1       8     4        6     0 

Note. “Poor” scores are scores above the median on F or MA and below the median on Es; “good” scores are scores below the 
median on F or MA and above the median on Es. 


screened out” (performance below the criterion 
median) for the closest approximation of each 
of these cutoffs. 

Table 3 clarifies two points. First, note that 
for the 25% cutoff, the three self-report measures, 
and especially the F Scale, seem to make useful 
discriminations. With this cutoff the F Scale 
would have screened out accurately eight rela- 
tively poor PCVs. Only one PCV above the 
criterion median would have been screened out 
erroneously. With the Es and MA scale similar 
results would have been obtained, although 
slightly less clearly. 

Second, obviously as the cutoff point is changed
to include an increasingly large percentage of
the sample, the number of PCVs erroneously 
screened out increases greatly. The notable excep- 
tion is the F Scale, on which a cutoff of even 
50% would have resulted in only 2 PCVs being 
erroneously screened out, whereas 14 of the 17 
persons below the criterion median would have 
been identified accurately. 
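
The cutoff analysis described above can be illustrated with a short computational sketch. The Python function below is not part of the original study: the function name, its arguments, and the toy data in the usage note are hypothetical, and ties at the cutoff or at the criterion median are resolved arbitrarily by sort order, an assumption the report does not specify.

```python
def screen_counts(scores, criterion, cutoff_fraction, poor_is_high=True):
    """Count volunteers screened out by a predictor cutoff.

    scores          -- predictor scores, one per volunteer
    criterion       -- criterion points, same order as scores
    cutoff_fraction -- share of the sample to screen out (e.g., 0.25)
    poor_is_high    -- True when high predictor scores are "poor"
                       (as for F and MA), False when low scores are
                       "poor" (as for Es)

    Returns (erroneous, accurate): the number screened out who fall
    above and below the criterion median, respectively.
    """
    n = len(scores)
    k = round(n * cutoff_fraction)

    # Volunteers with the k poorest predictor scores are screened out.
    by_predictor = sorted(range(n), key=lambda i: scores[i],
                          reverse=poor_is_high)
    screened = by_predictor[:k]

    # Median split of the criterion; with odd n the lower half is the
    # smaller group (assumed here, matching 17 below and 18 above for n = 35).
    by_criterion = sorted(range(n), key=lambda i: criterion[i])
    below_median = set(by_criterion[: n // 2])

    accurate = sum(1 for i in screened if i in below_median)
    erroneous = k - accurate
    return erroneous, accurate
```

With eight hypothetical volunteers, `screen_counts([10, 9, 8, 7, 6, 5, 4, 3], [1, 2, 5, 3, 6, 7, 8, 4], 0.25)` screens out the two highest scorers, both of whom fall below the criterion median. Widening the cutoff to 0.5 adds one erroneous rejection, which is the pattern the second point above describes.
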

All the other predictor variables were examined
in a similar way. The number of PCVs who
would have been accurately screened out never
exceeded twice the number erroneously screened
out and in most instances the number erroneously
screened out equaled or approached the number
accurately screened out.

Outcomes were also examined for PCVs who
had poor scores on two or on all three self-report
measures. The distribution on each self-report
measure was dichotomized into scores above and
below the median. “Poor” scores were defined as
scores above the median on F or MA and below
the median on Es. Table 4 presents outcomes
(performance above or below the criterion
median) for PCVs with poor scores on any two
self-report measures and on all three self-reports.
The main advantage of using such combinations
of poor scores is that the number of PCVs erroneously
screened out is reduced to zero for
all combinations in which the F Scale is included.

Post hoc, it is of interest that delineating 
PCVs who attained “good” scores on two or on 
all three self-report measures would also have 
had some usefulness for selecting PCVs into the 
project. Good scores were defined as scores 
below the median on F or MA and above the 
median on Es. Table 4 includes the outcomes for 
PCVs with good scores on two or on all three 
self-reports. Self-report combinations which in- 
clude the F Scale would have been useful but 
the combination of Es and MA would not have 
been valuable. Further, it would not have been
very fruitful to delineate PCVs with good scores
on each self-report measure separately. For example,
on the F Scale, the quartile of PCVs with
the best (that is, lowest) scores contained eight
volunteers and of these, six were above the
criterion median but two were below the median,
and this was the best yield obtained from the
use of extreme good scores on any single self-report.


Discussion 


The overall results are consistent with the 
findings of other assessment projects in which 
both the assessees and the criterion were drasti- 
cally different (for example, posthospital ad- 
justment of schizophrenics). For example, many 
of the conclusions reached by Marks, Stauffacher, 
and Lyle (1963), in a recent effort to predict 
outcomes in schizophrenia, fit the present study. 
These authors conclude that “the best predictors 
of year-end adjustment are those which can be 
most expeditiously obtained [p. 126].” In their 
study, as in the present report, one of the best 
predictors was the “old ‘impure’ F” Scale. 

The failure of “global” techniques, specifically 
the interview and global ratings by staff judges 
and the assessment board, is noteworthy and 
stands in contrast to the relative utility of simple 
self-report techniques. Again, this is consistent 
with past assessment experiences (Cronbach,
1956). In general, the results support the usefulness
of simple self-report inventories and point,
once more, to the dangers of relying on global
ratings (the unstructured interviews and staff 
judges) and on nonquantitative interpretations 
of multiple data (the assessment-board ratings). 
These latter techniques are usually the most 
costly, time-consuming efforts in the assessment 
program. Although they seem compelling and 
valid to those who use them, the evidence sup- 
porting their utility remains minimal. As Table 1 
showed, the assessment-board ratings, which determined
the decisions about volunteers at the
end of training, were most strongly related to 
the faculty ratings and the interview—and the 
interview turned out to be the measure with the 
lowest predictive validity. Staff judges (faculty 
ratings, interview, assessment boards) showed con- 
siderable agreement with each other—but their 
judgments were not significantly related to 
criterion performance. 

The correlations between criterion ratings and 
academic grades, and ratings by peers, were also 
not significant, although the magnitude of cor- 
relations tended to be somewhat higher than 
those obtained from the interview and the assess- 
ment-board categories. It should be recognized 
that the peer-rating technique employed was 
extremely gross, and that at the time of the peer 
ratings the PCVs had very little knowledge about 
the field conditions which awaited them abroad. 
With respect to grades during training it is quite 
possible that the behaviors which determined 
these grades had little relevance for actual per- 
formance abroad. Indeed, informal comments 
from the field indicated that in many instances 
the content of the academic training program 
and the actual demands and conditions of the 
field experience were at best remotely related. 

Caution must also be applied when interpreting 
the results from the self-report measures. In 
spite of “significant correlations,” their utility 
for assessment decisions remains dubious. How- 
ever, these correlations are more impressive in 
view of the fact that the sample of PCVs was 
prescreened and consequently there was a rela- 
tively narrow range of scores on each self-re- 
port measure. For example, on the F Scale, on 
which the highest score could be 168, actual 
scores ranged from 25 to 82. 

“Screening-out” decisions, based on extremely 
poor self-report scores (for example, the poorest 
quartile), especially with the F Scale, would have 
reduced appreciably the number of poor candidates
sent abroad. But obviously the total “costs”
of such decisions are complicated by many considerations.

Replication of the present findings in other 
Peace Corps projects would have increased faith 
in the results of the present study. However, 
findings for another project in Ghana already 
show a failure to replicate (Smith & Roth; see footnote).
Indeed, although many of the predictor measures
used in Ghana were similar to those in the present
study, none were significantly related to criterion
ratings. Although it is disappointing that the
positive results obtained from self-reports were 
not replicated, this may have been due to diverse 
reasons, including differences between the present 
sample and the Ghana sample with respect to the 
PCVs selected, the job requirements and field 
conditions, the determinants of criterion ratings, 
and the PCVs’ test-taking “set.” 

A possible difference in the test-taking “set” of 
PCVs on the self-reports in the two samples may 
be of special importance. Recall that in the 
present project the self-reports were administered 
explicitly as part of a research project with the 
promise that results would not determine de- 
cisions about individual PCVs. The set may have 
minimized distortions in test responses and en- 
hanced the obtained relationships with criterion 
ratings. Current research by the writer on “self-prediction”
under conditions in which accurate
self-reports do not lead to negative consequences
for the respondent suggests that self-predictions
can provide remarkably accurate outcome indices
(Mischel, 1963). For example, when the
data were collected under a “for research only”
set, correlations between self-predicted course
grades and obtained grades exceeded .60 for a
sample of 90 undergraduates. Under appropriate
test-taking sets self-report measures and self-predictions
may be valuable techniques for helping
candidates in their own “self-selection”
process. Obviously this could only be effective
if the relevant criteria are described to the respondent
and if he is presented with sufficient
alternatives so that an accurate response does
not result in detrimental consequences for him.
For example, in connection with the Peace Corps,
self-reports as a basis for self-selection might
be an effective technique if alternative assignments,
requiring different attributes, are offered and
described in terms which have equivalent social
desirability.

Although many psychologists are privately convinced
that “if you want to know how a person
is going to behave, ask him,” research into the
conditions under which accurate self-prediction is
and is not possible remains minimal. Particularly
in view of the persistent difficulties encountered
when “expert judges” attempt to predict the
behavior of others, a renewed exploration of
the self-report seems worthwhile, not merely to
obtain the subject’s self-descriptions but also to
obtain his explicit outcome predictions for
specific contingencies.

“Ghana Project,” progress report to the Institute
of Human Development, University of California,
Berkeley, April 1963. (Ditto)
SOME VERBAL ASPECTS OF PRIMARY-PROCESS THOUGHT:
A PARTIAL REPLICATION 1


STANLEY BLUMBERG 
University of Texas Southwestern Medical School 


AND 


DONALD W. GILLER 2
Terrell State Hospital, Texas 


This study compared the conceptual performance of chronic schizophrenics
with acute and nonschizophrenic psychiatric patients on a word task requiring
the selection of synonyms. All groups showed evidence of distractibility
or primary-process thinking by choosing antonyms and homonyms on some
of the items. It was concluded that such cognitive errors are not unique to
schizophrenia, but occur in all groups to an extent dependent upon severity
of illness.


The present study is an approximate replication
of an experiment demonstrating some verbal
aspects of primary-process thinking in schizophrenia
(Burstein, 1961). Among the attributes
of primary-process thought Freud (1956) described
the confusion of opposites with identities
and similar word sounds with similar meanings.
Based on these descriptions, Burstein constructed
two multiple-choice word tasks to put Freud’s
formulation to an empirical test. Using a group
of long-term hospitalized schizophrenics and a
group of normals, Burstein found that, as predicted,
the schizophrenics more often did confuse
opposites with identities, selecting antonyms as
synonyms, and similar word sounds with similar
meanings, choosing homonyms for synonyms.

1 We are grateful to A. G. Burstein for his generosity
in permitting us to use his antonym and
synonym word lists.
2 Now at the Dallas State Adult Mental Health …

Burstein also found a significant difference in 
mean IQ scores between the two groups, in spite 
of their having been matched for age, education, 
and estimated “premorbid intelligence.” He in- 
terpreted these results in terms of the schizo- 
phrenic pathology, that is, with increasing 
pathology there is increased primary-process 
thinking and intellectual deficit. Evidence for 
this hypothesis was the significant correlation be- 
tween scores on the word tasks and the intelligence 
measures for the schizophrenic group, with no 
significant correlation for the normal group. 

One question unanswered in Burstein’s study 
is whether this specific error tendency of con- 
fusing antonyms and homonyms with synonyms 
is uniquely schizophrenic or is common to psy- 
chopathology in general. Chapman (1958) found 
that normals make the same kind of associative
errors on conceptual tasks that schizophrenics
make; however, Burstein’s normals did not. He
suggested that a possible reason for the normal
subjects’ avoidance of the antonym and homonym
alternatives was their obvious characteristics.

This study was undertaken to replicate Bur- 
stein’s results on a different sample of chronic 
schizophrenics, and to extend the generality of 
his findings to two other pathological groups, 
acute schizophrenics and a nonschizophrenic
group of hospitalized psychiatric patients. The
hypothesis was that errors on this paper-and-pencil
task, presumably indicative of primary-process
thinking, would not be limited to chronic
schizophrenics, but would also be found to a 
decreasing extent in acute schizophrenics and in 
nonschizophrenic psychiatric patients. 


METHOD 
Subjects 


Three groups of hospitalized psychiatric patients, 
each group consisting of 20 males, were tested. All 
subjects were Caucasians whose native tongue was 
English and who gave no evidence in the medical 
records of central nervous system pathology or 
mental retardation. A chronic schizophrenic group 
from the Terrell State Hospital was chosen on the 
basis of the same duration of hospitalization as 
Burstein’s chronic group. Each of these patients had 
been hospitalized continuously for at least 5 years 
with a diagnosis of schizophrenia or dementia 
praecox. None of these chronic schizophrenic pa- 
tients had received shock therapy within the 
previous 6 months, but all of them were currently 
on drugs, the majority on chlorpromazine, others 
on a variety of different tranquilizers. An attempt 
was made to select a group that would not differ 
substantially in age or education from Burstein’s 
chronic group. 

A group of acute schizophrenics was selected on 
the basis of having been hospitalized within the 
previous 6 months, the majority having been 
hospitalized within 1 month of testing. Although 
this group could not be matched on age with the
chronic group because of the impossibility of finding
acute subjects of such advanced chronological age,
their similarity in education and IQ was believed to
be a more relevant and sufficient control of possibly
important variables.

The nonschizophrenic psychiatric group was 
matched with the acute schizophrenic on age, educa- 
tion, and duration of hospitalization. The diagnoses 
of patients in this group were 11 depressive reactions,
1 acute situational reaction, 1 involutional
psychotic reaction, 3 adolescent adjustment reactions,
1 passive aggressive personality, 2 dyssocial reactions,
and 1 anxiety reaction.

The same two measures of intelligence used by 
Burstein were computed for the three groups of 
patients. Present functioning intelligence was estimated
on the basis of the IQ obtained from the
Comprehension-Vocabulary-Similarities (CVS) sub- 
tests of the Wechsler-Bellevue Intelligence Scale 
(Hunt, Klebanoff, Mensh, & Williams, 1948). The 
CVS was administered individually. Although the 
groups might have been assumed to be of equal pre- 
morbid intelligence on the basis of equivalent educa- 
tions, an additional estimate of premorbid intel- 
ligence was made on the basis of the number of the 
highest vocabulary item passed. 

Data with respect to age, education, IQ, and 
highest vocabulary for the three groups of this 
study and the two groups in Burstein’s study are 
presented in Table 1. By two-tailed t tests none of
the differences in education or highest vocabulary 
among our three groups or between our chronic and 
Burstein’s chronic group reach significance at the .05 
level. On this basis the groups are presumed equal 
in premorbid intelligence. Burstein’s chronic group 
may be more “deteriorated” than ours since the mean 
IQs are 90.7 and 98.9, respectively, but this is not 
testable. The difference in age between the two 
chronic groups just reaches significance at the .05 
level. The age difference between our acute schizophrenics
and nonschizophrenic patients is not significant
(t = .78). The chronic group is significantly
older (p < .01) than both the acute and nonschizophrenic
psychiatric groups. On IQ scores, the chronic
and acute groups do not differ, nor do the nonschizophrenic
patients and the acutes; however, the difference
in IQ means between the nonschizophrenic
patients and the chronics does reach significance
at the .05 level (t = 2.37). This disparity in functioning
intelligence, with the chronic group lower,
parallels Burstein’s finding, which he attributed to
the schizophrenic pathology.


TABLE 1
PATIENT SAMPLE CHARACTERISTICS


Age Education IQ Highest vocabulary
Diagnosis

M SD M SD M SD M SD
Chronic schizophrenics | 47.7 6.1 99 | 25 9 77
Acute schizophrenics 353 76 | 116 | 29 | ores | i21 | 28 | 42
Nonschizophrenics $37 |. 163 | 113 | 25 | 1001 | 135 | 279 | 6.7
Burstein schizophrenics | 44 4 10.5 19 90:7 == | 261 TO
Burstein normals 44 4 106 | 47 | 1167 — | 233 | 46


* Data not reported.


Tasks


Burstein’s 97-item antonym list and 48-item homonym
list were administered to each subject, one-half
of the subjects in each group doing the antonym
task first. On each item the subject is instructed to
choose the synonym for a given word from among
three alternatives consisting of a synonym; a distracter,
which on one list is an antonym, and on
the other list a homonym; and an irrelevant alternative.
An example from each list follows:


Circle the word that means almost the same as BIG
LARGE SMALL INDIVIDUAL


Circle the word that means almost the same as SEA
OCEAN SEE IDOL


For each subject three error scores on each list were
computed: the number of distracters chosen (D), the
number of irrelevants (I), and a difference, or
distracter-minus-irrelevant, score (D − I). As Burstein
(1961) reported, the purpose of the irrelevant
alternative, or more properly, unidentified error
alternative, is to measure the number of distracter
markings made upon bases other than those anticipated
by the investigator, e.g., obscure communalities,
idiosyncratic associations, random markings, etc.
Since choices made upon such unanticipated bases
would by chance result equally often in D and I
markings, an error score of antonym or homonym
markings minus I markings would represent a relatively
pure measure of the distracting effect of the
antonyms or homonyms, that is, a measure of the
difficulty experienced by the subjects in distinguishing
synonyms from antonyms or homonyms. The
I and D words had been roughly equated for frequency
of occurrence in the language and each type
of alternative appeared equally often in each of
the three possible positions, with the sequence of
alternatives appearing randomly.


RESULTS


TABLE 2
ERROR SCORES

                                        Antonyms                                  Homonyms
                               D            I          D − I             D            I          D − I
Group                       M     SD     M     SD     M     SD        M     SD     M     SD     M     SD
Chronic schizophrenics     7.55  7.19   4.20  5.68   3.35  4.82      6.90 10.75   1.70  3.02   5.20 10.22
Acute schizophrenics       4.85  8.21   2.45  3.79   2.40  6.38      2.90  6.43    .65  1.31   2.25  5.85
Nonschizophrenics          2.30  3.21    .90  1.09   1.40  2.82       .4    .58    .8   1.08   −.4   1.17
Burstein schizophrenics    9.2  11.0    3.2   3.8    6.0   9.1       3.4   4.9    1.1   1.2    2.2   4.6
Burstein normals           1.3   1.7    1.1   1.3                                              −.3    .9


The means and standard deviations of the 
error scores are presented in Table 2. Although 
age was thought to be an irrelevant variable, as 
a check rank-order correlation coefficients were 
computed between age and each of the error 
scores for the nonschizophrenic psychiatric group, 
because the direction of error scores is consistent 
with an explanation in terms of age as an artifact.
None of these correlations, which ranged
from .02 to .13, were significant in spite of the
wide range of ages within the group. On this 
basis, the differences between groups are thought 
not to be an artifact of age. Because the distribu- 
tions of error scores were very skewed, the same 
nonparametric statistics used by Burstein were 
employed in this study. First, differences in 
error scores among the three groups considered 
simultaneously were tested by Kruskal-Wallis H 
tests. Comparisons of each type of error between 
groups of subjects taken two at a time were then 
made using two-tailed Mann-Whitney U tests. 
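
The two rank-based procedures named above can be sketched in a few lines of Python. The code below is only an illustration under stated assumptions, not the authors' computation: the function names are hypothetical, tied scores receive average ranks, the Kruskal-Wallis H is given without the correction for ties, and significance levels would still have to be read from tables of the chi-square and U distributions.

```python
def average_ranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H for several independent groups (no tie correction)."""
    pooled = [x for g in groups for x in g]
    ranks = average_ranks(pooled)
    n = len(pooled)
    total = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        total += rank_sum ** 2 / len(g)
        start += len(g)
    return 12.0 / (n * (n + 1)) * total - 3 * (n + 1)


def mann_whitney_u(x, y):
    """Smaller of the two Mann-Whitney U statistics for groups x and y."""
    ranks = average_ranks(list(x) + list(y))
    rank_sum_x = sum(ranks[:len(x)])
    u_x = rank_sum_x - len(x) * (len(x) + 1) / 2
    return min(u_x, len(x) * len(y) - u_x)
```

In the present design each call would receive the error scores (D, I, or D − I) of the 20 subjects in each group: all three groups at once for H, then two groups at a time for U.
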

As expected, the chronic schizophrenics had
higher D − I error scores on both tasks than
did the acute group, which in turn was higher
than the nonschizophrenic psychiatric group.
None of the differences in D, I, or D − I scores
between our chronic and acute schizophrenic
groups was significant, nor did the acute differ
from the nonschizophrenic patients. Comparisons
between the chronic and nonschizophrenic psychiatric
groups reveal significant differences on
the antonym D scores (p < .02) and I scores
(p < .05), and on the homonym D scores
(p < .02) and D − I scores (p < .002).

Comparisons of D versus I errors within each 
group of subjects were made using two-tailed 
Wilcoxon tests. The chronic schizophrenics 
showed a preference for the D error over the 
I choice on both the antonym and homonym 
task (p < .01). The preference of D over I did 
not reach significance for the acute schizophrenics
on either task, nor for the nonschizophrenic
patients on the homonyms. On the
antonym task, however, the nonschizophrenic 
psychiatric group showed a preference for the 
D error (p < .05), although 8 of the 20 subjects 
made no errors. 

Rank-order correlation coefficients, corrected
for ties, between both intelligence measures and
error scores were computed. Whereas Burstein
found significant negative correlations between
D − I scores and intelligence for the chronic
schizophrenics, but not for his normals, the correlations
for our chronic group between functioning
IQ and D − I (−.24 and −.38) do not reach
significance. On the other hand, the correlations
between IQ and D errors and between IQ and
I errors for the chronic group are significant
(r = −.68 for both error scores on antonyms,
and −.53 for D scores and −.44 for I scores on
homonyms). The correlation between IQ and
D − I (−.51) on the antonym task for our
nonschizophrenic psychiatric group is significant.
In general, there is a greater relationship between 
all types of errors and both indices of intelli- 
gence on the antonym task than on the homonym 
task. On antonyms, 9 of the 18 coefficients are 
above .45, one of them reaching .79. There 
appears to be a low negative correlation between 
intelligence, both functioning and “potential,” 
and both D and I errors, with the correlation 
highest in the chronic schizophrenics. 
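
A rank-order coefficient corrected for ties, of the kind reported above, can be obtained by assigning average ranks to tied scores and computing the product-moment correlation between the two sets of ranks. The Python sketch below illustrates that approach; it is an assumption about the method, since the report does not give the formula used, and the function names are hypothetical.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Rank-order correlation: Pearson r computed on average ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)
```

Here `x` would hold the IQ (or highest-vocabulary) scores of one group and `y` the corresponding error scores; a negative coefficient indicates that lower intelligence scores accompany more errors.
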


Discussion 


Further evidence was obtained confirming Bur- 
stein’s finding that chronic schizophrenics are dis- 
tractible in the predictable ways of confusing 
antonyms and homonyms with synonyms. The 
fact that our chronic group had a more nearly 
average IQ of 98.9, and were on various drugs, 
while Burstein’s patients were not, yet made 
significantly more D than I errors, adds weight 
to the result. The findings with respect to
whether this type of distractibility, or primary
process, is a more general phenomenon of psychiatric
patients, or is limited to long-term, hospitalized
schizophrenics are less clear, but tend
to favor the generality hypothesis. The fact that
none of the differences between the chronic and
acute groups, nor those between the acute and
nonschizophrenic patients, reached significance
suggests that this kind of cognitive dysfunction
exists on a continuum. Further, the fact that
the nonschizophrenic psychiatric group made
significantly more D than I errors on the antonym
task is further evidence that this type of
distractibility, or primary process, is not limited to
schizophrenic pathology. On the other hand, the
nonsignificant difference between D and I scores
for the acute schizophrenic group clouds the
issue. Burstein’s finding no preference for the D
over the I error in his group of normals is more
likely due to the obviousness of the correct
choices, rather than an equal potential for both
types of errors. Indirect evidence for this is
the very small number of either D or I errors
made by his normal group.

As for the relationship between intelligence,
primary process, and schizophrenia, the most
likely conclusion that can be drawn from the
results of this study is that functioning intelligence,
as measured by a verbal test such as
the CVS, is related to performance on a
test of primary-process thinking in both schizophrenic
and nonschizophrenic patients. The fact
that the IQ scores of the chronic schizophrenics
are negatively correlated with both I and D
errors on both tasks suggests that schizophrenic
conceptual deficit, as measured by tasks such
as those used in this study, results from both a
general loss of functioning intelligence and an
increased susceptibility to distracting stimuli, possibly
resulting from primary-process thinking.
However, there may have been a drug effect
operating which, at least on the antonym task,
could have resulted in fewer D errors and more
I errors in the chronic group as compared with
Burstein’s chronic schizophrenics, none of whom
was on drugs.
SEX DIFFERENCES AND EXPERIMENTAL CONDITIONS IN
SERIAL LEARNING *


IRWIN G. SARASON AND MORTON G. HARMATZ
University of Washington


This experiment dealt with the relationship of 3 independent variables to serial
learning. Male and female Ss differing in test-anxiety scores performed under
4 conditions. 3 of these conditions involved evaluative reports to Ss, by means
of light signals, of their performance at several points in learning. There were
no reports of level of performance under a control condition. The major
finding was a significant difference between the sexes under these conditions.
Under the control condition, females performed at a higher level than did males.
Just the opposite was the case for the 3 light conditions.


Research on motivational factors in verbal 
learning has largely been directed to the study 
of two classes of variables: individual differences 
in subjects’ personalities and experimentally 
created motivational conditions. Scores on per- 
sonality tests have typically been employed to 
assess individual differences (Sarason, 1960). 
Orienting instructions (such as ego-involving 
communications and failure reports) have been 
the most commonly used means of experimentally 
manipulating motivation (Lazarus, Deese, & 
Osler, 1952; Sarason, 1960). Available evidence 
suggests that the simultaneous study of both
individual difference variables and experimental
manipulation of subjects’ set is superior to the
study of either of them alone in accounting for
the performance of subjects (Sarason, 1960).
Thus, for example, it has been found that highly
motivating or achievement-orienting preliminary
instructions often have different effects on the
verbal learning of subjects differing in test
anxiety (Mandler & Sarason, 1952; Nicholson, 
1958; Sarason, 1957). The learning of low- 
anxious subjects appears to be facilitated by 
these instructions, while the speed of learning 
of high test-anxious subjects is slowed down 
by them. 

While instructions and failure reports have
been the most widely used methods of experimentally
approaching motivation in relation to
learning, they do not exhaust the methods available.
Both preperformance instructions and
failure reports have in common the characteristic
that they serve to orient the subject to
perform at as high a level as possible. These
cues to the subject have typically been given
either at the outset of the experimental session
(as in the case of preperformance instructions) 
or at some intermediate point in the course of 
learning (as in the case of failure reports ad- 
ministered between lists of verbal stimuli). 
Another approach, largely unexplored in the 
verbal learning literature, involves events occur- 
ring while the subject is performing the task 
assigned to him. However, reinforcement pro- 
cedures have been widely used in the study of 
operant conditioning and verbal behavior. A 
review of the verbal conditioning literature indi- 
cates that events such as reinforcements are 
effective in influencing the subjects’ emission of 
a variety of operant responses (Krasner, 1962). 
Similar effects have been obtained in studies of
the effects of the experimenter’s emission of
“right” and “wrong” on response repetition in
Thorndikian associative learning situations (Postman,
1962).

Two studies bear quite directly on the rela- 
tionship of events occurring during performance 
to subjects’ attained level of performance. In 
experiments conducted by Hetherington and Ross 
(1963) and by Sarason and Harmatz (1965)
subjects performed on verbal learning tasks. 
Subjects were assigned to “good” and “try 
harder” conditions. The former condition in- 
volved encouraging, supportive comments to the 
subject by the experimenter during learning. The 
latter involved comments by the experimenter 
designed to increase the subjects’ effortfulness 
in performance. The evidence provided by these 
studies indicates that multiple performance 
events such as reinforcements can significantly 
affect the subjects’ attained performance levels 
and suggests that future research might well 
benefit from attempts to manipulate the subjects’ 
motivation during their performance. 

The present experiment was designed to contribute
to knowledge of the effects of experimentally
controlled events during performance
on learning. In an effort to reduce the con- 
founding of characteristics associated with the 
experimenter and the administration of a success- 
failure condition, light signals, rather than verbal 
reports by the experimenter, were used to inform 
the subject about his performance. The experi- 
mental design encompassed three variables. One 
of these was that of reports of performance. 
Following one-third of the trials, through light 
signals, the subject was given a report concerning 
his level of performance. One-fourth of the sub- 
jects were informed that their performance level 
was above average; to one-fourth of the subjects 
it was indicated that their performance was aver- 
age; and one-fourth was informed that they were 
performing below average. The remaining sub- 
jects constituted a control group in that they 
were given no reports. 

Two organismic variables were incorporated into
this research. One of these was questionnaire- 
inferred test anxiety. The measure employed has 
been shown to relate to subjects’ performance 
on verbal learning and other tasks (Sarason, 
1960; Sarason & Ganzer, 1962). The second 
organismic variable was sex. This variable has, 
also, been found to relate to verbal learning 
under motivational conditions. The experimental 
design called for three levels of test anxiety 
(high, middle, and low scores), four experimental 
conditions (three light conditions and a control 
one), and sex of subject. 


METHOD 
Subjects 


The subjects were 60 male and 60 female
undergraduates enrolled in introductory psychology courses
at the University of Washington. Prior to and
independent of the experiment the students had been
administered the Test Anxiety Scale (TAS; Sarason
& Ganzer, 1962). High TAS scores ranged between
8 and 15. Middle TAS scores ranged between 3 and
7, while the low TAS group consisted of subjects
with scores between 0 and 2. The high, middle, and
low TAS groups corresponded to the upper 22%,
middle 54%, and lower 24% of the TAS score
distribution.


TABLE 1

MEAN NUMBER OF CORRECT RESPONSES FOR GROUPS OF
SUBJECTS CLASSIFIED IN TERMS OF
SEX AND CONDITIONS

                     Light conditions
Sex      Above average | Average | Below average | Control
Male        264.60     | 240.27  |    254.00     | 167.73
Female      248.87     | 226.07  |    228.80     | 278.93

Note.—N = 15 per group.


Procedure 


The task employed was a serial learning one
requiring the anticipation of dissyllable words. The
material to be learned consisted of 17 items of low
m value drawn from Noble’s list (Noble, 1952).
The mean m value of the list was 1.22 and the
range was .99-1.50. The dissyllables were presented
by means of a memory drum with an exposure time
of 2 seconds. Following Item 17 there were three
blank spaces. The list was presented for 35 trials
without interruption.

The subjects were met by the experimenter, a male
graduate student in psychology, and were seated
at a table facing a black wooden screen. The
memory drum was located in a space at the bottom
of the screen. Above the drum were located in clear
view of the subjects the three lights. At the left
was a green light with a template underneath which
said ABOVE AVERAGE. In the middle was a blue light
with a template reading AVERAGE. On the right, there
was a red light for the BELOW AVERAGE condition.
The subjects in the control group had the same
stimulus situation as the subjects in the three
light-condition groups, except that the lights were omitted.

All subjects were given necessary introductory 
instructions. In addition, the subjects in the three 
light-condition groups were told: 


Since everyone wants to know how they are
doing, I will indicate this to you.

As you can see on the panel in front of you, 
there are three lights. A green one marked above 
average, a blue one marked average, and a red 
one marked below average. One of these three 
lights will be turned on during the rest space 
at the end of every third trial informing you of 
how you are doing. 

For the subjects in the three light-condition
groups, a light went on at the end of every third
trial. Which of the three lights was turned on was
determined by the condition to which the subject
had been assigned. The light went on at the
beginning of the first of the three blank spaces which
constituted the intertrial interval, and went off at the
end of the second blank space.
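
The report schedule described above can be sketched in a few lines. This is only an illustration under stated assumptions: the names `report_trials` and `light_for` are hypothetical, and the sketch assumes that "every third trial" means Trials 3, 6, 9, and so on within the 35 trials. The light colors and condition labels are taken from the Procedure.

```python
N_TRIALS = 35        # the list was presented for 35 trials
REPORT_EVERY = 3     # a light went on at the end of every third trial

# Light color for each condition, as described in the Procedure.
# The control group received no reports.
LIGHTS = {
    "above average": "green",
    "average": "blue",
    "below average": "red",
    "control": None,
}


def report_trials(n_trials=N_TRIALS, every=REPORT_EVERY):
    """Return the trial numbers after which a report would be given."""
    return [t for t in range(1, n_trials + 1) if t % every == 0]


def light_for(condition, trial):
    """Return the light shown after `trial`, or None if no light is shown."""
    if trial not in report_trials():
        return None
    return LIGHTS[condition]


if __name__ == "__main__":
    print(report_trials())
    print(light_for("above average", 3))
```

Under this reading a subject in a light condition would see the same light after each of the reported trials, since the light depended only on the assigned condition and not on actual performance.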


RESULTS 


The results were analyzed in terms of a 3 X 2
X 4 X 35 analysis of variance design (TAS X Sex
X Conditions X Trials) with five subjects in
each of the experimental groups. The 35 trials
constituted a repeated measurements variable.

The results of the analysis performed on the
number of correct responses showed a significant
Sex X Conditions interaction (F = 4.7[?], df
= 3/96, p < .005). Table 1 presents means
relevant to this interaction. Under the control
condition in which no lights were employed, females
performed at a higher level than did males. Just
the opposite was the case for the three light 
conditions. In each instance, males performed 
at a higher level than did females. A breakdown 
of the Sex X Conditions interaction in terms of 
anxiety levels indicated that the nature of the 
interaction was essentially the same for high, 
middle, and low TAS groups. The trials effect 
was, of course, highly significant (F = 390.91, 
df = 34/3264, p < .005). There was a tendency 
towards significance for the Lights X Trials inter- 
action (p < .10), with a more steeply acceler- 
ated learning curve for the above average light 
condition than the other three conditions. The 
Sex X Conditions X Trials interaction was found
to be highly significant (F = 1.77, df = 102/3264,
p < .005). The learning curve for males showed
better performance over trials under the three 
light conditions than under the control condition. 
Just the opposite result obtained for females. 
This finding, together with the significant Sex
X Conditions effect, shows that the condition
(absence of feedback) which led to the best per- 
formance for females resulted in the poorest 
performance for males. Although it was not sta- 
tistically significant (p < .25), there was a tend- 
ency for the middle and low TAS groups to 
perform at a higher level (Ms = 241.78 and 
256.38, respectively) than the high TAS group 
(M = 219.77). A Duncan's test indicated that 
the low TAS group performed at a higher level 
than did the high TAS group (p < .05). 
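
The degrees of freedom reported above can be checked against the design. The following is a minimal sketch, assuming the standard bookkeeping for a mixed design with three between-subjects factors and one repeated-measurements factor; the function and variable names are illustrative and not part of the original report. The three-way interaction reported with df = 102/3264 corresponds to Sex X Conditions X Trials (1 x 3 x 34).

```python
from math import prod

# Between-subjects factors and their numbers of levels, from the design.
BETWEEN_LEVELS = {"TAS": 3, "Sex": 2, "Conditions": 4}
SUBJECTS_PER_CELL = 5
N_TRIALS = 35  # repeated-measurements factor

LEVELS = dict(BETWEEN_LEVELS, Trials=N_TRIALS)


def effect_df(factors, levels=LEVELS):
    """df for a main effect or interaction: product of (levels - 1)."""
    return prod(levels[f] - 1 for f in factors)


n_cells = prod(BETWEEN_LEVELS.values())          # 3 * 2 * 4 between cells
n_subjects = n_cells * SUBJECTS_PER_CELL         # total number of subjects
between_error_df = n_subjects - n_cells          # subjects within groups
within_error_df = between_error_df * (N_TRIALS - 1)  # Trials x subjects within groups

if __name__ == "__main__":
    print("Sex X Conditions:", effect_df(("Sex", "Conditions")), "/", between_error_df)
    print("Trials:", effect_df(("Trials",)), "/", within_error_df)
    print("Sex X Conditions X Trials:",
          effect_df(("Sex", "Conditions", "Trials")), "/", within_error_df)
```

The totals agree with the 120 subjects described under Method and with the error terms of 96 and 3264 given in the text.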

An analysis of variance was, also, performed 
on the number of anticipatory errors, i.e., errors 
in which subjects emitted terms on the list but 
in the wrong order. The difference between males 
and females was found to be significant (Ms
= 27.69 and 20.79, respectively, with p < .025).
The rise and, then, decline in anticipatory errors 
led to a significant Trials effect (p < .005). An 
analysis of variance performed on intrusive errors
(subject’s responding with words which were not
on the list used) yielded only one significant F,
that for Trials (p < .005). The trial means on
which this F was based followed a course similar 
to that mentioned for anticipatory errors. 

The final analysis of variance was done using 
as the dependent variable the number of occasions
on each trial on which the subject made
no response, correct or incorrect. The Sex X
Conditions interaction was found to be significant
(p < .01). This result was due to the performance
of male subjects under the neutral condition.
This group made more nonresponses (M
= 386.33) than did all other groups (overall
M = 316.54). The Trials effect was significant
(p < .005) and was due, as one would expect,
to the decline in nonresponses over trials. The
Sex X Conditions X Trials effect was found to be 
statistically significant (p<.005). This result 
was attributable to the slower decline in
nonresponses for male subjects under the control
condition than for all other groups.


DISCUSSION


The striking differences between males and 
females under the experimental conditions would 
appear to be the most provocative findings of 
this research. Two worthwhile problems for 
future investigation suggested by them relate to 
differences among the conditions and between 
the sexes. 

In general, it did not appear to matter very 
much under which report condition subjects per- 
formed. The decisive factor seemed to be, 
simply, whether or not any reports at all were 
given. On a priori grounds one might well have 
expected that the three meanings which were 
attached to the light conditions would differen- 
tially influence subjects’ performance. An impor- 
tant question would appear to be the nature 
of the subjects’ interpretations of the reports 
of performance which were given to them. 

Associated with this problem is the need to 
shed light on the basis for males’ and females’ 
different reactions to the light conditions. It 
would appear that, as it were, leaving female 
subjects alone during learning is relatively
facilitative, but that absence of feedback does not
lead to a high level of performance for males.
One possibility that comes to mind is that
females, at least in certain situations, may have
a greater capacity for self-motivation than do
males.

Previous research on anxiety in relation to 
verbally administered performance reports ap- 
peared to provide a basis for anticipating an 
interaction between the conditions of this experi- 
ment and TAS scores (Sarason, 1960). That 
there was no evidence in support of this expecta- 
tion seems somewhat surprising. It may be that 
the interpersonal aspects of the experimenter- 
subject interaction (which were minimized in 
this study) have contributed to the previously 
reported significant results using experimenter- 
administered motivational instructions and failure 
reports. 
A STUDY OF PHONETIC SYMBOLISM AMONG NATIVE
NAVAJO SPEAKERS 1


JON ATZET 
University of California, Riverside 
and HAROLD B. GERARD 
Center for Advanced Study in the Behavioral Sciences, Stanford, California 


The phonetic symbolism hypothesis has previously received some support 
using English-speaking Ss. If the phenomenon exists it should occur using 
speakers of any natural language. An experiment was run using 60 native 
speakers of the Navajo language who were asked to match Navajo antonyms 
with their Chinese and Hindi equivalents. The results were negative.


The various experiments concerned with pho- 
netic symbolism in natural languages have not 
consistently produced results which substantiate 
the existence of the phenomenon. Results sup- 
porting the phenomenon have been interpreted 
in terms of a universal phonetic symbolism 
common to natural languages which is based upon 
the assumption that there are unlearned inter- 
connected sensory systems underlying sounds and 
meanings which operate according to nativistic, 
gestalt-like principles. Foreign words and their
English equivalents can be matched because they
are both related to similarly organized trace
systems.


1 This research was supported in part by Grant
GS-15 from the National Science Foundation. We
are grateful to the administrative staff of the
Sherman Indian Institute and especially to Gabe Paxton
for help in making the necessary arrangements to run
the experiment. Special thanks are due to Benjamin
Begay who ran the experiment and was our Navajo
language informant. The senior author conducted
this research as part of the course requirement in
the junior author's undergraduate social psychology
course.

Brown, Black, and Horowitz (1955), using a
technique developed earlier by Tsuru and Fries
(1933), report that English-speaking subjects
were able to match 21 pairs of antonyms
presented in Chinese, Czech, and Hindi to their
English equivalents with accuracy significantly
greater than chance expectancy. However, results
from two more recent studies (Brackbill & Little,
1957; Maltzman, Morrisett, & Brooks, 1956)
support the earlier findings only in part (see
Table 1). The latter authors postulate that for
matching to be successful by English-speaking
subjects, one member of the match must be an
English word. Brown and Nuttall (1959) suggest
that lack of success in attempting to replicate
the earlier results might have been due to
changes in the experimental procedure, specifically
in the method of presentation. In order to
determine whether or not these changes in
procedure affect the ability of subjects to guess the
meanings of foreign words, Brown and Nuttall
used the original list of antonym pairs (Brown
et al., 1955) in order to insure the use of
material that had been shown to be sensitive to
the phonetic symbolism effect. The proven sensi- 
tivity of this list, when presented in the original 
manner, would enable the investigators to deter- 
mine whether or not the changes in procedure 
were responsible for reducing or eliminating the 
evidence for phonetic symbolism. The methods 
of presentation investigated were the English- 
foreign pairs method, the foreign-foreign method, 
and the same-different method. In the English- 
foreign pairs method (Brown et al., 1955), pairs 
of English antonyms (stimulus words) are listed 
on one side of a page, and opposite them appear 
pairs of equivalent antonyms in a foreign lan- 
guage (response words). Words within a pair are 
randomized so that while the subjects know that 
a given English word matches one of the words 
in the foreign pair, they are unable to tell which 
it is. The words are also pronounced by native 
speakers during the test session. The foreign- 
foreign method is similar in construction except 
that the stimulus words are also foreign. Thus,
two foreign languages are involved. In the same- 
different method, devised by Brackbill and Little, 
subjects are presented with two words at a time 
(each from a different language) and are asked 
to judge whether they are the same or different 
in meaning. This last method also incorporates 
the first and second methods in its design. The 
Brown and Nuttall results show that the ability 
of subjects to match words correctly is highly 
sensitive to the method of presentation. The 
English-foreign method enabled English-speaking 
subjects to guess the meaning of foreign words 
with accuracy significantly greater than chance 
expectancy, whereas the foreign-foreign and 
same-different methods resulted in only slightly 
above chance level performance. On the basis 
of these results they propose a new and more
restrictive formulation of the phonetic symbolism
hypothesis: “Antonyms naming opposite ends
of sensible referent continua contrast with one
another on appropriate phonetic dimensions often 
enough to make possible better than chance 
matches with equivalent terms in the subject’s 
native language.” 

They suggest the possibility that in the history 
of languages, antonyms evolved toward phonetic 
contrasts appropriate to their semantic contrast
and that pairs so contrasting sound “right” to 
native speakers and thus have a greater potential 
for survival. This interpretation is different from 
an earlier formulation (Brown et al., 1955) 
which placed primary emphasis on the origin 
rather than on the survival of speech sounds. 
They also suggest, in light of these and the earlier
results, that the hypothesis may have to be
restricted to continua concerned only with
magnitude, that is, heavy-light, large-small, etc. Such
restricted use of concepts cannot permit
hypotheses or conclusions to be formed concerning
the word populations of any of the languages
used.

The previous studies were designed using 
subjects who spoke English as their native lan- 
guage. The present study attempts to examine 
the universality of the phonetic symbolism phe- 
nomenon by using subjects who speak a lan- 
guage which is not a cognate to English. If the 
phenomenon does exist, we should be able to 
replicate the results of Brown and Nuttall 
(1959) under similar experimental conditions 
using subjects speaking any natural language. 


METHOD 
Subjects 


The subjects were 60 Navajo students from the 
Sherman Indian Institute in Riverside, California. 
There were 35 male and 25 female subjects ranging 
in age from 15 to 20 years. Subjects spoke Navajo 
as their native language, and none had any knowl- 
edge of either Chinese or Hindi. The range of their 
ability to read English was rated from below 
Grade 1 through Grade 7. At the beginning of the 
test period subjects were informed as to the gen- 
eral purpose of the experiment. The session was 
conducted entirely in the Navajo language. 


Materials
Three noncognate languages were used: Chinese, 
Hindi, and Navajo. The Navajo words used in the 
experiment consisted of antonym pairs with a high 
frequency of occurrence which named sense
experiences (e.g., warm-cool) and which had an
equivalent concept in the foreign languages. The language
informants for Chinese and Hindi were native 
speakers of these languages. The Navajo informant 
was asked to select the most common forms of the 
antonym pairs used by Brown et al. (1955). No 
pairs were used in which it was only possible to 
form an opposite by prefixing “not” to the word.
Two pairs from Brown et al., gold-iron and
thunder-lightning, were not used since they cannot be
considered antonyms nor do they refer, in any obvious
way, to a sensory continuum. The Chinese and
Hindi informants were asked to match the list 
compiled by the Navajo informant. Pairs for which 
distinct antonyms did not exist in all three lan- 
guages or which were uncommon in any of the 
languages were eliminated. We added other pairs 
to replace those which had to be dropped. Because 
both Chinese and Hindi are commonly spoken in 
several dialects, the national languages from both 
countries were used. The final list, shown in 
Table 2, consisted of 20 antonym pairs. 
Procedure


Oral presentation was used exclusively since there 
is no written form of the Navajo language. Also, 
since the phonetic symbolism phenomenon concerns
sound similarities, visual cues should not only be 
irrelevant but might confound a test of the hypothe- 
sis. Four different orders were used for presenting 
the Navajo pair with either the Chinese or Hindi 
pair. The final presentation order was randomized 
in order to control for series errors and for the 
possibility of a subject devising a guessing pro- 
cedure on the basis of a uniform presentation 
sequence. The stimulus words were given in only 
one order; the response words (the foreign 
pair) were given in either the same order as the 
stimulus words or in the reverse order. The question, 
“Which of the two foreign words mean ____?”
was followed by either the first or the second word 
of the stimulus pair. Therefore, there was a stimulus 
pair followed by a forward order response pair 
followed by a first stimulus word question; a 
stimulus pair followed by a forward order response 
pair followed by a second stimulus word question;
a stimulus pair followed by a reverse order response 
pair followed by a first stimulus word question; and 
a stimulus pair followed by a reverse order response 
pair followed by a second stimulus word question. 
Each subject therefore made eight guesses for each
antonym pair: four Chinese and four Hindi. The
eight possible presentations were randomized for each
antonym pair. Overall, this procedure is similar to
the English-foreign pairs procedure described by
Brown and Nuttall (1959).
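
The presentation scheme described above can be enumerated directly. The sketch below is illustrative only: the names (`presentations_for_pair`, `correct_answer`) are hypothetical, and the randomization of presentation order is left out. It crosses the two foreign languages with the two response orders and the two question words, which gives the eight guesses per antonym pair and, over the 20 pairs, the total number of stimulus presentations.

```python
from itertools import product

LANGUAGES = ("Chinese", "Hindi")
RESPONSE_ORDERS = ("forward", "reverse")   # order of the foreign response pair
QUESTION_WORDS = ("first", "second")       # which stimulus word is asked about
N_PAIRS = 20


def presentations_for_pair(pair_index):
    """All presentations of one antonym pair (order not randomized here)."""
    return [
        {"pair": pair_index, "language": lang,
         "response_order": order, "question": word}
        for lang, order, word in product(LANGUAGES, RESPONSE_ORDERS, QUESTION_WORDS)
    ]


def correct_answer(response_order, question):
    """Position (1 or 2) of the matching word within the response pair."""
    position = 1 if question == "first" else 2
    # In the reverse order the matching word moves to the other position.
    return position if response_order == "forward" else 3 - position


all_presentations = [
    p for i in range(1, N_PAIRS + 1) for p in presentations_for_pair(i)
]

if __name__ == "__main__":
    print(len(presentations_for_pair(1)), len(all_presentations))
```

Because the correct position is balanced across the four order-by-question combinations, a subject who always answered "1" or always answered "2" would score exactly at chance.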

Auditory presentation was by tape recording in 
which the stimulus pair was given in Navajo with 
an elapsed time of approximately 3 seconds between 
the presentation of the first and second word. The
Navajo translator then said, 


The two corresponding foreign words are: [filled 
in by either Chinese or Hindi equivalents in one 
of the four presentation varieties described above]. 
Which of the two foreign words mean [either the 
first or second word of the stimulus pair]. Is it
the first or the second? 


Subjects were allowed an interval of approximately 
20 seconds in which to circle a “1” or a “2” in the 
appropriate column on their answer sheet. In order 
to obviate the effects of fatigue half of the presenta- 
tions were made in each of two 1-hour sessions given 
3 days apart. 


RESULTS 


Scores were the number of correct guesses out
of the 160 stimulus presentations. Using the table
of normal deviates, a critical ratio (CR) was
established at the .01 level. Scores were matched
against the CR in all cases. As indicated at the
bottom of Table 1, the combined data for all
subjects and both languages showed chance
accuracy in matching the foreign words with their
Navajo equivalents. The results of the
Navajo-Chinese and Navajo-Hindi presentations taken
separately were also at chance expectancy.


TABLE 1
PERCENTAGE OF CORRECT MATCHINGS IN STUDIES USING FOREIGN LANGUAGES

Author | Stimulus | Response | N | % correct (auditory / audio and visual / visual) | Subjects
Tsuru(a) | English | Japanese | 51 | 69.0 | English
Allport(a) | English | Magyar | 68 | 56.6 | English
Rich(a) | English | Japanese | 44 | 57.7* | English
 | English | Polish | 44 | 64.8* | English
Brown, Black, and Horowitz | English | Chinese | 86 | 58.9* | English
 | English | Czech | 86 | 53.7* | English
 | English | Hindi | 85 | [illegible].6* | English
Maltzman, Morrisett, and Brooks | English | Japanese | 65 | [illegible] 55.5 | English
 | English | Croatian | 48 | 54.7 | English
 | Japanese | Croatian | 48 | [illegible] | English
Brackbill and Little | English | Japanese | 40 | 47.2* / 52.2 / 51.4 | English
 | English | Hebrew | 40 | 52.6 / 53.3 / 53.0* | English
 | English | Chinese | 49 | 49.0 / 49.9 / 50.8 | English
 | Chinese | Japanese | 40 | 57.0* / 53.9* / 53.7* | English
 | Chinese | Hebrew | 40 | 46.8* / 47.7 / 49.9 | English
Brown and Nuttall | [illegible] | [illegible] | [illegible] | [illegible] | [illegible]
 | English | Hindi | 84 | 60.5* | English
 | Chinese | Hindi | 71 | 52.2 | English
Atzet and Gerard | Navajo | Chinese | 60 | 50.4 | Navajo
 | Navajo | Hindi | 60 | 50.1 | Navajo

a [illegible] Brown, Black, and Horowitz (1955).
* Different from chance with p < .01.

As shown in Table 2, an analysis of the data
by individual antonym pairs yields results above
chance (p < .01) for 2 of the 20 pairs with their
Chinese equivalents and 1 with its Hindi
equivalent. One Chinese pair and 1 Hindi pair showed
below chance accuracy.
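
A critical ratio of the kind referred to above can be sketched as the normal approximation to the binomial. This is a minimal illustration, not the authors' computation: it assumes a chance probability of .5 per guess and a two-tailed criterion at the .01 level (z = 2.576), neither of which is stated in the text, and the counts used in the example are hypothetical.

```python
import math

Z_CRIT_01_TWO_TAILED = 2.576  # assumed criterion; the text says only ".01 level"


def critical_ratio(n_correct, n_guesses, p_chance=0.5):
    """Deviation of the observed count from chance, in standard-deviation units."""
    expected = n_guesses * p_chance
    sd = math.sqrt(n_guesses * p_chance * (1.0 - p_chance))
    return (n_correct - expected) / sd


def differs_from_chance(n_correct, n_guesses, z_crit=Z_CRIT_01_TWO_TAILED):
    """True if the count lies beyond the criterion in either direction."""
    return abs(critical_ratio(n_correct, n_guesses)) >= z_crit


if __name__ == "__main__":
    # Hypothetical scores for one subject over the 160 presentations.
    for score in (80, 90, 100):
        print(score, round(critical_ratio(score, 160), 3),
              differs_from_chance(score, 160))
```

The same function applies to the pair-by-pair analysis if the number of guesses pooled over subjects is supplied as `n_guesses`.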


DISCUSSION 


The present results provide no evidence for
phonetic symbolism for the languages and word
pairs used. Brown et al. (1955) report that
their subjects were able to match Chinese, Hindi,
and Czech antonym pairs with their English
equivalents significantly above chance expectancy.
Maltzman et al. (1956) report similar
results for Japanese and Croatian. Brackbill and
Little (1957) did not confirm these findings and
offer four possible explanations for the
discrepancy: the method of presentation (audio,
audio-visual, or visual); the design of presentation
(English-foreign, foreign-foreign, and
same-different methods); the selection of the word lists
used; and the language native to the subject. The
first source can be ruled out since Brackbill and
Little used these methods of presentation and yet
obtained results at chance expectancy. Further, it
should be noted that the hypothesis in question
concerns a correspondence between sound and
meaning. Brown and Nuttall (1959) report
results better than chance using both oral and
visual methods of presentation. The visual
stimulus could easily have provided the necessary
additional cues (which are irrelevant to the
hypothesis) which aided correct matching. Had
it been possible to provide our subjects with
visual as well as auditory cues, our results might
also have been above chance. Since the stimulus
words were Navajo and the subject was a native
speaker, the second source likewise can be
discounted. Brown and Nuttall, and Maltzman et al.
found evidence for phonetic symbolism only with
the English-foreign method. We found no
comparable evidence with Navajo speakers. The third
explanation can also not account for our chance
level results. The antonym list used in the
present study was based upon the original list
used by Brown et al. with the revisions discussed
above. The fourth source is also doubtful
since Navajo-speaking subjects are equivalent to
English-speaking subjects in that each was tested
in his native language and with foreign,
noncognate languages.
TABLE 2

PERCENTAGE OF CORRECT TRANSLATIONS FOR EACH
WORD PAIR IN TWO LANGUAGES USING AUDIAL
PRESENTATION WITH NAVAJO SUBJECTS

                            % correct
Word pairs               Chinese    Hindi

 1. BEAUTIFUL-UGLY        48.0      50.5
 2. BRIGHT-DARK           50.5      50.5
 3. UP-DOWN               46.0      55.5
 4. WET-DRY               53.0      52.0
 5. FAST-SLOW             53.0      51.0
 6. HEAVY-LIGHT           44.0      45.5
 7. HARD-SOFT             45.0      43.0*
 8. SHORT-LONG            [illegible] 49.5
 9. MANY-ONE              61.0*     58.5*
10. SWEET-SOUR            53.0      46.0
11. COOL-WARM             53.5      49.0
12. WIDE-NARROW           45.5      52.5
13. BIG-LITTLE            53.5      44.0
14. SMOOTH-ROUGH          32.0*     51.0
15. PUSH-PULL             45.0      45.5
16. EMPTY-FULL            56.0      47.5
17. THIN-FAT              61.0*     45.5
18. HIGH-LOW              52.5      53.0
19. SQUARE-ROUND          48.5      52.0
20. DEEP-SHALLOW          50.0      54.5

Note.—Word pairs numbered 1 through 12 are those used by
Brown, Black, and Horowitz (1955).
* Different from chance with p < .01.


Our failure to produce results supporting the
phenomenon does at least cast doubt upon the
generality of phonetic symbolism in natural
languages. At best, we can say that if the
phenomenon does exist it has thus far been shown
to occur only with English-speaking subjects and
then only to a questionable and limited extent.
It is also possible that the limited positive 
results may be due to the more cosmopolitan 
language experiences of American, English- 
speaking, college undergraduates (mostly from 
Harvard College). Some learning of the foreign 
languages used, however minimal, may have 
occurred.

It is probable that there are, within every 
language, certain phonetic symbols which do 
have an onomatopoetic quality. The set may be 
limited to a very small number of words whose 
referent itself can be heard and which the 
human voice can imitate such as “cricket” or 
“sizzle.” We sometimes name sounds appropri- 
ately like the “buzz” of a bee or the “tweet” 
of a bird. There is also the “whippoorwill” and 
the “bobwhite.” These phonetic symbols are 
clearly meant to imitate the sounds they refer 
to. There are also words which are meant to 
directly elicit visual images like “cottontail” or 
“bobbysoxer.” 

Languages are in continual flux. They change 
for many reasons, some of which are not clearly 
understood. It is unlikely that the onomatopoetic
quality which may have originally characterized
many words would be retained as these
words undergo vowel shifts and other transforma- 
tions in their continued use by a given popula- 
tion of speakers or as they diffuse across lan- 
guage boundaries. Most words have evolved into 
arbitrary symbols upon which rests their ultimate 
utility in communication. The fact that we often 
suspect a word as being onomatopoetic is prob- 
ably due to its ability to call forth a particular 
auditory image or feeling in us which is based, 
in turn, upon its having been so often associated 
with a particular experience in the past. The 
amount of overlap between a given pair of non- 
cognate languages of such images called forth by 
similar-sounding words is probably minimal and 
if it does occur it will often be coincidental. 
VERBAL CONDITIONING IN SCHIZOPHRENIA AS A FUNCTION
OF DEGREE OF SOCIAL INTERACTION 1


EUGENE EBNER 2
Purdue University


Schizophrenic and normal Ss were positively reinforced for using I and We
pronouns in a Taffel sentence-construction task. For ½ of each diagnostic group
the verbal reinforcer “Good” was delivered by E in direct presence of S;
the other ½ of each group heard the reinforcer over a loudspeaker with E not
interacting directly with S. Normal groups showed significant verbal
conditioning under both reinforcement procedures but no conditioning was obtained
for the schizophrenic groups. Taylor Manifest Anxiety scale score showed
no relation to verbal conditioning performance.


The use of verbal conditioning techniques in 
the study of schizophrenic behavior pathology is 
currently an expanding area of research. Although 
recent studies have largely been aimed at ex- 
ploring the effectiveness of several types of 
reinforcing events on schizophrenic subjects, 
primary attention has been given to reinforcers
which represent the theoretically important
variables of social reward and censure.


1 This paper is based upon a dissertation
submitted to the Department of Psychology of Purdue
University in partial fulfillment of the requirements
for the degree of Doctor of Philosophy. Appreciation
is expressed to John M. Hadley, Frederick H. Kanfer,
and Robert A. Wagoner for their assistance in the
planning and execution of this study. Portions of
the present paper were presented at the meeting of
the Midwestern Psychological Association, Chicago,
May 1963.

2 Now at Lafayette Clinic and Wayne State
University, Detroit, Michigan.

Studies by Leventhal (1959) and Cohen and
Cohen (1960) have found that schizophrenic
subjects show little, if any, conditioning when
correct responses are followed by positively
reinforcing statements indicating social approval
or reward, while negative reinforcement (social
censure) of incorrect responses results in
significant conditioning effects for a nonpunished,
alternative response. While these findings have
suggested a hypothesis of diminished motivational
properties of acquired social rewards in
schizophrenia, and a contrasting focus upon avoidance
of aversive social stimulation, other studies
(Hartman, 1955; Krasner & Ullmann, 1958;
Salzinger & Pisoni, 1958) have successfully
conditioned selected responses of schizophrenics
through positive verbal reinforcement.
Additionally it has been shown that situational variables,
other than the affective value of the reinforcers 
can influence verbal conditioning in normal sub- 
jects, such as the amount of prior interaction 
between the experimenter and the subject 
(Kanfer & Karas, 1959) and the emotional 
atmosphere of the experimental situation (Weiss, 
Krasner, & Ullmann, 1960). It would thus seem 
that further investigation of the effects of such 
situational variables upon schizophrenics is a 
necessary step in clarifying questions concerning 
the motivational properties of socially rewarding 
and punishing reinforcers. 

The present study was designed to explore the 
effects of one such variable—the degree of social 
interaction between the experimenter and the 
subject—upon verbal conditioning of schizo- 
phrenics and normals. Selection of this variable 
was guided by the general view that social com- 
munication and interaction represent major 
problem areas for the schizophrenic and that a 
primary function of his withdrawal and isolation 
is a defense against a world of social interactions
and involvements which have been presumably 
nonrewarding and harmful (Cholden, 1956; 
Jenkins, 1952). Since the verbal conditioning pro- 
cedures employed by Cohen and Cohen, and by 
Leventhal involved direct interpersonal contact 
between the experimenter and the subject, it is 
possible that the nonresponsiveness of the schizo- 
phrenics to the positively reinforcing cues may 
have been due, in part, to the threatening and 
potentially inhibiting effects of the social context 
of the situation, instead of being solely depend- 
ent upon the affective quality of the reinforcer. 
In view of these considerations this study
attempted to determine whether minimizing the
degree of social interaction in the verbal
conditioning situation would result in heightened
responsiveness of schizophrenics to a positive 
verbal reinforcer. 

A secondary variable incorporated into the
experiment concerned the manifest anxiety of the 
subject. Although Taffel (1955) found this vari- 
able to relate significantly to verbal conditioning 
in a psychiatric population, Buss and Gerjuoy 
(1958) were unable to replicate this finding using 
a similar population, and Sarason (1958) reported 
that scores on a scale of “general anxiety” 
showed no relationship to conditioning level in 
hospitalized psychiatric patients. Since the present 
study employs a verbal conditioning procedure
almost identical to that used in these previous
investigations, it seemed desirable to further
examine the relationship between anxiety and
verbal conditioning by obtaining a measure of
the subject’s manifest anxiety level.


METHOD
Subjects 


The schizophrenic sample consisted of 36 patients 
drawn from a state hospital, an acute intensive 
treatment hospital, and the psychiatric ward of a 
Veterans Administration General Medical and Surgi- 
cal Hospital. All subjects met the criteria of staff 
diagnosis of schizophrenia, no history or evidence
of organic involvement, between 20 and 50 years of 
age, and literacy. The mean age, educational level, 
and length of hospitalization of this sample were 
32.7 years, 11.3 grades, and 19.2 months, respectively. 

The normal group consisted of 36 Veterans Admin- 
istration general medical and surgical patients for 
whom there was no history or suspicion of psychia-
tric difficulty. Mean age and educational level were
37.8 years and 10.2 grades. The groups differed signifi-
cantly on mean age (t = 2.54, p < .05) but not on
educational level (t = 1.97, p > .05).


Apparatus and Procedures 


The verbal conditioning task was closely similar 
to the sentence-construction technique originally de- 
scribed by Taffel (1955). Stimulus materials con- 
sisted of 160 cards with a different verb typed on 
the center of each card. All verbs were in the past
tense and were chosen from the thousand most 
frequently occurring words in the Thorndike-Lorge 
word count. An effort was made to select only 
affectively neutral verbs. Beneath the verb on each
card the pronouns I, We, He, She, They were typed
in a randomly chosen order for each card. To
facilitate reading the card at a distance of 5 feet,
large, bulletin type was used.

The cards were exposed singly and the subject 
was instructed to make up one sentence for each card 
containing the word in the center and one of the 
pronouns below. The presentation of each card
constituted a trial and all subjects were given 160
trials.

The schizophrenic and normal groups were each 
randomly divided into two subgroups differing in 
the degree of direct interpersonal interaction during 
the conditioning task as follows: 

Interaction. The experimenter and the subject were 
seated directly facing each other across a small table 
with an 8 inch high exposure stand between them. 
The experimenter presented the cards on the stand 
and reinforced the subject’s use of I or We in a 
sentence by saying “Good.” Thus, there was direct 
visual and verbal interaction between the experi-
menter and the subject throughout the conditioning
task.

Minimal interaction. The subject was seated at
one end of a 5 foot long table placed against a wall 
of the experimental room. Facing the subject on the 
other end of the table was a black wooden partition 
30 inches wide and 25 inches high, with a 4 X 5
inch opening cut in the center. A 6 foot high cloth
screen was placed alongside the partition to conceal 
the area behind the screen where the experimenter 
sat and where recording equipment was placed. After 
giving instructions to the subject, the experimenter 
sat behind the partition and presented the cards 
through the opening. 

A tape recorder placed behind the screen was 
connected to a loudspeaker resting on the table 
near the subject. Presentation of the verbal reinforce- 
ment was accomplished by playing a tape loop on 
which the word “Good” was recorded. The rein- 
forcement was presented following each sentence 
containing I or We. Thus, in this condition, all that 
was visible to the subject was the partition con- 
taining the stimulus cards and the loudspeaker. 
Compared with the other experimental condition, 
therefore, social interaction was minimal.

No reinforcement was given during the first 20 
trials which comprised the operant-level period. 
During the acquisition phase (Trials 21-80), the 
experimenter reinforced all I and We responses in 
the manner appropriate to the particular experi- 
mental condition. Trials 81-160 constituted an 
extinction period during which no reinforcement 
was given.
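
The trial schedule above can be summarized in a short sketch. The function name and return format are illustrative assumptions, not part of the original procedure; the 20-trial block size follows the blocks used to plot the results.

```python
def phase_and_block(trial: int):
    """Map a 1-based trial number to (phase, block), using 20-trial blocks.

    Phases follow the schedule in the text: Trials 1-20 operant level,
    Trials 21-80 acquisition, Trials 81-160 extinction.
    """
    if not 1 <= trial <= 160:
        raise ValueError("trial must be between 1 and 160")
    block = (trial - 1) // 20 + 1      # Blocks 1 through 8
    if trial <= 20:
        phase = "operant"              # no reinforcement given
    elif trial <= 80:
        phase = "acquisition"          # "Good" follows each I or We sentence
    else:
        phase = "extinction"           # reinforcement withdrawn
    return phase, block
```

Under this numbering, Blocks 3 and 4 cover Trials 41-80, the span used for the main acquisition analysis.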

Immediately following the verbal conditioning, 
subjects were questioned about the purpose of the 
experiment and for awareness of the reinforcement 
contingency. Following this, each subject filled out 
the 50-item version of the Taylor Manifest Anxiety 
(MA) scale.


RESULTS 


Figure 1 presents the acquisition and extinction 
curves for each group plotted in terms of mean
frequency of reinforced responses (I and We re-
sponses) per block of 20 trials. Inspection of the
acquisition phase of Figure 1 reveals considerable
variability in the shape and height of the curves,
with both normal groups showing higher levels
of responding. Further, the curve for the normal
interaction (NI) group rises most rapidly, al- 
though the normal minimal interaction (NMI) 
group also manifests a clearly positive slope.
In the schizophrenic groups, however, the pat-
tern appears reversed, with the schizophrenic
interaction (SI) group having an essentially flat
curve, while the schizophrenic minimal interaction
(SMI) group shows a slightly positive slope.

Analysis of variance of Trial Block 1 revealed
no significant group differences (F < 1) in mean
nonreinforced use of I and We. The dependent
variable for the major analysis of the acquisition
data consisted of the sum of reinforced re-
sponses occurring in the final acquisition blocks,
Blocks 3 and 4 (Trials 41-80), where the
effects developed by the conditioning procedure
would be expected to be most clearly exhibited.
Table 1 presents the results of a covariance
analysis of Trials 41-80, using performance on
nonreinforced Trials 1-20 as the covariate.³
This analysis indicated that the two diagnostic
groups varied in amount of conditioning, with
the normal groups showing a significantly
(F = 7.16, p < .01) higher level of responding.
However, response to the positively reinforcing
statement “Good” did not appear to be sys-
tematically influenced by the degree of social
interaction involved (F < 1 for Methods). Al-
though it appeared from Figure 1 that schizo-
phrenics responded differently from normals to
each reinforcement method, the nonsignificant
Methods X Diagnosis interaction term
(p > .10) revealed that these differences were not
statistically reliable.

To determine in which subgroups significant
conditioning effects were present, the linear trend
for each group was analyzed by computing
“slope” scores for each subject over the
acquisition blocks. Only in the normal groups
were the slopes of the conditioning curves sig-
nificantly different from zero at the .05 level or
better (F = 11.56, p < .01 for Group NI;
F = 7.53, p < .05 for Group NMI).

3 Although no significant group differences existed
initially (Trials 1-20), scores on Trials 41-80 were
found to be correlated with Block 1 scores; thus,
the purpose of the covariance analysis was to pro-
vide a more powerful test by adjusting for this
correlation.


TABLE 1

ANALYSIS OF COVARIANCE OF CONDITIONED
RESPONSES DURING ACQUISITION

Source            df      MS        F
Diagnosis (B)      1    336.01    7.16**
A X B              1     59.92
Error             67     46.90

** p < .01.
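
The text does not define the “slope” score further. As a minimal sketch, assuming it is the ordinary least-squares slope of a subject's reinforced-response counts regressed on block number, it could be computed as follows. The function name and the example counts are hypothetical, and the number of blocks is whatever the caller supplies.

```python
def slope_score(block_counts):
    """Least-squares slope of response counts regressed on block index.

    Assumption: blocks are equally spaced and indexed 1, 2, 3, ...
    """
    n = len(block_counts)
    if n < 2:
        raise ValueError("need at least two blocks")
    xs = range(1, n + 1)
    mean_x = sum(xs) / n
    mean_y = sum(block_counts) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, block_counts))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Hypothetical subject whose I/We count rises by two per block.
hypothetical_subject = [5, 7, 9, 11]
print(slope_score(hypothetical_subject))  # 2.0
```

A flat curve, such as the one described for Group SI, would yield a slope near zero under this definition.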

Extinction trials were analyzed only for the 
normal groups which had shown significant acqui- 
sition effects. Linear trend analysis for each 
group indicated that Group NMI showed a sig- 
nificant decrease (F = 5.45, p<.05) in condi- 
tioned responses over nonreinforced extinction 
blocks, while Group NI showed no significant 
extinction effect (F <1). 

The effects of anxiety level upon conditioning 
were studied by dichotomizing each group at the 
median of its distribution on the Taylor MA scale 
into high-anxiety (HA) and low-anxiety (LA) 
subjects. The median MA scale scores for the 
groups were: NI, 10.90; NMI, 10.75; SI, 20.50; 
SMI, 10.00. Analysis of covariance⁴ of the con-
ditioning performance of HA and LA subgroups
revealed no significant differences among the 
groups which could be attributed to different 
anxiety level. Thus, in this sample, MA scale 
scores had no systematic relation to rate or 
amount of verbal conditioning for normal or 
schizophrenic groups. 

Examination of the questionnaires concerning 
awareness of the nature of the experiment re- 
vealed that a number of subjects were able to 
verbalize the contingency between their responses 
and the reinforcement. The number of such sub-
jects in each group was: NI, 7; NMI, 4; SI, 3;
SMI, 4. The t tests of the differences in condi-
tioning performance of verbalizers and non-
verbalizers indicate that in group NI the verbal-
izers showed significantly greater conditioning
effects (t= 3.87, p<.01); for the remaining 
groups only chance differences were observed. 


DISCUSSION


The finding that both normal groups attained
higher conditioning levels than the schizophrenic 
groups offers support for the hypothesis that 
verbal stimuli which connote acquired social 
rewards possess relatively less motivating proper- 
ties for the schizophrenic. However, varying the 
degree of social interaction surrounding the rein-
forcement situation did not appear to significantly
alter the minimal responsiveness of the schizo- 
phrenics; in particular, the less social condition 
produced no enhancement of verbal condition- 
ing, contrary to the expectation derived from the 
theoretical argument presented earlier.

4 As in the previous covariance analysis, score on
Blocks 3 + 4 was the variate and score on Block 1
was the covariate.

It thus appears from these data that the schizophrenic’s
failure to respond to positive verbal reinforce-
ments represents a generalized avoidance of this
class of events, irrespective of the situational
context in which they occur. Recent research,
however, and consideration of some features of 
the present experimental situation require some 
qualification of this conclusion. First, it should 
be noted that the physical arrangements of the 
two reinforcement conditions did not allow for as 
wide a contrast between normal and minimal 
interaction conditions as might be necessary to 
elicit differential effects. It would be desirable, 
for example, to have the subjects in the minimal 
interaction condition seated alone in a room with 
the task stimuli presented automatically and the 
reinforcement controlled by the experimenter
from an adjoining room. This would minimize 
not only interaction during the conditioning pro- 
cedure, but also the subject’s awareness of an- 
other person’s presence. A recent study by 
Johannsen (1961) investigated the effects of 
social versus nonsocial feedback upon alternation 
learning by schizophrenics, using a physical ar- 
rangement which came close to the above re- 
quirements and demonstrated superior perform- 
ance of schizophrenics in the nonsocial condition; 
however, the reinforcer in Johannsen’s nonsocial 
condition was a light rather than a verbal re- 
sponse. Additionally, Robinson (1957) found 
that verbal rewards such as “Good” and “Right” 
resulted in poorer paired-associate learning by 
schizophrenics than did a nonsocial reward of a 
machine click following correct responses. Thus,
it appears that some aspects of the social con-
text of the task situation are related to differen-
tial responsiveness, although it may be that the
most significant aspects involve differences be- 
tween a verbal and nonverbal reinforcer (for 
example, “Good” versus light). 

It is of interest to compare this study to one 
by Salzinger and Pisoni (1958), in which self- 
reference statements of affect by schizophrenics 
were successfully conditioned by positive verbal 
reinforcers. Their conditioning was done as part 
of an apparently routine hospital interview for 
newly admitted patients, the important point 
being that the social interaction between the 
interviewer and the subject apparently did not 
impair conditioning. It seems likely, however, 
that from the subject’s frame of reference, being 
interviewed about problems and feelings shortly 
after hospitalization would seem readily under- 
standable in terms of cultural expectations, while 
the sentence-construction task confronted the 
subject with a much stranger and ambiguous 
situation. Matarazzo, Saslow, and Pareis (1960) 
have pointed out the significance of the subject’s 
expectancies in affecting the relative conditiona-
bility of different response classes and it would
seem plausible that these “situation-induced 
expectancies” may in part explain why schizo- 
phrenics conditioned in an interview situation 
but not in the present experiment nor in Cohen 
and Cohen’s study using the same procedure. 
While this hypothesis represents a post hoc at- 
tempt at explanation, the sensitivity of the verbal 
conditioning phenomenon to several situational 
variables (Kanfer & Karas, 1959; Weiss et al.,
1960) suggests that it can be very plausibly 
considered. 

The absence of any significant relation between
anxiety level and conditioning in this study is of
interest. Studies by Buss and Gerjuoy (1958)
and by Daily (1953), using psychiatric patients 
and college students, respectively, also failed to 
confirm Taffel’s original report of a positive 
relation between MA scale scores and verbal 
conditioning. It is significant that all of these 
studies have employed the Taylor scale as the 
anxiety measure—a scale which emphasizes de- 
scription of long-standing fears and somatic 
components of anxiety. However, Sarason (1958) 
found a significant relation between verbal con- 
ditioning and a scale of test anxiety, but not 
between conditioning and a general anxiety scale, 
which appears more related to the Taylor scale. 
Thus, it may be that the aspects of anxiety 
elicited by the Taylor scale are not as relevant 
to, or activated by, the testlike conditioning
situation. Although positive correlations between
various types of anxiety scales have been reported 
(Gordon & Sarason, 1955; Sperber, 1959), the 
size of these correlations (from .30 to .50, 
approximately) reveals enough unique variance 
among the scales to suggest that the relationship 
of anxiety and verbal conditioning may be 
clarified by more specific and circumscribed 
conceptualization of the anxiety variable.


THE EFFECT OF JUDGES’ ATTITUDES ON JUDGMENT¹


JOEL W. AGER
Center for the Study of Cognitive Processes, Wayne State University
AND ROBYN M. DAWES
Veterans Administration Hospital, Ann Arbor, Michigan 


Attitude-scaling studies using paired comparisons have consistently supported 
Thurstone’s assumption that Ss’ favorableness judgments are independent of
Ss’ attitudes. This assumption seems unreasonable from the standpoint of both 
social perception and naive observation. The experimental hypothesis tested 
was that number of “errors” in paired comparisons of attitude statements is 
a positive function of distance of the statements from the Ss’ own position.
9 proscience and 9 antiscience Ss made such judgments among statements 
representing 6 categories of favorableness to science. A linear trend analysis 


. showed that the proscience Ss had significantly more confi 


fusions among the 


less favorable categories; the reverse was true for the antiscience group, The 
results supported the experimental hypothesis and contradicted the Thurstonian 


assumption, 

Question: In what ways does an individual’s 
own attitude affect his judgments of alternative 
attitudes? This question has interested both psy- 
chologists concerned with social perception and 
those concerned with scaling theory. 

Almost all investigations of the question have 
dealt with judgments of “favorability’—that is, 
with judgments of the degree to which one 
statement expresses a favorable or unfavorable 
attitude toward a social phenomenon: institution, 
group, or philosophy. The standard method of 
investigation has been to determine whether the 
judgments of favorability of a group favorably 
disposed toward the social phenomenon differ 
from those of a group unfavorably disposed. 
The impetus for adopting this standard method
came from Thurstone’s (1929) scaling assump-
tion that a judge’s own attitude would not affect
his judgments (that is, ratings) of favorability.
Thurstone felt this assumption was a necessary
condition for his scaling methods to be valid.
To test the Thurstone assumption, psychologists
investigated favorably disposed and unfavorably
disposed groups’ ratings of attitudes toward 
Negro equality (Hinckley, 1932), pacificism 
(Ferguson, 1935), patriotism (Pintner & Forlano, 
1937), and Al Smith (Beyle, 1932); the ratings 
were analyzed according to Thurstone’s equal- 
appearing intervals model. Since the scale values 
obtained from differing groups correlated highly 
with each other (.93 to .99), these authors con- 
cluded that the Thurstone assumption was valid. 


1 All statements are those of the authors; they do 
not necessarily represent the views of the Veterans 
Administration.


Their answer to our question would be: the 
individual’s attitude does not affect his judg- 
ment, at least not his judgment of favorability. 

More recent authors (Granneberg, 1955; 
Hovland & Sherif, 1952; Kelley, Hovland, 
Schwartz, & Abelson, 1955), however, have 
argued that correlation is a bad measure of the 
similarity between ratings; they point out that 
while correlation is relatively sensitive to agree-
ment in rank ordering (of favorability) of atti-
tude, it is relatively insensitive to perceived dif-
ferences between attitudes similarly ranked.
(Note that the correlation between x and x² for
the integers 1 to 20 is .975.) Moreover, as these
authors point out, the Thurstone assumption is
in direct contradiction to the well-accepted as- 
sertion that perception is influenced by the atti- 
tude of the perceiver. Rating of favorability of 
attitude is a subdomain of rating of attitude, 
which is a subdomain of judgment of attitude, 
which in turn is a subdomain of perception of 
attitude. And if perception is influenced by the 
attitude of the perceiver, it would be odd if a 
narrow subdomain of perception were not. 
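
The insensitivity of the correlation coefficient to nonlinearity can be checked directly. The sketch below computes the product-moment correlation between the integers 1 to 20 and their squares; the helper name `pearson_r` is an illustrative choice, not something taken from the sources cited. The result comes out at roughly .97, close to the figure quoted above.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


integers = list(range(1, 21))
squares = [x * x for x in integers]
r = pearson_r(integers, squares)
print(round(r, 2))  # 0.97
```

A strictly curved relation thus still yields a correlation in the high .90s, which is why a high correlation between two groups' scale values says little about whether the groups space the statements alike.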

The most definitive experiment concerning our 
question was performed by Kelley et al. (1955). 
These authors investigated a group of anti- 
Negro whites’ and a group of Negroes’ ratings 
of Hinckley’s items. The results were as follows: 
the two groups differed when the items were 
scaled according to the method of equal- 
appearing intervals, the two groups did not differ 
when the items were scaled according to a 
method of paired comparisons, and when the 
method of successive intervals was employed for 
scaling, the results were midway between those 
obtained by the other two methods. The authors’
conclusion is similar to that reached by Edwards 
in reviewing their work. Edwards (1957) writes: 


Thus it would seem as if the displacement effect . . .
is primarily a function of the method by which the
judgments were obtained, that is, the method of 
equal appearing intervals [p. 116]. 


From the above evidence, it is also possible 
to reach a conclusion about the nature of the 
judgmental process—a conclusion that would 
explain why the effect occurred with the method 
of equal-appearing intervals but not with the 
method of paired comparisons. It is because this 
further conclusion is stronger than those pro- 
posed by past investigators that we feel past data 
must be reexamined and further data collected. 

Consider the Kelley et al. finding in terms 
of Coombs’ (1964) distinction between intraset 
scaling and interset scaling. The method of paired 
comparisons involves comparing attitude state- 
ments with attitude statements (that is, an intra- 
set comparison of attitude statements), while 
the methods of equal-appearing intervals and 
successive intervals involve comparing an atti- 
tude statement with a category boundary (that 
is, an interset comparison). Since judges’ attitude 
does not affect the attitude-attitude intraset 
comparison but does affect the attitude-category
boundary interset comparison, it is possible to
conclude the following: An individual’s attitude
simply affects his use of category boundaries.²

Evidence cited previously supports the conten-
tion that an individual’s attitude affects his use
of category boundaries. Additional evidence has
recently been presented by Upshaw (1962); he
found that if an individual’s own attitude lies 
outside a range of stimuli he is categorizing, the 
individual will “spread” his category scale so 
that it includes his own attitude, thereby locating 
the judged stimuli in fewer categories than he 
would were his own attitude within the range of 
these stimuli. The power of the above conclusion, 
however, is not just that own attitude affects 


2 Kelley, Hovland, Schwartz, and Abelson (1955)
write:


We would make the hypothesis that paired com- 
parisons scale values largely reflect a semantic or 
meaning continuum and that equal appearing 
intervals values reflect a continuum of social 
significance or importance. 


Their statement is here viewed as one of many pos- 
sible interpretations of the differential effects of 
attitude-attitude comparisons versus attitude-category 
boundary comparisons—albeit a highly reasonable 
interpretation. 
use of categories, but that it simply affects use
of categories. The conclusion is that discrimina-
tion is not affected, and presumably other judg-
mental processes as well might not be affected.

Since categorization behavior is just one type
of social judgment, limitation of attitude effects
to this domain would be severely restrictive;
moreover, such limitation would bring into seri-
ous question work of those psychologists (for
example, Manis, 1961) attempting to derive gen-
eral laws of social judgment from data
on categorization. Thus, we must consider care-
fully whether the evidence for the conclusion is
sufficient.

The crux of the evidence lies in the contention
that attitude-attitude comparisons are not af-
fected by the attitude of the rater; in advancing
this contention, Kelley et al. (1955) write:


The relative linearity of the three scatter diagrams
[comparing scale values of attitude statements] is
suggested by the product moment correlations be-
tween Negro and white scale values: .97 for paired
comparisons . . . [p. 155].


In light of these authors’ earlier remarks, this
statement is somewhat surprising (and
the product-moment correlation of the nonlinear
relation between x and x² is greater than the one
they report).

We do not feel that the high correlation be-
tween scale values supports the contention that
the Negro and white groups’ ratings do not
differ. In fact, the question is not even whether
these two groups agree “on the whole,” or how
much they agree. The question is whether their
disagreements may be explained in terms of their
attitudes.

Now in order to characterize the disagreements [...]
agree; that is, if we wish to relate the groups’
attitudes to the fact that one group has a greater
tendency than the other to say Statement A is
more favorable than Statement B, it is necessary
to know something about the attitudes expressed
by A and B. Specifically, we may wish to know
whether A and B are favorable statements, un-
favorable statements, neutral statements, etc.
The idea of a statement that is in itself favor-
able or unfavorable, however, rests on the as-
sumption that there is a consensus ordering of
favorability of statements. It then follows that
whenever two individuals or groups disagree
about which of two items is more favorable, one
individual or group agrees with the consensus
ordering and one does not. If someone does not
agree with the consensus ordering, we say that
he has failed to discriminate, or that he has
committed an error.³

In short, then, we maintain that our question 
may best be answered by investigating the rela- 
tionship between a rater’s attitude and the type 
of errors he makes. Rather than determine 
whether pro- and antigroups order statements 
in a roughly similar manner, we should determine 
whether their error patterns are similar. 

The hypothesis to be investigated is that the 
error patterns of pro- and antiraters will not 
be similar. Specifically, we hypothesize a dis- 
placement effect; that is, the proportion of errors 
made by pro (anti) raters will be greater when 
the discrimination is made among anti (pro) 
statements than when it is among pro (anti) 
statements.⁴

We are saying, in effect, that when the right- 
winger says he sees no difference among the 
varieties of left-wingers, this implies an in- 
ability to discriminate among them over and 
above a preference for categorizing them to- 
gether. We feel that the effect of attitude on 
the discrimination process will be similar to that 
which has been found for the categorization 
process. 


PROCEDURE 


The attitudes used in this study were statements 
about science. In a previous study by Ager (1961), 
these statements had been rated in terms of their 
“favorability toward science” by representative
groups of college students. On the basis of these 
ratings, these statements can be grouped into six 
categories of favorability. These categories, labeled 
A (most favorable) through F (least favorable), 
have the property that no statement in a given 
category had been rated as more favorable than any
statement in a more favorable category by over 
half the students, or less favorable than any state- 
ment in a less favorable category. There are five 
statements in each category. The statements are 
presented in Table 1. 

An “own attitude” questionnaire was constructed 
from four statements in Category A and four in 
Category F. Subjects taking this questionnaire are 
asked to rate their agreement with these statements, 
as well as their agreement with statements about 
advertising, sex, and political commitment; the 
ratings consist of numbers ranging from +2 (strongly
agree) to -2 (strongly disagree). For each subject,
a favorability score can be computed by subtracting
the ratings he gave to the four F statements from
the ratings he gave to the four A statements.

3 In what follows, it must be remembered that
“error” means simply “disagreement with consensus
ordering.”

4 Note that this hypothesis is of an interactional
nature and that it was carefully worded to “correct”
for the following effects: some pairs of statements
may be more difficult to discriminate than others,
and one group of raters may have a higher error
rate than another group.

This own attitude questionnaire was administered 
to 176 students from an introductory psychology 
class at the University of Michigan—96 females and 
80 males. The favorability scores of these students 
ranged from +16 to —16 (the two possible ex- 
tremes); the split-half reliability of the favorability 
scores was .73, which by the Spearman-Brown 
prophecy formula yields a reliability for the entire 
score of .84. 
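
As an illustration, the scoring rule and the Spearman-Brown step can be sketched as follows. The function names are hypothetical, and the sketch assumes the four ratings in each category are summed before subtracting, which is consistent with the stated range of +16 to -16.

```python
def favorability_score(a_ratings, f_ratings):
    """Sum of ratings on the four A statements minus sum on the four F statements.

    Each rating is assumed to lie between -2 (strongly disagree)
    and +2 (strongly agree).
    """
    if len(a_ratings) != 4 or len(f_ratings) != 4:
        raise ValueError("expected four A ratings and four F ratings")
    for rating in list(a_ratings) + list(f_ratings):
        if not -2 <= rating <= 2:
            raise ValueError("ratings must lie between -2 and +2")
    return sum(a_ratings) - sum(f_ratings)


def spearman_brown(r_half, factor=2):
    """Reliability of a test lengthened by `factor`, given the shorter test's r."""
    return factor * r_half / (1 + (factor - 1) * r_half)


print(favorability_score([2, 2, 2, 2], [-2, -2, -2, -2]))  # 16
print(round(spearman_brown(0.73), 2))                      # 0.84
```

The second line reproduces the step from the split-half value of .73 to the full-score reliability of .84.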

The experimenters then invited students with 
extreme scores to participate in a “survey of student 
opinion” for pay. Ideally, the experimenters would 
have liked to form a proscience group of the 10
students with the most positive scores and an anti-
science group of the 10 students with the most
negative scores. A few of the invited students de- 
clined to participate, however, and the experimenters 
made additional departures from their ideal selection 
criteria in order to insure that neither pro- nor 
antigroup be predominantly male or female.⁵ In all,
11 students with extreme positive scores and 11 
students with extreme negative scores participated 
in the “survey”; the characteristics of these students 
will be described in the Results section of this paper. 

The subjects were contacted individually by letter 
and phone; they were not told that they were con- 
tacted because they were extremists, but rather were 
told they had been carefully selected as part of a 
representative sample of students. They met with the 
experimenters individually and in small groups. 

The opinion questionnaire presented to the subject 
consisted of all possible pairs of attitude statements 
from adjacent categories; the subject was to select 
the member of the pair that was more favorable
toward science. Since there were 5 statements in
each category, there were 25 pairs of statements
from adjacent categories, and therefore 5 X 25 = 125
pairs of statements altogether. In addition to these 
pairs, 5 pairs of AF statements were included 
in order to disqualify subjects who were either 
extremely careless or obviously misunderstood the 
directions (as, for example, if the subject selected 
the statements with which he agreed rather than 
the one more favorable). These pairs were presented 
in random order with the constraint that no two 
consecutive pairs be from the same two categories; 
within each pair, the order was balanced so that 
each statement appeared half the time as the first 
member of a pair and half the time as the second. 
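
The count of 125 pairs can be reproduced with a short sketch. Labels such as `A1` are hypothetical identifiers for the statements in Table 1; the five AF check pairs, the randomized presentation order, and the balancing of position within pairs are not modeled here.

```python
from itertools import product

CATEGORIES = "ABCDEF"            # A = most favorable, F = least favorable
STATEMENTS_PER_CATEGORY = 5


def adjacent_category_pairs():
    """All pairings of one statement from a category with one from the next."""
    pairs = []
    # zip pairs each category with its neighbor: AB, BC, CD, DE, EF
    for upper, lower in zip(CATEGORIES, CATEGORIES[1:]):
        numbers = range(1, STATEMENTS_PER_CATEGORY + 1)
        for i, j in product(numbers, repeat=2):
            pairs.append((f"{upper}{i}", f"{lower}{j}"))
    return pairs


pairs = adjacent_category_pairs()
print(len(pairs))  # 125
```

Five adjacent-category comparisons with 25 statement pairs each give the 125 judgments described above.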


RESULTS 


Two pro- and two antisubjects originally 
selected for inclusion on the basis of their prefer- 
ence scores were discarded for making more than 
one error on the AF comparisons. An example 
of such an error would be the judgment that the 


5 The male students had a significant tendency to
have higher scores—that is, to be more proscience— 
than the females. 
TABLE 1
CATEGORIES OF FAVORABLENESS TOWARD SCIENCE STATEMENTS 


Category A 


1. Through science alone will man comprehend the universe. 

2. All the innermost secrets of nature must yield to science.

3. Ultimately our view of the world shall be wholly scientific in nature. 
4. Knowledge of all things can be attained through science. 

5. Only science gives man the hope of understanding his world. 


Category B 


1. A scientific analysis should always be preferred to any other.
2. No explanation can be satisfactory unless scientifically based.
3. The most satisfactory answer to a problem is the scientific one.
4. For the best explanation of an event we would turn to science.


5. Only scientific interpretations of phenomena can be really adequate. 


Category C 


1. We can have increased faith in scientific explanations.
2. Science’s answer to a problem is apt to be more trustworthy. 

3. Greater reliance can generally be placed on scientific knowledge. 

4. The methods of science usually deserve higher confidence. 

5. In many cases scientific knowledge is the more dependable. 


Category D 


1. Science is one of many roads to knowledge, all of which are necessary. 

2. The explanations of science and our other beliefs need not conflict. 

3. Science’s views are equally as appropriate as others. 

4. Science complements but does not necessarily replace other kinds of knowledge. 
5. Science is an essential but not the sole means to knowledge. 


Category E 


1. For description we turn to science; for understanding, elsewhere.
2. Science can tell us how something happens but not why.


3. The meaning of scientific results is usually found outside science. 
4. Science is only equipped to discover facts not to interpret them. 
5. Science cannot often judge the significance of its findings. 


Category F


1. Many essential questions cannot be answered by science. 

2. The scientific view of the world is too restricted to be completely satisfying. 

3. Much of our knowledge will always need to be revealed by means other than science. 
4. Because of its limited scope, total reliance on science is inadvisable. 

5. The world is too complex to be adequately accounted for by science. 


F statement, “Many essential questions cannot 
be answered by science,” was more favorable 
toward science than the A statement, “Knowledge 
of all things can be attained through science.” 


TABLE 2
SUMMARY OF LINEAR TREND ANALYSIS

Source                              SS       df     MS       F
Within subjects (linear)            378.70   18
  Between categories (linear)        12.27    1     12.27    <1
  Groups X Categories (linear)       61.25    1     61.25    3.21*
  Categories X Subjects (linear)    305.18   16     19.07

* p < .05 for the directional alternative hypothesis.


There remained nine subjects in each group. The
progroup consisted of subjects whose scores on
the own attitude questionnaire ranged from the
eighty-ninth most favorable percentile to the
ninety-ninth; the median percentile was the 
ninety-seventh. The antigroup consisted of sub- 
jects whose scores ranged from the first most 
favorable percentile to the eighth; the median 
percentile was the third. Six of the prosubjects 
were males, while only three of the antisubjects 
were males. 

The experimental hypothesis tested was that
number of errors would be a positive function of
distance of the judged categories from the
subjects’ own position. More specifically, for the
extreme groups used here, we would expect errors
to increase from the AB to the EF comparisons


for the progroup and to decrease for the anti- 
group; of course, this increase and decrease 
would be relative to the total number of errors 
both groups made on the comparison. (We would 
expect certain comparisons to be intrinsically 
more difficult than others—regardless of the 
judge’s own position.) 

To test this hypothesis, a linear trend analysis 
of the Group X Comparisons interaction was 
performed (Winer, 1962, p. 353). A summary 
of this analysis is presented in Table 2. The 
mean square for the linear component of the 
interaction tested against the mean square for 
departures from linearity yielded an F of 3.21 
with df = 1/16. For the one-sided alternative 
hypothesis presented here, the F is significant at 
the .05 level. We reject the hypothesis that the
populations had equal slopes in favor of the 
alternative hypothesis that the slopes differed in 
the direction specified. 
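The arithmetic behind Table 2 can be checked with a short sketch. It is illustrative only: the variable names are hypothetical, the sums of squares are copied from Table 2, and reading the directional test as a one-tailed t on the square root of F (an identity that holds when the numerator has 1 df) is a standard step rather than one spelled out in the text.

```python
import math

# Values copied from Table 2 (linear components only).
ss_between, ss_interaction, ss_error = 12.27, 61.25, 305.18
df_interaction, df_error = 1, 16

ms_interaction = ss_interaction / df_interaction
ms_error = ss_error / df_error                      # 19.07 in Table 2
f_interaction = ms_interaction / ms_error           # 3.21 in Table 2
ss_within = ss_between + ss_interaction + ss_error  # 378.70 in Table 2

# With 1 numerator df, F equals t squared, so the directional test can be
# read as a one-tailed t test on sqrt(F) with 16 df.
t_equivalent = math.sqrt(f_interaction)

print(round(ms_error, 2), round(f_interaction, 2), round(ss_within, 2))
print(round(t_equivalent, 2))
```

The tabled one-tailed .05 critical value of t for 16 df is 1.746, so the t equivalent of the reported F is consistent with the significance level stated above.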

A related and somewhat simpler analysis based 
on difference scores yielded similar results. A 
difference score for each subject was computed 
by subtracting the number of DE and EF errors 
from the number of AB and BC errors. The t for the
difference between these scores for the two
groups was 2.03 (with, as predicted, the anti- 
group having the larger scores); this result is 
again significant at the .05 level for the 
directional alternative hypothesis. 
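The difference-score analysis can be sketched as follows. This is a minimal illustration under stated assumptions: the error counts and scores are hypothetical, the function names are ours, and the pooled-variance t for independent groups is assumed, since the text does not say which t test was used.

```python
import math
from statistics import mean, variance

def difference_score(errors):
    """(AB + BC errors) minus (DE + EF errors) for one subject.

    errors: dict of error counts keyed by comparison label.
    The CD comparison does not enter the score.
    """
    return (errors["AB"] + errors["BC"]) - (errors["DE"] + errors["EF"])

def pooled_t(a, b):
    """Independent-samples t with pooled variance; df = len(a) + len(b) - 2."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / math.sqrt(pooled_var * (1 / na + 1 / nb))

# Hypothetical difference scores for nine subjects per group (not the study's data).
anti_scores = [3, 1, 2, 4, 0, 2, 3, 1, 2]
pro_scores = [1, -1, 0, 2, -2, 0, 1, -1, 0]
t_value = pooled_t(anti_scores, pro_scores)  # positive when the antigroup scores are larger
```

With nine subjects per group this test has 16 df, the same error df as the trend analysis in Table 2.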

Group differences by category are shown in 
Figure 1, which plots the percentage of errors 
made by prosubjects in each comparison. The 
effect of differences between categories in over- 
all difficulty has been eliminated by considering 
what percentage of those errors actually made 
were made by each group.
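The normalization used for Figure 1 can be illustrated with a small sketch. The counts below are hypothetical and the function name is ours; the point is only that expressing each group's errors as a share of all errors on a comparison removes differences in overall difficulty between comparisons.

```python
def pro_share_of_errors(pro_errors, anti_errors):
    """Percentage of all errors on each comparison that were made by the progroup."""
    shares = {}
    for comparison in pro_errors:
        total = pro_errors[comparison] + anti_errors[comparison]
        # Leave the share undefined when nobody erred on this comparison.
        shares[comparison] = 100.0 * pro_errors[comparison] / total if total else None
    return shares

# Hypothetical error counts, not the study's data.
example = pro_share_of_errors({"AB": 2, "EF": 6}, {"AB": 6, "EF": 2})
```

A share above 50 means the progroup made more than half of the errors on that comparison, however easy or hard the comparison was overall.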


Discussion 


The data presented demonstrate that displacement
effects can be obtained with paired-comparison
judgments. In light of the Kelley et al. (1955)
findings and those of the present study,
the most reasonable conclusion at present is that
methods of data collection (equal-appearing
intervals, successive intervals, and paired
comparisons) will differ in terms of the magnitude,
but not the presence, of such effects.

It of course remains to be seen if displacement
effects for paired-comparison judgments
will be found for other kinds of attitudes, for
example, those involving high intensity or ego
involvement. There is some evidence that they
will. Careful examination of the Kelley et al.
plot of scale values of Negro subjects against
those for the white subjects reveals a slight but
consistent “bowing” effect. These previous authors
suggest that the four items contributing
most to this effect deal with somewhat different
content than the other items. It is interesting to
note, however, that the observed departures from
linearity are exactly the kind predicted by the
present study. Perhaps if Kelley et al. had
examined error pattern rather than overall agreement
their conclusions would have been similar
to ours.

Fig. 1. Percentage of total errors made by the nine
proscience subjects for each of the five category
comparisons.

The evidence reported here argues against the 
validity of the Thurstone assumption of inde- 
pendence of scale positions and comparative 
judgments of favorability. It is problematical, 
however, whether the displacement phenomena 
should limit application of Thurstone scales to 
groups and individuals differing in attitude po- 
sition. In the present study, subjects from the 
two groups essentially agreed on the rank order 
of categories. Indeed 15 of the 18 subjects gave 
the expected category order; the other 3 had 
only single reversals. 

The most important implications of the find- 
ings are that attitude-attitude comparisons are 
not qualitatively different from attitude-category 
boundary comparisons, and that the correspond- 
ing processes of categorization and discrimination 
are similarly affected by displacement. 

As displacement has been demonstrated now 
with a variety of judging methods, it would 
seem to deserve a more basic conceptual status 
than it has had heretofore. Indeed, the authors 
feel that attempts to deal with this phenomenon
theoretically must now involve basic laws of 
learning and/or social perception. 


REFERENCES 


Ager, J. W. Individual interval scaling for distinguishable
stimuli. Unpublished doctoral dissertation,
Syracuse University, 1961.

Beyle, H. C. A scale for the measurement of attitude
toward candidates for elective governmental
office. American Political Science Review, 1932, 26,
527-544.

Coombs, C. H. A theory of data. New York: Wiley,
1964.

Edwards, A. L. Techniques of attitude scale construction.
New York: Appleton-Century-Crofts, 1957.

Ferguson, L. W. The influence of individual attitudes
on construction of an attitude scale. Journal of
Social Psychology, 1935, 6, 115-117.

Granneberg, R. T. The influence of individual attitude
and attitude-intelligence upon scale values
of attitude items. American Psychologist, 1955,
10, 330-331. (Abstract)

Hinckley, E. D. The influence of individual opinion
on construction of an attitude scale. Journal of
Social Psychology, 1932, 3, 283-296.

Hovland, C. I., & Sherif, M. Judgmental phenomena
and scales of attitude measurement: Item displacement
in Thurstone scales. Journal of Abnormal
and Social Psychology, 1952, 47, 822-832.

Kelley, H. H., Hovland, C. I., Schwartz, M., &
Abelson, R. P. The influence of judges’ attitudes
in three methods of attitude scaling. Journal of
Social Psychology, 1955, 42, 147-158.

Manis, M. The interpretation of opinion statements
as a function of recipient attitude and source
prestige. Journal of Abnormal and Social Psychology,
1961, 62, 82-86.

Pintner, R., & Forlano, G. The influence of attitude
upon scaling of attitude items. Journal of Social
Psychology, 1937, 8, 39-45.

Thurstone, L. L., & Chave, E. J. The measurement
of attitude. Chicago: Univer. Chicago Press, 1929.

Upshaw, H. S. Own attitude as an anchor in equal-appearing
intervals. Journal of Abnormal and
Social Psychology, 1962, 64, 85-96.

Winer, B. J. Statistical principles in experimental
design. New York: McGraw-Hill, 1962.


(Received September 18, 1963) 


Journal of Personality and Social Psychology

The findings of the APA’s Project on Scientific Information Exchange in Psychology call
attention to the difficulties of the scientist in establishing and maintaining contact with current
work relevant to his interests. Accordingly, the Project is sponsoring a series of changes during
the next 12-15 months which are designed to enhance scientific communication. These changes
are planned as limited trials. Moreover, they are wholly supported by the Project and hence do
not cut into our page allotment.
The Editor has agreed to make possible a test of one of the changes, i.e., the titles and 
authors of accepted papers will be listed following the last article in the journal. Such listing 
will make it possible for our readers to become aware of research many months in advance of
its journal publication. At the end of the year’s trial the results of the trial will be evaluated and 
further consideration will be given at that time as to whether the service is worth continuing. 
This listing plus those published in the preceding 1965 issues are the entire backlog of manu- 
scripts accepted by this journal through October 1964. 


1964 (September-October) Manuscripts Accepted for Publication in the 
Journal of Personality and Social Psychology 


The Effect of Variable Perspectives on Judgments of Opinion Statements for Thurstone Scales: Equal-Appearing
Intervals: Harry S. Upshaw*: Department of Psychology, Bryn Mawr College, Bryn Mawr, Pennsylvania
19010.
Test-Anxiety Level, Heart Rate, and Anagram Problem Solving: Bernard W. Harleston,* M. Glenn Smith, and
Donald Arey: Department of Psychology, Tufts University, Medford, Massachusetts 02155.
The Locus and Orientation of the Perceiver (Ego, Self) under Variable, Constant, and No-Perspective Instructions:
Thomas Natsoulas*: Department of Psychology, University of California, Davis, California 95616.
Role of Success and Failure in the Learning of Easy and Complex Tasks: Bernard Weiner*: Center for Personality
Research, University of Minnesota, Minneapolis, Minnesota 55455.
Effects of Uncertainty about the Nature and Advent of a Noxious Stimulus (Shock) upon Heart Rate: Rogers
Elliott*: Department of Psychology, Dartmouth College, Hanover, New Hampshire 03755.
Determinants of the Rehearsal and Transmission of Neutral and Aversive Behaviors: Walter Mischel* and Joan
Grusec: Department of Psychology, Stanford University, Stanford, California.
Group Performance as a Function of Size, Structure, and Task Difficulty: II: Julian O. Morrissette*: 725 Wright
Street, Yellow Springs, Ohio 45387.
Opponent’s Personality, Expectation of Social Interaction, and Interpersonal Bargaining: David Marlowe,
Kenneth J. Gergen,* and Anthony N. Doob: Department of Social Relations, Harvard University, 9 Bow Street,
Cambridge, Massachusetts 02138.
Childhood Prejudice as a Function of Parental Ethnocentrism, Punitiveness, and Outgroup Characteristics:
Ralph Epstein* and S. S. Komorita: Department of Psychology, Wayne State University, Detroit, Michigan
48202.
Individual Perceptual Styles following Induced Failure: Donald R. Brown* and R. James Yandell: Center for
Research in Learning and Teaching, University of Michigan, Ann Arbor, Michigan 48104.
Arousal of Need for Affiliation in Women: Howard M. Rosenfeld* and Samuel S. Franklin: Bureau of Child
Research, University of Kansas, Lawrence, Kansas 66045.
Expectations [illegible] and Compatibility as Related to Status Discrepancy and Social Motives:
Robert [illegible] and Howard M. Rosenfeld*: Bureau of Child Research, University of Kansas, Lawrence,
Kansas 66045.
Beyond Parkinson’s Law: The Effect of Excess Time on Subsequent Performance: Elliot Aronson* and Eugene
Gerard: Laboratory for Research in Social Relations, University of Minnesota, Minneapolis, Minnesota 55455.
Father Characteristics and Sex Types: Jules M. Greenstein*: Stern College for Women, Yeshiva University, 253
Lexington Avenue, New York 16, New York.
The Physiology of Catharsis: Michael Kahn*: Department of Psychology, Yale University, 333 Cedar Street, New
Haven, Connecticut.
Effects of Prior Success and Failure on Expectations of Success and Subsequent Performance: N. T. Feather*:
Department of Psychology, University of New England, Armidale, N.S.W., Australia.
Generalization of Attitude Change through Congruity Principle Relationships: Percy H. Tannenbaum* and Roy
W. Gengel: Mass Communications Research Center, University of Wisconsin, Madison 9, Wisconsin.
Role Reward and Dissonance Reduction: John Wallace*: Department of Psychology, Stanford University,
Stanford, California.
A Test of a Model for Commitment: Charles A. Kiesler* and Joseph Sakumura: Department of Psychology, Yale
University, New Haven, Connecticut.
The Locus and Orientation of the Perceiver (Ego, Self) and the Rotation of Visual Images: Thomas Natsoulas*
and John T. Murphy: Department of Psychology, University of California, Davis, California 95616.
Conformity and Authoritarianism Reexamined: Graham M. Vaughan* and Kenneth D. White: Victoria
University of Wellington, P. O. Box 196, Wellington, New Zealand.
Evaluation of Message and Communicator as a Function of Involvement: Alice H. Eagly and Melvin Manis*:
Department of Psychology, University of Michigan, Ann Arbor, Michigan 48104.

* Asterisk indicates author for whom the address is supplied.


Journal of Personality and Social Psychology
1965, Vol. 1, No. 6, 541-550


IMMUNIZATION, DELAY, AND THE INTERPRETATION
OF PERSUASIVE MESSAGES ¹


MELVIN MANIS


University of Michigan and Veterans Administration Hospital, Ann Arbor


3 experiments were conducted in which Ss judged the favorability of persuasive 
messages under 3 main conditions: (a) immunization, a condition in which S 
was first exposed to counterarguments against the message to be judged later; 
(b) immunization + delay, a condition in which several days elapsed between 
the 2 opposing messages; and (c) no immunization. The results indicated that 
immunization with no delay led to contrast effects; i.e., the 2nd of the opposing 
messages was displaced away from the 1st. When, however, a 2-5 day interval
separated the 2 messages, assimilation was obtained. These results were discussed
in terms of several distinct theoretical models.


In a recent study, Manis and Blake (1963)
investigated the way in which subjects’ in- 
terpretations of persuasive messages were af- 
fected by presenting counterarguments (im- 
munizing messages) against the messages that 
were subsequently to be judged (the test mes- 
sages). In two separate experiments, this pro- 
cedure yielded a significant tendency to dis- 
place the perceived content of the test message 
toward the stand previously advocated in the 
immunization message. This so-called assimila- 
tion effect was tentatively attributed to the 
subjects’ attempts to minimize the discrepancy
between the opposing statements, both of 
which appeared to emanate from prestigeful 
sources. 

The results of these studies were, however,
quite inconsistent with the contrast effects
previously reported by Janis and Feierabend
(1957). Subjects in their experiment rated the
contents of each paragraph in an authoritative
communication which urged them to volunteer
for civil-defense work; half of the paragraphs
stressed the positive features of this
work, and half pointed to the inconvenience
and hardship that was entailed. When the
negative arguments were presented last, they
were rated as being more unfavorable than
when they were presented first; conversely,
the positive arguments were regarded as more
favorable when presented following the negative
arguments. In attempting to reconcile the
divergent results of these studies, it was noted
that while Janis and Feierabend’s subjects
rated the two sets of arguments within the
same experimental session, Manis and Blake
introduced a 2-5 day interval between presentations
of the opposing messages. This suggested
that the time intervening between the
judgment of a message and earlier presentations
of opposing content may be a crucial
variable in determining the displacement effects
that are obtained; more particularly, it
was hypothesized that contrast effects would
be obtained if opposing messages were presented
for judgment with no intervening delay,
while assimilation would result if the second
message was presented several days after the
first. The present paper reports three experimental
tests of this hypothesis.

¹ All statements are those of the author and do not
necessarily represent the opinions or policy of the
Veterans Administration. The author is grateful to
Jerome Katz and Terry Gleason for help in collecting
and analyzing the data; the editorial comments
of Robyn M. Dawes and Herbert C. Kelman are
also acknowledged.


EXPERIMENT I 


Method 

Subjects. This study was conducted with 24 col- 
lege students serving as subjects; there were 23 fe- 
males and 1 male. Data were collected during regular 
class hours of a speech course. 

Procedure. The procedures were rather similar to 
those employed in our earlier study (Manis & Blake, 
1963) and had been largely adapted from the work 
of McGuire and his associates (McGuire, 1961, 1962; 
McGuire & Papageorgis, 1961). 

First, in order to heighten motivation and disguise
the true purpose of the experiment, subjects were
told that the research was concerned with the development
of a test for selecting people who were gifted
in scientific areas and that, accordingly, the test
materials would be concerned with the relatively
familiar topics of health and medicine.

The experiment was conducted in two sessions
which were held 48 hours apart. Messages dealing 
with four different health topics were employed: the 
value of penicillin, the effects of daily toothbrushing, 
the value of X-ray examinations in the detection of 
TB, and the desirability of an annual physical 
checkup. Each subject rated the contents of test 
messages dealing with each of the four topics. In 
Experimental Session 1, all subjects were presented 
with immunization passages against two of the test 
messages that were subsequently to be judged; dur- 
ing Session 1, subjects also received one test passage 
which was in direct (point by point) disagreement 
with one of these immunizing passages, plus a test 
message which he had not been immunized against. 
In Session 2, the subject was presented with two 
more test passages; one of these passages contra- 
dicted an immunizing message that the subject had 
read in Session 1, while the other dealt with a topic 
that the subject had not been immunized against. In 
summary then, each subject interpreted the various 
test messages under four different experimental con- 
ditions: 

1. Immunization: here, the immunization and test
messages were both presented in Session 1, with no
intervening delay.

2. No immunization₁: as a control for the above
immunization procedure, judgments in this condition
were also obtained in Session 1, but with no
immunization.

3. Immunization + delay: in this condition, the
test messages were presented in Session 2, while the
immunization materials were presented in Session 1.

4. No immunization₂: as a control for the
immunization + delay condition, these test messages
were presented in Experimental Session 2, but with
no preceding immunization.

In obtaining judgments under these four conditions,
the usual counterbalancing procedures were used to
control for order effects and for the specific contents
of the various messages.

Fig. 1. Message interpretation as affected by
immunization and delay (Experiment I).

All messages were presented as emanating from 
physicians affiliated with major medical centers, 
After reading each message, subjects were asked to
indicate the communicator’s probable degree of agree- 
ment with four statements of opinion, using a 15- 
point scale which ranged from “definitely agree” to 
“definitely disagree.” The average of these ratings 
was used as a measure of each communicator’s per- 
ceived scale position and constituted the main de- 
pendent variable of the study. 

At the end of Session 2, after all the messages had 
been rated, the subjects were asked to indicate their 
own beliefs regarding the four health issues using the 
same rating scales, and, finally, were informed of the 
true purpose of the experiment. 

The experiment was conducted using both favor- 
able and unfavorable test passages. That is, for half 
the subjects we were concerned with the influence of
previously presented unfavorable messages (for example,
antipenicillin) on the subsequent judgment of
favorable (for example, propenicillin) test passages;
for the other half, subjects’ judgments of unfavorable
test passages served as the dependent variable. Since
previous research (McGuire & Papageorgis, 1961)
suggests that college students have strongly favorable
attitudes towards each of the experimental issues,
subjects exposed to the favorable test passages provide
data concerning the effects of our independent
variables when the subject’s initial attitude is in essential
agreement with the test message to be judged;
subjects presented with unfavorable test messages
provide data for the case in which the subject opposes
the message to be judged.


Results 


The data obtained from the favorable and
unfavorable test messages were analyzed separately.
Before applying formal significance
tests, however, in order to eliminate effects
attributable to differences between the sets of
opinion items and differences in the “extremeness”
of the various test messages, the judgment
data were corrected by the use of additive
constants; these constants were selected
so that the corrected mean for each test message
(pooling the various contexts in which it
was presented) would equal the common grand
mean. The own-opinion data were corrected
in a similar fashion.
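The additive-constant correction can be sketched in a few lines. This is a minimal illustration under stated assumptions: the ratings and topic labels below are hypothetical, the function name is ours, and the grand mean is taken over all pooled ratings, which matches the mean of the message means only when every message has the same number of ratings.

```python
from statistics import mean

def correct_by_additive_constants(ratings):
    """Shift each message's ratings so its mean equals the common grand mean.

    ratings: dict mapping message name -> list of raw judgments,
    pooled over the contexts in which the message was presented.
    """
    grand_mean = mean(x for values in ratings.values() for x in values)
    corrected = {}
    for message, values in ratings.items():
        constant = grand_mean - mean(values)  # additive constant for this message
        corrected[message] = [x + constant for x in values]
    return corrected, grand_mean

# Hypothetical raw judgments on the 15-point scale (not the study's data).
raw = {"penicillin": [12.0, 13.0, 14.0], "toothbrushing": [9.0, 10.0, 11.0]}
corrected, grand_mean = correct_by_additive_constants(raw)
```

Because one constant is added to every rating of a given message, differences between conditions within a message are left unchanged; only the between-message differences in level are removed.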

Figure 1 presents the mean judgments elicited
by positive and negative test passages
under the four different experimental conditions
described above; judgments that are
low on the ordinate reflect a belief that the
communicators held unfavorable opinions regarding
the various experimental issues. Each
data point represents the average response to
four different messages (n = 3 for each message).

An analysis of the ratings elicited by the
favorable test messages (Winer, 1962, p. 350)²
revealed a significant interaction (p < .025)
between immunization (versus no immunization)
and delay (versus no delay). For the
data collected in Session 1 (see Figure 1, solid
line), a t-test comparison indicated that favorable
test messages which were immediately
preceded by an unfavorable immunizing message
were significantly displaced toward the
positive end of the scale, as compared with
favorable messages which were judged with no
prior immunization (contrast effect; t = 2.43,
df = 11, p < .025, one-tailed test). Data collected
in Session 2 (dashed line) followed the
opposite pattern; that is, favorable messages
which were rated 48 hours after the presentation
of an unfavorable immunizing message
were displaced toward the negative end of the
scale (assimilation effect; t = 2.03, df = 11,
p < .05, one-tailed test).
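The one-tailed probabilities quoted for these t values can be checked numerically. The sketch below is an illustration rather than the author's procedure: it integrates the Student t density with Simpson's rule using only the standard library, and the function names and step count are assumptions.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def one_tailed_p(t, df, steps=2000):
    """Upper-tail probability P(T > t) for t >= 0, via Simpson's rule on [0, t]."""
    h = t / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

p_contrast = one_tailed_p(2.43, 11)      # immediate immunization
p_assimilation = one_tailed_p(2.03, 11)  # immunization + delay
print(p_contrast < 0.025, p_assimilation < 0.05)
```

Both probabilities fall inside the bounds reported in the text: the first lies between .01 and .025, the second between .025 and .05.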

This analysis also indicated that there were
significant differences between the various
subgroups which were established to provide
for counterbalancing. The origin of this effect
is quite obscure, however, and may simply
represent a Type I error. More important,
perhaps, is the fact that the difference between
subgroups did not interact with the variables
of primary interest, and we may therefore
conclude that the effects of immunization and
delay were essentially the same in each of the
experimental subgroups.


The data obtained from the unfavorable
messages revealed no significant trends; immunization
did not produce any systematic
effects on the subjects’ final judgments. When
the data from the favorable and unfavorable
test messages are combined, the results are in
general accord with the predicted trends, but
are not significant. That is, t tests comparing
immunized and unimmunized judgments indicate
that immediate immunization produced
an overall contrast effect (p < .10, one-tailed
test); a similar analysis of the immunization
+ delay data revealed an assimilative
trend (p < .15, one-tailed test). Inspection of
Figure 1 indicates that these combined results
are exclusively a function of the favorable
test messages; the unfavorable messages yield
data which are mildly opposed to hypothesis,
although these effects are far from reliable.

² Winer’s (1962) analysis is designed for studies
involving two within-group and two between-group
variables. Immunization and delay were the within
variables; the between variables derived from the
fact that the counterbalancing procedures resulted in
four experimental subgroups which could be placed
within the context of a 2 X 2 factorial design.


Discussion 

The results of Experiment I were both 
promising and perplexing. Promising, because 
the data from the favorable test messages 
were in accord with our hypothesis; perplex- 
ing, in that these results (and those of Manis 
& Blake, 1963) were not replicated with the 
unfavorable test messages. 


EXPERIMENT II 


Experiment II was designed as an addi- 
tional test of our hypothesis concerning the 
effects of immunization and delay on the 
subsequent judgment of persuasive messages. 
In this experiment, only unfavorable test messages
were employed; thus the test messages
were in opposition to the subjects’ generally 
favorable opinions concerning our experi- 
mental topics. 


Method 


Subjects. Experiment II was conducted using 18
female and 14 male college student subjects who
participated as part of a psychology course requirement.

Procedure. In the main, Experiment II followed
the procedures of Experiment I; thus, the study was
again introduced as an attempt to develop a selection
device for people who were gifted in scientific
areas. There were, however, several relatively minor
differences.

1. In this experiment, the delay and no-delay data
were obtained from separate groups of subjects. Only
two message topics were employed: penicillin and
the virtues of X-ray examination in the detection
of TB; unfavorable messages which essentially attacked
these practices served as test stimuli for all
subjects. Each subject rated both test messages after
having been immunized in one area but not in the
other. For half the subjects, the immunization passage
and the two test passages were presented with
no intervening delay (that is, all the data were collected
in a single session); for the other half of the
subjects there was a 5-day interval between immunization
and the presentation of the two critical
test passages.

2. Rather than indicating each communicator’s
position on a set of Likert-type scales, the subject
used a set of six evaluative semantic differential
scales (Osgood, Suci, & Tannenbaum, 1957) for this
purpose.

3. Before rating each passage, to insure careful
reading, all subjects were instructed to underline the
most important clause or sentence in each paragraph.

Counterbalancing procedures were again employed
to control for order effects and differences between
messages in the two topic areas.


Results


As in Experiment I, to eliminate differences
between the various test passages, empirically
determined constants were added to the obtained
ratings so as to set the mean judgment
elicited by each message (across experimental
conditions) equal to the grand mean for all
messages. When these corrected judgment
data were subjected to analysis of variance
(Type II mixed design; Lindquist, 1953)³
there was a general contrast effect (p < .10)
following immunization. This tendency to displace
the test message away from the immunization
message appeared to be essentially unaffected
by the time interval separating the
two.


Discussion


The results of this experiment supported 
only one part of our hypothesis; while im- 
munization did lead to contrast effects in the 
no-delay condition, assimilation effects were 
not obtained following immunization and de- 
lay. 


EXPERIMENT III


Since the results of Experiment II were
somewhat less promising than those of Experiment
I, it seemed desirable to replicate
our original effects before reaching any final
conclusions. In comparing the first two experiments,
it was noted that in Experiment I,
subjects rated the content of the various
messages on a series of rather specific
scales (for example, chest X-ray examinations
for TB should be taken regularly and often),
while Experiment II employed general evaluative
rating scales of the semantic differential
variety (for example, good-bad). It will
be recalled that our previously obtained
assimilation effects (Manis & Blake, 1963) were
tentatively attributed to the recipient’s
attempts to minimize the apparent conflict
(dissonance) between the immunization and test
messages. It now seemed possible that
dissonance might not lead to assimilation
unless the subject was forced to notice the
communicators’ disagreement at a fairly concrete
and detailed level (as in the Likert scales),
and that this effect would be negligible if
the subject was more focused on
evaluative content (as called for in the
semantic differential). In Experiment III, as in
Experiment I, therefore, subjects’ judgments
were obtained on Likert-type items.

³ This design is for situations involving two
between-group and one within-group variables. The
between-group variables were delay versus no delay,
and a subgroup classification introduced to provide
for counterbalancing; immunization (versus no
immunization) constituted the within-group variable.


Method 


Subjects. Subjects were summer-session students
at the University of Michigan who were paid for
participating in the study (N = 60); there were 31
females and 29 males.

Procedure. Experiment III closely resembled
Experiment I in design, content, and method of
assessment. There were three experimental topics:
penicillin, the virtues of frequent toothbrushing, and
the use of X-ray examinations in the detection of
TB. After reading each message, subjects underlined
the most important part of each paragraph, and then
indicated the communicator’s probable response to
four Likert-type opinion items on a 15-point scale
that ranged from “definitely agree” to “definitely
disagree.” All messages were represented as emanating
from prestigeful medical sources.

Data were collected in two sessions separated by
a 5-day interval. In Session 1, subjects were immunized
in one of the three topic areas; in Session
2, they were immunized in a second area, and responded
to test messages in all three areas. The
three test messages thus provided data concerning
each subject’s judgments in three experimental conditions:
immunization + delay, immunization with no
delay, and no immunization. As in Experiments I
and II, the usual counterbalancing procedures were
employed. Moreover, as in Experiment I, half the
subjects received the favorable messages as test messages,
while for the other half the unfavorable messages
played this role; thus, we were concerned with
the effect of immunization on the judgment of subsequent
messages that the subjects agreed with and
those that they disagreed with.


Results


Figure 2 presents the mean judgments elic- 
ited by the test passages under the three ex- 
perimental conditions. As in Experiments I 
and II, empirically derived constants were 
employed to eliminate effects attributable to 
trivial differences between the various mes- 
sage topics. 

Data obtained from the favorable test mes- 
sages are in accord with hypothesis, and 
closely parallel those reported in Experiment 
I (see Figure 1). That is, test messages pre- 
sented directly after immunization were dis- 
placed away from the immunization passages 
(from a no-immunization “base-line” judg- 
ment of 13.33 to a judgment of 13.97); dis- 
placement in the opposite direction (from 
13.33 to 13.15) was obtained when there was 
a 5-day interval between presentations of the 
immunization and test messages. Analysis of 
variance (Type I mixed design; Lindquist,
1953)⁴ confirmed the fact that the three ex-
perimental conditions lead to significantly
different judgments (p < .025). Subsequent 
t tests revealed a significant difference be-
tween the no-immunization and immediate-
immunization conditions (p < .025, one-tailed
test); delayed immunization did not, however,
yield results which were reliably different 
from those obtained with no immunization 
(p < .35, one-tailed test). 

Turning now to the unfavorable test pas-
sages, analysis of variance indicated that the
main effect associated with the different im- 
munization conditions was far from signifi- 
cant (F < 1.00); while the general trend of 
the data suggested that immediate immuniza- 
tion tended to produce contrast (see Figure 
2), immunization + delay had essentially no 
effect. The failure to obtain a significant main
effect in this block of data was, perhaps, at-
tributable to the presence of an interaction
(p < .01) between the three immunization
conditions and the various experimental sub-
groups which were created to provide for
counterbalancing. While the design of this
study makes it impossible to interpret this
result unequivocally, it is most likely due
either to unknown differences between the
messages presented in the various topic areas,
or the order in which the experimental condi-
tions occurred.

4 The experiment involved one between-group
factor (to provide proper counterbalancing) and one
within-group factor (immunization, versus immuni-
zation + delay, versus no immunization).

Fig. 2. Message interpretation as affected by im-
munization and delay (Experiment III).

Overall, when the results from the favorable 
and unfavorable test messages are combined, 
a significant contrast effect is obtained im-
mediately following immunization (t = 1.78,
df = 59, p < .05, one-tailed test); the com-
bined data for the immunization + delay
condition yield an assimilative trend which is
far from significant (t < 1.00).

An intraindividual analysis. As noted 
earlier, the preceding experiments all em- 
ployed some form of repeated measures de- 
sign; as a consequence, after the “raw” judg- 
ment data had been corrected so as to permit 
meaningful cross-topic comparisons (see
above), it was possible to determine whether 
or not each subject had conformed to hypoth- 
esis by comparing his “base-line” (unimmu- 
nized) judgments with his judgments of test 
messages that had been preceded by im-
munization. Table 1 presents the results of
this analysis. The first two sections present 
the results obtained when subjects were ex- 
posed to the favorable and unfavorable test 
messages, respectively; the overall results ob-
tained by summing across the two types of test
messages and across the three experiments,
are shown in the last row of the table. For ex- 
ample, of the 12 subjects presented with fa- 
vorable test messages in Experiment I, the 
ratings obtained immediately after immuni- 
zation showed contrast effects in 8 cases and 
assimilation in 4 (see Table 1); under similar 
judgmental conditions, 20 subjects in Experi- 
ment III showed contrast, while only 10 
showed assimilation (Experiment II did not
employ favorable test messages). Continuing 
with the results obtained from favorable test 
messages, we find that in both Experiments I 
and III, assimilation effects were more fre- 
quent than contrast when a delay period was 
interjected between the immunization and test 
messages. Summing across the two experi- 
ments, the results indicate that immediately 
after immunization, 28 subjects showed con- 
trast while only 14 showed assimilation 
(p < .025); following immunization + delay, 
the results are reversed—only 13 subjects 
showed contrast, while 29 showed assimilation 
(p < .025).
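
The probabilities attached to these assimilation/contrast counts can be approximated with a short calculation. The source does not name the test that was used; the sketch below assumes a one-tailed sign (binomial) test with a chance split of .5, and the function name `sign_test_p` is hypothetical. The counts are those reported in the text and in Table 1.

```python
from math import comb


def sign_test_p(larger, smaller):
    """One-tailed sign test: probability of at least `larger` cases out of
    larger + smaller when both outcomes are equally likely (assumption)."""
    n = larger + smaller
    tail = sum(comb(n, k) for k in range(larger, n + 1))
    return tail / 2 ** n


# (contrast, assimilation) counts immediately after immunization
cases = [
    ("Experiment I, favorable messages", 8, 4),
    ("Experiments I and III, favorable messages", 28, 14),
    ("All data combined", 61, 37),
]
for label, contrast, assimilation in cases:
    print(f"{label}: p = {sign_test_p(contrast, assimilation):.3f}")
```

Under this assumption the 8-to-4 split gives a probability of about .19 and the 28-to-14 split falls below .025, which is in line with the reported values; this agreement suggests, but does not establish, that a sign test was the procedure used.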

It is interesting to note that despite the 
generally negative findings obtained with the 
unfavorable test messages, the results of the 
intraindividual analysis (see Table 1) are 
surprisingly consistent with our main hy- 
pothesis. Thus, in Experiments II and III, 
contrast effects were more common than as-
similation when test messages were presented
immediately after immunization (the two
reactions were equally common in Experiment
I); when a delay period was interjected be-
tween the immunization and test messages,
assimilation was more common than con-
trast in all three experiments. As shown, how-
ever, neither of these trends was statistically
significant, even when the results of all three
experiments are combined.

The last row of Table 1 indicates that when
the data from the favorable and unfavorable
test messages are combined, the obtained re-
sults are highly significant and consistent with
hypothesis. That is, immediate immunization
typically leads to contrast (p < .012), while
immunization + delay produces assimilation
(p < .012).

Own-opinion data. To provide a more com-
plete account of the subjects’ reactions to the
immunization procedures, Table 2 presents a
summary of the own-opinion data from the
three experiments. High numbers indicate a
favorable attitude toward the various issues;
results obtained from the favorable and un-
favorable test messages are shown separately.⁵

5 As noted previously, the subjects’ opinions were
assessed only once, after they had received the
change messages; the design of Experiments I and
III made it possible, however, to estimate the views
that subjects held directly after immunization (but
before they received the change messages). More
specifically, in Experiments I and III the passages
which served as immunizing agents for subjects in
one experimental group (for example, those receiving
favorable test messages) also served as change mes-
sages for subjects in the contrasting group (for ex-
ample, those receiving unfavorable test messages).
The subjects’ attitudes following receipt of an un-
favorable immunizing passage were therefore assumed
to be the same as the attitudes obtained when an
unfavorable change message was read with no prior
immunization; a similar procedure was employed
in estimating the postimmunization views of subjects
who read favorable immunizing passages.


TABLE 1
AN INTRAINDIVIDUAL ANALYSIS OF RESULTS OBTAINED IN EXPERIMENTS I, II, AND III

                                    Immunization               Immunization + delay
                             Assimilation  Contrast    pᵃ      Assimilation  Contrast
Favorable test messages
  Experiment I                     4           8       .19           9           3
  Experiment IIᵇ                   —           —        —            —           —
  Experiment III                  10          20      <.06          20          10
  Sum of I and III                14          28      <.025         29          13
Unfavorable test messages
  Experiment I                     6           6        —            7           5
  Experiment II                    5           9       .21           9           7
  Experiment III                  12          18      <.25          17          13
  Sum of I, II, and III           23          33      <.15          33          25
Sum of all data (favorable
  and unfavorable)                37          61      <.012         62          38

Note. Subjects who showed neither assimilation nor contrast are omitted from this analysis.
ᵃ One-tailed tests.
ᵇ Experiment II did not include a condition in which subjects judged favorable test messages.


TABLE 2
OWN OPINIONS AS A FUNCTION OF IMMUNIZATION AND DELAY

                                           Favorable test messages    Unfavorable test messages
Events preceding                                 Experiment                  Experiment
attitude assessment                          I       IIᵃ     III         I       IIᵇ     III
Immunization (no change)                    8.1      —       8.1        13.2     —      12.0
Immunization and change (no delay)         10.2      —       9.9        10.5    6.03    10.3
Immunization, delay, and change            11.8      —      10.7         9.8    5.86     9.7
Change message (no prior immunization)     13.2ᶜ     —      12.0         8.1ᶜ   5.14     8.1

ᵃ Experiment II did not include a condition with favorable test messages.
ᵇ Attitudes in Experiment II were assessed with 7-step semantic differential scales,
while Experiments I and III used 15-step Likert-type items.
ᶜ There were two distinct no-immunization conditions in Experiments I and II, in that
unimmunized change messages were presented in both Test Sessions 1 and 2; for
Experiments I and II, therefore, the no-immunization entries result from an averaging
of these two conditions.


Inspection of Table 2 reveals two consistent
trends which appear within each of the five
sets of data that are reported there. First, in
accord with McGuire’s (1961, 1962) results,
note that immunization served to reduce the
impact of the change messages; there was less
acceptance of these messages following im-
munization than there was in the no-immuni-
zation condition. Second, although this re-
duced acceptance appears within both the de-
lay and the no-delay conditions, a comparison
of the two sets of data suggests that immuni-
zation tends to lose its impact with the pas-
sage of time, for introduction of the delay
period leads to somewhat greater acceptance
of the change messages (for similar results see
Miller & Campbell, 1959, Conditions 1 and
3; also McGuire, 1962).


DISCUSSION


The experiments that are reported above
provide mixed support for the hypothesis that
guided this research. While inconsistent effects
were obtained with the unfavorable test mes-
sages, favorable test messages (with which the
subjects agreed) produced rather clear-cut
positive results. That is, when the rating of a 
favorable test message was immediately pre- 
ceded by a passage of opposing content, con- 
trast was obtained; when there was an inter- 
vening delay of 2-5 days between presenta- 
tions of the opposing passages, assimilation 
effects were noted. 

Despite the striking consistency of these 
data, they are not easily explained, for they 
are not readily deducible from any of the 
now-popular theories of social judgment. For 
example, while some approaches (Helson, 
1947; Volkmann, 1951) lead to the expecta-
tion that contrast will be obtained when op-
posing messages are presented for successive
judgments, neither of these schemes leads to 
the prediction that assimilation (not merely a 
diminished contrast effect) will result when 
the two messages are separated by a delay 
period. 

In attempting to place these results within 
a general theoretical context, four rather dif- 
ferent approaches seemed promising; un- 
fortunately, the available evidence does not 
permit a rational choice among these alterna- 
tives. 

1. The Hovland-Sherif (1961) theory of as- 
similation and contrast can be applied if we 
stress the fact that immediately after immuni- 
zation the subject’s views have been changed, 
making the discrepancy between “self” and 
“other” sufficiently great that contrast is ob- 
tained. With the passage of time, however, 
the subject’s beliefs drift back toward his ini-
tial views (and toward the test message), ulti-
mately producing an assimilation effect; a
more modest assimilative trend may also have 
been present (although it cannot be evaluated
in these experiments) given the relatively 
small self-other discrepancy which presumably 
existed when the test message was judged with 
no previous immunization. Perhaps the main 
difficulty with this approach is the fact that a 
previous study (Manis, 1961b) using pres- 
tigeful sources did not show any evidence of 
contrast, regardless of the discrepancy be- 
tween the judge’s views and the message con- 
tents. These earlier data thus suggested that 
the assimilation and contrast model may only 
hold in the case of an anonymous message 
source, while the present studies all employed 
high prestige sources. 

2. An alternate application of the Hovland- 
Sherif (1961) theory might start with the 
assumption that the subject’s recall of the 
immunizing message is gradually displaced 
toward the middle of the scale with the pas- 
sage of time (such an assumption would be 
consistent with previous findings concerning 
the displacement of ambiguous communica- 
tions—Manis, 1961a). Starting from this 
premise, we might anticipate that when the 
test message is presented for judgment im- 
mediately after immunization, contrast is ob- 
tained because of the marked discrepancy 
between these two inputs; several days after 
immunization, however, the hypothesized “mi-
gration” of the immunizing message toward
the middle of the scale may bring it suffi- 
ciently close to the test message that assimila- 
tion occurs. 


Fig. 3. A hypothetical curve showing the effects of immunization, delay, and self-
other discrepancy on pressure to change and message displacement. (Axes: pressure
to change against scale position, with scale points labeled neutral and favorable.)


3. The obtained results may have been
generated by two opposing processes which
differed in importance as a function of time.
Thus, it might be argued that when the im-
munization and test passages were presented
with no intervening delay, the obtained con-
trast effect was mainly attributable to the
sort of perceptual process that has been well
documented in previous judgmental research
concerned with anchoring effects (Hovland &
Sherif, 1961). This perceptual effect may,
however, show a rapid decline, so that when a
delay period is introduced, the data primarily
reflect the continuing operation of a dis-
sonance-reducing process which was initially
“masked”; that is, when the perceptual proc-
ess has run its course, assimilation effects
become manifest as the subjects attempt to
reduce the apparent conflict (dissonance) be-
tween the “prestigeful” authors of the im-
munization and test passages.

4. A “single-process” explanation can be
constructed if we start with the assumption
(Festinger, 1957; Manis, 1961a, 1961b) that
distortions in this type of situation often re-
sult from the subject’s attempts to reduce the
pressure to change that is generated by incom-
ing messages. Figure 3 presents a series of
hypothetical relationships showing how pres-
sure to change might be affected by the dis-
crepancy between “own views” and various
incoming messages when there has been no
immunization (solid line), immediately after
immunization (dashed line), and several days
after immunization (dotted line). In brief, we
assume that the average subject entered the
experiments with relatively favorable attitudes
toward the different experimental issues and
may consequently be located at Point S0 on
the attitude continuum. The unbroken line in- 
dicates the extent to which this average sub- 
ject might feel pressure to change his views 
should he receive various test messages more 
favorable than his initial beliefs from the sort 
of “trustworthy” sources that were employed 
in these studies.⁶ Note that we have assumed
this to be a monotonic rising function; that is, 
given no immunization, the greater the dis- 
crepancy between own position and the views 
of a prestigeful other, the greater the pressure 
to change. We further assume that given the 
ambiguity inherent in any message, the sub- 
ject will choose that interpretation which is 
accompanied by minimal pressure to change, 
and are thus led to the prediction that under 
no-immunization (base-line) conditions, the
average subject may well have assimilated the
test message from its veridical location (Mv)
to a point less discrepant with his own stand
(M0).

Let us now consider the immediate effects 
of immunization; upon receiving an unfavor- 
able immunizing message from a prestigeful 
medical source, the average subject reacted by 
shifting his own views to a point (S1) about 
midway on the attitude continuum (see Table 
2). A second hypothesized effect is indicated 
by the dashed line in Figure 3, which suggests 
that beyond a given point, increases in the 
self-other discrepancy are accompanied by 
decreased pressure to change. The descending
portion of this curve is presumed to reflect the
subject’s relative resistance (and mistrust) in
response to extreme messages that have been
immediately preceded by contradictory state-
ments from an apparently trustworthy source.
Given these assumptions, we are led to the
prediction that immediately after immuniza-
tion, the subject may most effectively reduce
the influence potential of extreme messages
(such as were employed in these studies) by
displacing them to a still more extreme posi-
tion on the attitude continuum (for example,
from Mv to M1). Note also that less extreme
messages (which fall to the left of maximum
pressure to change) should show assimilation.

6 Data from Experiments I and III suggest that
the favorable test messages were somewhat more
extreme than the subjects’ initial views.

When immunization has taken place several
days before the presentation of the test mes- 
sage, two main changes are assumed: First, 
the subject’s own views may be expected to 
shift back toward his original position (from 
S1 to S2). Second, as a further consequence of
the passage of time and of message-source 
dissociation (Kelman & Hovland, 1953), we 
assume that the subject is less affected by the 
apparent status of the immunizing communi- 
cator and hence is more likely to accept the 
“given” credibility of subsequent prestigeful 
authors, even though they may disagree with 
the first communicator (and even though they 
may disagree with the subject). Figure 3 
(dotted line) suggests that as a result of these 
two effects, the maximum pressure to change 
now occurs in response to a more extreme test 
message (and a greater self-other discrep-
ancy) than was true immediately after im-
munization.⁷ As a consequence, when immuni-
zation has been followed by a moderate delay,
the subject may reduce the influence poten- 
tial of a test message which was previously 
contrasted, by displacing it towards his pre-
ferred stand at S2 (assimilation). Note that
the magnitude of this assimilation is depicted 
as exceeding that which occurs when there 
has been no immunization, presumably be- 
cause of the greater pressure to change which 
would accompany a veridical interpretation of 
the test message when the subject is located 
at S2 rather than S0 (see Figure 3).
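
The logic of this single-process account can be traced in a small numerical sketch. Everything quantitative here is an assumption made for illustration: the scale values, the linear and tent-shaped pressure functions, the width of the ambiguity range, and the function names are hypothetical and are not taken from Figure 3. The sketch only shows that choosing the least-pressure reading yields assimilation on a rising curve and contrast on the descending limb of a peaked curve.

```python
def rising_pressure(discrepancy):
    """No immunization (assumed form): pressure grows with discrepancy."""
    return discrepancy


def peaked_pressure(peak):
    """After immunization (assumed form): pressure rises up to `peak`
    and declines for larger self-other discrepancies."""
    def pressure(discrepancy):
        if discrepancy <= peak:
            return discrepancy
        return max(0.0, 2 * peak - discrepancy)
    return pressure


def interpret(own, message, pressure, ambiguity=1.0, steps=21):
    """Return the reading of `message`, within +/- `ambiguity` of its
    veridical position, that carries the least pressure to change."""
    candidates = [message - ambiguity + 2 * ambiguity * i / (steps - 1)
                  for i in range(steps)]
    return min(candidates, key=lambda m: pressure(abs(m - own)))


def label(own, message, judged):
    """Classify the displacement relative to the subject's own position."""
    before, after = abs(message - own), abs(judged - own)
    if after < before:
        return "assimilation"
    if after > before:
        return "contrast"
    return "none"


MESSAGE = 14.0  # hypothetical veridical position of a favorable test message
conditions = [
    ("no immunization", 12.0, rising_pressure),
    ("immediately after immunization", 8.0, peaked_pressure(4.0)),
    ("immunization + delay", 10.0, peaked_pressure(6.0)),
]
for name, own, pressure in conditions:
    judged = interpret(own, MESSAGE, pressure)
    print(f"{name}: judged at {judged:.1f} ({label(own, MESSAGE, judged)})")
```

With these illustrative values the message is assimilated under no immunization, contrasted immediately after immunization (its discrepancy lies beyond the assumed peak), and assimilated again after the delay, once the peak has moved outward and the subject's own position has drifted back.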

Results with the unfavorable test messages. 
For reasons that are not completely clear at 
the present time, significant results were not 
obtained when unfavorable test messages were 
employed. There are, however, several reasons 
for proceeding cautiously. First, there is the 
fact that previous studies (Manis & Blake, 
1963) have obtained assimilation effects when 
these same unfavorable test messages were 
preceded by immunization and delay; thus, 
part of our main hypothesis has been sup-
ported in other work. Second, the unfavorable
test messages produced results which were in
some ways more promising than our earlier
discussion might suggest. As indicated in Ta-
ble 1, most subjects who rated the unfavorable
test messages immediately after immunization
showed contrast, while assimilation was typ-
ically obtained following the delay period.
Despite the modest consistency of these trends
across experiments, however, neither of these
effects reached acceptable levels of signifi-
cance.

7 With the further passage of time, the shape of
this curve should become increasingly similar to the
monotonically increasing function which was as-
sumed under no-immunization conditions (see Fig-
ure 3, solid line).

It is, of course, possible that the relatively 
inconsistent results shown with the unfavor-
able test messages may be due to the over-
whelmingly favorable attitudes of our subjects.
It might be, for example, that given no im- 
munization, the subjects’ favorable opinions 
may have partly “substituted” for the favor- 
able immunizing passages. In a general sense, 
this explanation is reminiscent of Upshaw’s 
(1962) recent results; this investigator found 
that when the subjects judged the favorabil- 
ity of a series of opinion items, they were 
relatively unaffected when items were ex- 
cluded from their preferred end of the attitude 
continuum, while strong contrast effects were 
obtained by removing items with which the 
subject disagreed. Analogously, the presence 
or absence of an immunizing message may be 
a relatively unimportant determinant of sub- 
sequent judgments when the contents of this 
message are in general accord with the sub-
ject’s personal views.


Tufts University


This experiment is concerned with the identification of physiological correlative
evidence of test anxiety in a problem-solving situation. While low-anxious,
medium-anxious, and high-anxious Ss attempted to solve anagrams, pulse-
monitored heart-rate recordings were taken. The principal findings were that
high-anxious Ss produced significantly larger increases in heart rate with the
onset of the problem-solving task than low-anxious Ss. Also, large increases in
heart rate were consistently associated with poorer anagram problem solving.
It was concluded that the physiological correlative index was sensitive to both
anxiety level and the task situation.


The primary purpose of the experiment 
reported in this paper was to attempt to 
identify during performance correlative in- 
dices of a paper-and-pencil measure of test 
anxiety which had been found to differentiate 
subjects in a variety of problem-solving situa- 
tions (Harleston, 1962). The rationale un- 
derlying the research was that successful 
identification of such indices would provide a 
broader base for understanding and inter- 
preting test-anxiety scale scores and would 
possibly enhance the validity of the measuring 
instrument. 

Despite the dearth of useful data indicating 
reliable relationships between behavioral 
measures of anxiety and physiological indi- 
cants (Martin, 1961), both empirical and 
theoretical considerations influenced the de-
cision to look to physiological activity for
correlative indices. In the first place, several 
theorists (Child, 1954; Spence, 1958) have 
interpreted anxiety scores in terms of drive 
level or drive state for which an underlying 
physiological state is presumed (although sel- 
dom specified). Second, a variety of somatic 
indices have traditionally been associated with 
anxiety (Martin, 1961). Third, increasingly 
more attention in recent years has been di- 
rected to the specification and measurement 
of motivation in terms of physiological in-
dices. The impetus for this approach has
come from the research and writing of the 
activation theorists (Duffy, 1932, 1962;
Lindsley, 1951; Malmo, 1959; Schnore, 1959;
Stennett, 1957) who equate motivation to the
intensive dimension of behavior. This inten-
sive dimension, labeled activation, arousal, or
drive level, refers to the degree of excitation
of the organism and denotes a communal
drive mechanism. Also, the intensive dimen-
sion is postulated to be reflected in and
quantified by reference to neurophysiological
and physiological indicants.

1 This research was supported in part by a Tufts
p Caleta Faculty Research Grant to the first
author.

To date, the activation point of view has 
influenced a considerable amount of research 
seeking to relate physiological indices to per- 
formance. For example, Stennett (1957) 
found an inverted U relationship between 
physiological indicators of skin conductance
and muscle tension and performance on a 
tracking task. Schnore (1959) demonstrated 
that an increase in task difficulty and task 
stress produced heightened physiological ac- 
tivity in a variety of indicators. He also found 
that certain indicators—heart rate, blood 
pressure, respiration rate, and right forearm 
muscle tension—were more sensitive to task 
and situational changes than other indicators. 
Obrist (1962) identified linear and curvilinear 
relationships between skin conductance and 
verbal learning and between heart rate and 
verbal learning. 

In contrast to the highly successful at-
tempts to uncover significant relationships
between physiological indicants and perform- 
ance, efforts to relate physiological indicants 
to anxiety scale scores are few in number and 
generally nonproductive of consistent rela-
tionships. Rossi (1959) found that high-
anxious subjects (Taylor, 1953) had signifi-
cantly higher muscle-action potentials than
low-anxious subjects during the tenth of a 
second interval immediately prior to respond- 
ing on a reaction-time task. Rossi interpreted 
these results as support for the view that the 
Taylor scale measured chronic drive rather 
than susceptibility to drive. On the other 
hand, McGuigan, Calvin, and Richardson 
(1959) found no relationship between Taylor 
scale anxiety and palmar-sweat index in a 
maze-learning situation. Likewise Rosenstein 
(1960) failed to find a significant relationship 
between test anxiety and palmar-sweat index 
in a verbal learning situation. 

Because these experiments involve a variety 
of procedures, instruments, and performance 
situations, it seems premature to conclude 
from them that behavioral measures of anx- 
iety are independent of physiological indices. 
Rather, the results suggest that the successful 
demonstration of such relationships may de- 
pend upon the particular choice of indices, 
scales, and performance measures. 

The present experiment sought to explore 
further the relationship between behavioral 
and physiological indices of anxiety in a 
performance situation. More specifically, at- 
tention was focused on the relationship be- 
tween pulse-monitored heart rate and test- 
anxiety level in a performance situation 
involving the solving of anagrams. Heart rate 
was selected because it had emerged as a 
sensitive index in a number of experiments 
(Blatt, 1961; Malmo, 1959; Schnore, 1959). 
Similarly, the particular test-anxiety scale 
selected had effectively differentiated subjects 
in problem-solving situations. The experiment 
sought answers to two related questions: 
What is the relationship between test-anxiety 
level and pulse-monitored heart rate? What 
is the relationship between each of these 
indices and performance in an anagram 
problem-solving situation? Three hypotheses 
were formulated: during performance the
high-anxious subjects will have the fastest
heart rate while the low-anxious subjects will
have the slowest heart rate; anagram 
problem-solving efficiency will be inversely 
related to test-anxiety level; and anagram 
problem-solving efficiency will be inversely
related to heart rate during the performance
situation. 


METHOD 
Subjects 


Forty-two females selected from an introductory 
psychology course served as subjects. Participation in 
the experiment satisfied a course requirement. About 
2 months prior to the beginning of the experiment 
the entire class was administered the test-anxiety 
questionnaire, developed and described in detail by 
Harleston (1962). Accordingly, subjects were ran- 
domly selected from among the distributions of 
low-anxious (LA), medium-anxious (MA), and high- 
anxious (HA) subjects. Fifteen of the subjects were 
LA, 14 were MA, and 13 were HA. The test-anxiety 
scores ranged from 26 to 48 for the LA, from 60 to 
67 for the MA, and from 76 to 86 for the HA.


Problem-Solving Materials 


Sixty anagrams were prepared for the experiment.
They were selected equally from high- and low- 
frequency words as scaled by Thorndike and Lorge 
(1944). The anagrams were presented to all subjects 
in the same order, with word frequency counter-
balanced within sets of four anagrams. The letter
sequences for the various anagrams were selected 
from among difficult letter-arrangement orders de- 
scribed by Mayzner and Tresselt (1958). 


Recording of Heart Rate 


Heart rate was recorded peripherally from the 
thumb of the right hand with the aid of a photocell-
amplifier-recorder unit. The subject held a cylindrical
tube which contained a 1.5-volt light source and a 
light-sensitive photoelectric cell. The subject's thumb 
was positioned on a plastic disc located on the top 
surface of the tube, flush with the light source and 
photoelectric cell. The diffusion of light on the thumb 
changed with the rhythmic change in blood volume 
evoking photoelectric cell responses. These latter 
responses were amplified and recorded by a Nesco 
Ink Recorder (Model JY120) on paper which moved 
at the rate of 1 inch every 7.5 seconds. The 
responses appeared on the paper as discrete horizontal
excursions of the pen from a center point.²


Experimental Procedure 


All subjects were run individually. The anagrams
were typed in capital letters on plain white 3 x
index cards and exposed to the subjects on a
display board. Each subject was allowed up to
1 minute to solve the anagram. When the subject
arrived at a correct solution she reported it to the
experimenter, who recorded the time taken and then
presented the next problem. If the subject offered
a wrong solution, the experimenter asked her to try
again to solve it. If the subject did not correctly
solve the anagram in 60 seconds, the experimenter
told the subject the correct solution and then
presented the next anagram.

2 The authors are indebted to Michael S
Frances Botch, and Lance Miller from the United
States Army Natick Labs for assistance in develop-
ment of the recording system.

The subjects solved as many problems as they 
could in 20 minutes. Five minutes after the last 
anagram was presented, the subject was given 3 
minutes in which to recall orally as many of the 
anagram solutions as possible. 

Heart rate was recorded for 5 minutes before the 
onset of the problem-solving period (pretest calibra- 
tion), during the 20-minute problem-solving period, 
for 5 minutes after the problem-solving period (post-
test calibration), and during the 3 minutes of at-
tempted recall. During the pre- and posttest calibra-
tion periods, the subject was asked to sit quietly
and try to relax. Instructions and three sample 
problems were presented before the onset of the 
pretest calibration period. A second experimenter, 
shielded from the subject’s view, monitored the 
operation of the recorder. 


RESULTS 


Heart-rate scores were derived from the 
ink-paper records of each subject. For any 
given subject, the heart-rate measure was 
obtained by simply counting heart-beat re-
sponses on her ink record and summing over
1-minute intervals.
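
The scoring step can be sketched as follows. The paper speed (1 inch every 7.5 seconds, hence 8 inches of record per minute) comes from the description of the recorder; the representation of each pen excursion as a distance in inches from the start of the record, and the function name `beats_per_minute`, are assumptions made for illustration.

```python
SECONDS_PER_INCH = 7.5                       # paper speed given in the source
INCHES_PER_MINUTE = 60.0 / SECONDS_PER_INCH  # 8 inches of record per minute


def beats_per_minute(beat_positions_in, n_minutes):
    """Count pen excursions (one per heart beat) in successive 1-minute
    stretches of the ink record.

    beat_positions_in: distance of each excursion from the start of the
    record, in inches (hypothetical input format).
    """
    counts = [0] * n_minutes
    for position in beat_positions_in:
        minute = int(position // INCHES_PER_MINUTE)
        if 0 <= minute < n_minutes:
            counts[minute] += 1
    return counts


# Hypothetical positions: three beats in minute 1, two in minute 2, one in minute 3
print(beats_per_minute([0.1, 0.2, 7.9, 8.0, 15.9, 16.0], n_minutes=3))
```

Averaging the per-minute counts over a period (for example, the 5-minute pretest calibration) would then give the mean heart rate for that period.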


Test Anxiety and Heart Rate 


For the 5-minute pretest calibration period, 
the mean heart rates per minute were 90.3, 
87.8, and 85.3 for the LA, MA, and HA, 
respectively. The LA subjects had the fastest 
heart rate, the MA subjects had the next 
fastest, and the HA subjects had the slowest 
heart rate. However, neither t tests nor
product-moment correlations revealed sta- 
tistically significant relationships. In other 
words, test-anxiety level was not significantly 
related to heart rate before the onset of the
problem-solving task. 

Two other periods which were examined are 
the first 5 minutes of the problem-solving
task and the full 20-minute problem-solving 
period. The former period was investigated 
because it was evident that the onset of the 
task led to heightened heart activity in virtu- 
ally every subject. The 20-minute problem- 
solving period represents the total time the 
subject was confronted with the task. Because 
the anxiety groups had different initial heart- 
tate scores, it was decided that percentage 


“change scores (PCS) would be used in 


exploring the relationship between test anx- 
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iety and heart rate during these periods. The 
PCS for each subject was the ratio of his 
mean heart rate for either of these periods to 
his mean heart rate during the pretest 
calibration period. 
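
The PCS computation can be sketched in a few lines of Python. The text defines the PCS as a ratio of period mean to pretest mean, while the values reported below (for example 3.8 and 8.7) read as percentages; the sketch therefore assumes the score is the ratio expressed as a percentage change from the pretest mean. That conversion, the function name, and the sample counts are illustrative assumptions, not details given in the report.

```python
from statistics import mean


def percentage_change_score(pretest_counts, period_counts):
    """Percentage change of a period's mean heart rate from the pretest mean.

    Both arguments are per-minute heart-beat counts. The conversion of the
    ratio to a percentage change is an assumption made for this sketch.
    """
    baseline = mean(pretest_counts)
    if baseline <= 0:
        raise ValueError("pretest mean heart rate must be positive")
    ratio = mean(period_counts) / baseline
    return (ratio - 1.0) * 100.0


# Hypothetical per-minute counts for one subject (not data from the study).
pretest = [88, 90, 91, 89, 92]      # 5-minute pretest calibration period
first_five = [95, 96, 94, 93, 92]   # first 5 minutes of the task

pcs = percentage_change_score(pretest, first_five)
print(f"PCS = {pcs:.1f}%")
```

A positive score indicates a rise above the resting level and a negative score a drop below it, which is how the group means reported next are read.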

For the first 5 minutes of the problem- 
solving task, the mean PCS of the LA, MA, 
and HA subjects was 3.8, 5.4, and 8.7, 
respectively. The LA subjects experienced the 
smallest percentage of increase while the HA 
subjects experienced the largest increase. For 
the entire task period the mean PCS was 
1.2, 2.1, and 4.0, respectively. Thus, over 
the entire task the LA subjects showed a 
slight drop in heart rate while the HA subjects
sustained the highest increase. In each
instance the relationship between test-anxiety 
scores and change in heart rate was statisti- 
cally reliable (r = .30 and .31, respectively, 
p<.05). In short, while pretest heart-rate 
level was unrelated to test-anxiety level, the 
degree of task-associated change in heart rate 
was found to be positively related to test- 
anxiety level. 


Test-Anxiety Level and Performance 


Three indices of performance were derived
from the task: number of anagrams solved
during the 20-minute test period, mean solution
time for the anagrams which were solved,
and number of anagrams recalled. The scores
of the anxiety groups on these performance
indices are presented in Table 1. With respect
both to number of anagrams solved and solution
times, it is evident that poorer performance
is associated with high test anxiety.
More specifically, the LA and MA subjects
solved more anagrams than the HA subjects,
and the LA subjects had faster solution times.


TABLE 1

MEAN NUMBER OF ANAGRAMS SOLVED, MEAN SOLUTION TIMES (IN SECONDS),
AND MEAN NUMBER OF ANAGRAMS RECALLED BY LA, MA, AND HA SUBJECTS

               Anagrams solved    Solution times    Anagrams recalled
Group    N       M       SD         M       SD         M       SD
LA      15     22.1     7.7       16.5     4.0        7.8     2.6
MA      14     22.8     8.5       19.6     6.0        7.6     3.1
HA      13     18.8     8.5       19.2     5.5        7.9     4.4

TABLE 2

MEAN NUMBER OF ANAGRAMS SOLVED, MEAN SOLUTION TIMES (IN SECONDS), AND MEAN NUMBER OF
ANAGRAMS RECALLED BY SUBJECTS WITH LOW AND HIGH PERCENTAGE OF CHANGES IN HEART RATE

Percentage of change               Anagrams solved    Solution times    Anagrams recalled
ratio and group              N       M       SD         M       SD         M       SD
Pretest to first 5 minutes
  Low PCS                   21     24.6     8.5       16.9     4.8        8.5     3.4
  High PCS                  21     18.0     6.8       19.9     5.6        7.4     3.3
Pretest to total task
  Low PCS                   21     23.4     8.4       17.0     4.8        7.6     3.1
  High PCS                  21     19.2     7.8       19.8     5.6        7.9     3.7


However, none of these differences reached
statistical significance. 


Heart Rate and Performance 


In order to determine the relationship be- 
tween change in heart rate and problem- 
solving efficiency, the performance of the 21 
subjects showing the smallest percentage of 
change was compared with that of the 21 
subjects showing the largest percentage of 
change. Separate analyses of the performance 
data were made for PCS data based upon 
the first 5 minutes of the problem-solving 
task and for PCS data based upon the entire 
problem-solving period. In Table 2 are pre- 
sented the scores on the various indices 
obtained by the high and low percentage of 
change groups based upon each period. With 
respect to both the number of anagrams 
solved and solution times, subjects showing 
smaller percentage of changes performed 
better than subjects showing larger percentage 
of changes. Considering first the PCSs based
upon the first 5 minutes, one-tailed t tests
indicated that the differences between groups
in numbers solved and in solution times were
significant at better than the .05 level
(t = 2.68 and 1.83, respectively, df = 40).
These relationships are further underscored
by correlations of −.41 and .42, respectively
(p < .01). For the PCSs based upon the total
problem-solving period, the t value for number
of anagrams solved was 1.63, with a p
value between .05 and .10. For solution times,
the t value was 1.71, with a p value of .05.
Again, these relationships are reflected in significant
correlations of −.30 and .37, respectively
(p < .05). In neither period do recall
differences approach significance.
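
As a rough check on the first-5-minute comparisons, the t values can be approximated from the summary statistics in Table 2. The sketch below assumes an independent-groups t test with pooled variance and 21 subjects per group (df = 40, as stated in the text); the report does not say which form of the t test was used, so this is an illustration and not the authors' computation.

```python
import math


def pooled_t(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Independent-groups t statistic with pooled variance, from summary stats.

    Returns (t, df). The SDs are treated as sample standard deviations,
    which is an assumption about how the table was computed.
    """
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * sd_a ** 2 + (n_b - 1) * sd_b ** 2) / df
    std_err = math.sqrt(pooled_var * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / std_err, df


# Table 2, pretest to first 5 minutes: low PCS group vs. high PCS group.
t_solved, df = pooled_t(24.6, 8.5, 21, 18.0, 6.8, 21)   # anagrams solved
t_time, _ = pooled_t(19.9, 5.6, 21, 16.9, 4.8, 21)      # solution times (high minus low)

print(f"anagrams solved: t({df}) = {t_solved:.2f}")
print(f"solution times:  t({df}) = {t_time:.2f}")
```

Under these assumptions the values come out close to, but not identical with, the reported 2.68 and 1.83. The small gap is presumably due to rounding in the tabled means and SDs or to a different variance convention.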


Interactive Effects of Test-Anxiety Level and
Percentage of Heart-Rate Change on Performance


In view of the fact that high test anxiety
was associated with a greater percentage of
change in heart rate and a large percentage of
change was associated with poorer task performance,
the failure of test-anxiety level to
emerge as a statistically significant variable
raised the possibility that test-anxiety level
was interacting with degree of change in heart
rate to determine performance. This possibility
was explored by comparing the performance
scores of the high and low percentage of
change subjects separately for each test-anxiety
group. These data are summarized in
Table 3, separately for the first 5 minutes and
for the total task period. Inspection of the
table reveals that within each anxiety group
the performance of the subjects with high
percentage of change scores was generally
poorer than that of subjects with low percentage
of change scores. With respect to
number of anagrams solved the differences between
high and low percentage of change subjects
were consistently greater within both
LA and MA groups than within the HA
group. For each time period, analyses by t
tests revealed that the differences between high
and low percentage of change subjects within
the LA group and within the MA group were
statistically reliable (p < .05 in each instance).
For the solution times only the differences
within the MA groups were significant
(p < .05).

Thus, based upon the performance indices
of anagrams solved and solution times, there
is some evidence of an interactive effect between
anxiety level and degree of change in
heart rate. The recall data, while showing
some trends which point to interactive effects,
failed to produce consistent differences
when similarly analyzed.


DISCUSSION 
Test Anxiety and Heart Rate 


While absolute heart rate failed to show a 
significant relationship with test-anxiety level, 
percentage of change in heart rate emerged as 
a significant correlative index. High-anxious 
subjects produced the greatest increase in 
heart rate upon beginning the problem-solving 
task and sustained the greatest increase in 
heart rate throughout the task, while LA sub- 
jects produced the smallest initial change and 
sustained the change for a shorter period of 
time. If change in heart rate is an index of 
change in arousal or drive level (Schnore, 
1959; Stennett, 1957), these data suggest that 
rather than reflecting absolute drive level as 
such, test-anxiety score is a relative index of 
the subject’s potential increase or change in 
arousal or drive level in a task situation. The 
task evokes the arousal change, but the ex- 
tent or degree of change depends upon the 
subject’s test-anxiety level. In short, while a
correlative index of heart rate has been suc- 
cessfully identified, the index reflects the inter- 
action of subject characteristics and task fac- 
tors. Anxiety level, in relation to this index, 
was significant only in the presence of the 
task. 
In view of the fact that the relatively 
greater increase in heart rate by the HA sub- 
jects occurred early in the total task, it seems 
reasonable to conclude that the major factor 
causing the shift was the general task demand 
to perform selectively and under time stress. 
It is also possible, however, that such factors 
as the subject’s perception of how well she 
was doing and the impact of success or failure 
on a given problem also contributed differen- 
tially to the heightened arousal of the HA sub- 
jects. Indeed, Blatt (1961) has indicated that 
in certain problem-solving situations, success 
experiences lead to heightened heart rate. The 
design of this experiment does not permit a 
meaningful test of any differential effects on 
change in heart rate due to specific success or 
failure experiences. However, the fact that the 
three anxiety groups did not differ signifi- 
cantly in performance would seem to minimize, 


TABLE 3

MEAN NUMBER OF ANAGRAMS SOLVED, MEAN SOLUTION TIMES (IN SECONDS), AND MEAN NUMBER OF
ANAGRAMS RECALLED BY SUBJECTS OF EACH ANXIETY GROUP WITH LOW AND HIGH PERCENTAGE
OF CHANGE IN HEART RATE

Percentage of change               Anagrams solved    Solution times    Anagrams recalled
ratio and group              Nᵃ      M       SD         M       SD         M       SD
Pretest to first 5 minutes
  LA
    Low PCS                  9     24.8     8.3       16.6     5.1        8.0     2.4
    High PCS                 6     18.0     3.9       16.5     1.5        7.5     2.9
  MA
    Low PCS                  7     27.1     8.6       16.9     4.3        8.0     3.4
    High PCS                 7     18.4     5.7       22.4     6.2        7.1     2.6
  HA
    Low PCS                  7     18.0     7.9       18.7     4.9        7.7     4.5
    High PCS                 6     19.8     9.1       19.7     6.1        8.2     4.2
Pretest to total task
  LA
    Low PCS                  9     24.5     8.0       15.7     4.8        7.4     2.8
    High PCS                 6     18.5     4.0       17.8     1.8        8.3     2.2
  MA
    Low PCS                  7     27.1     8.6       16.9     4.3        8.0     3.4
    High PCS                 7     18.4     5.7       22.4     6.2        7.1     2.6
  HA
    Low PCS                  8     17.9     7.4       18.9     4.6        7.6     4.2
    High PCS                 5     20.4     9.9       19.6     6.7        8.4     4.6

ᵃ The Ns refer to the number of subjects in a given anxiety group whose PCSs were below or above the mean for that group.
though not eliminate, the importance of these
factors in the present experiment. 

The emergence of an arousal or drive level 
index which is dependent upon the interaction 
of subject and task is consistent with the 
many studies which have found important 
interactions between anxiety level and task 
factors (Harleston, 1962; Sarason, 1960; 
Spence, 1958). Of greater significance, per- 
haps, is that the present findings may offer a 
basis for explaining or interpreting these instances
of anxiety and task interaction. Further,
the emergence of a correlative index
only in the interaction of subjects and task 
may account for the failure of other studies 
which have used absolute measures of physio- 
logical activity to identify reliable correlative 
indices. At any rate, these data underscore the 
importance of situational factors in evoking 
latent arousal or drive. 


Test Anxiety and Performance 


The failure to find significant differences in 
performance associated with test-anxiety level 
as a main effect was quite surprising. In pre- 
vious research involving anagrams (Harleston, 
1962), HA subjects had performed more 
poorly than LA or MA subjects. One possible 
reason for obtaining no differences in the 
present experiment is that task-generated
arousal dwarfed the differential changes at- 
tributable to anxiety level. Certainly the task 
of the present experiment was considerably 
more difficult than the tasks used in earlier 
studies. Many more difficult anagrams were 
used, and the subject was allowed a maximum 
of only 1 minute to solve each anagram. In
previous research the subject was allowed 2 or
2.5 minutes. In short, the difficulty level of the 
present task may have produced a relatively 
large amount of task-related arousal, thereby 
reducing the effects of test-anxiety level. 
Further, the trends pointing to interactive ef- 
fects of percentage of heart-rate change and 
anxiety suggest that the influence of task- 
generated arousal may have been dispropor- 
tionately greater for the LA and MA subjects 
than for the HA subjects. It is interesting to 
note that Rossi (1959) found a comparable 
interactive effect in his experiment on muscle- 
action potential (MAP) correlates of manifest 
anxiety in a reaction-time task. In addition to 
finding that HA subjects had higher MAP
scores than LA subjects, Rossi reported that 
his LA subjects with high MAP scores per- 
formed significantly better than his LA sub- 
jects with low MAP scores, while among HA 
subjects, high or low MAP scores were not 
consistently related to performance. 


Percentage of Heart-Rate Change and Performance


With respect to both number of anagrams 
solved and solution times, relatively moderate 
shifts in arousal level were associated with 
better performance than relatively extreme 
shifts. These results are consistent with the
data of several experiments (Easterbrook, 
1959; Stennett, 1957) involving a number of 
behavioral processes. However, the results do 
not selectively support any one of the many 
explanations which have been advanced to 
account for poorer performance due to extreme
levels of arousal or drive (Malmo,
1959). 

Considered together, the results of this experiment
lend support to the search for correlative
indices of behavioral measures of
anxiety and confirm the validity of using
physiological indices as predictors of behavior.
At the same time the results also point to a
number of research needs. Among these are a
test of the generality of the present findings to
other physiological indicants and a more precise
delineation of the relative role of subject
and task factors.
HOPE OF SUCCESS, FEAR OF FAILURE, SUBJECTIVE
PROBABILITY, AND RISK-TAKING BEHAVIOR ¹

RICHARD deCHARMS AND PRAFULACHANDRA N. DAVE ²
Washington University, St. Louis, Missouri


Risk-taking behavior was studied in 71 elementary school boys testing the 
hypotheses that (a) Ss who have hope of success will be more successful at 
shooting a volleyball into a basket than Ss who have fear of failure; and (b) 
Ss who have hope of success will take more moderate risks than Ss who have 
fear of failure. Actual success was measured in 70 practice trials from a number 
of predetermined distances. Risk taking was measured in 20 trials in which 
S chose his shooting distances. Hope of success and fear of failure were measured 
as by Atkinson et al. and by a technique first presented by Jordan and 
deCharms. Risk taking was measured as by Atkinson et al. and by taking 
into account the skill of the individual S. No relationships were found between 
the measures used by Atkinson et al. and risk-taking behavior. Using the 
Jordan-deCharms method of assessing achievement orientations, however, it 
was found that Ss with high hope of success and high fear of failure both 
shot more successfully in the 70 practice trials and took more moderate risks
in the 20 free-choice trials than did the other Ss.


This study is a direct outgrowth of research 
on risk taking and achievement motivation 
reported by Atkinson and Litwin (1960) and 
McClelland, Atkinson, Clark, and Lowell 
(1953). In many respects the basic hypothe- 
ses in this study are similar to the predictions 
and results reported in the previous work. 
Thus in general the basic problem of the 
present research is to investigate the relation- 
ship between achievement motivation and risk 
taking in children. The hypotheses stem from
the proposition that in a risk-taking situa- 
tion, subjects with strong achievement moti- 
vation will attempt to attain an optimum 
amount of satisfaction, which is assumed to 
be a joint function of maximizing success and 
minimizing failure. It is further assumed that 
some subjects will be more concerned with 
maximizing success than with the implications 
of failure and others will be most concerned 
with minimizing failure. Specific hypotheses 
relating these aspects of achievement motiva- 
tion (n Achievement) to risk-taking behavior 
will be developed. 

Essentially, two basic innovations are made 
in the present research. First, achievement 
motivation is divided into two components 


¹ The research reported in this paper was supported
by the Office of Naval Research, Group Psychology
Branch, Contract Nonr 816 (11).

*Now at Regional College of Education, Manasa- 
Gangotri, Mysore-2, India. 


called hope of success and fear of failure that
are measured in a way which has not previously
been related to risk taking, although
they were developed and used in previous research
by Jordan and deCharms (1959). Second,
a specific attempt is made to make explicit
to each subject his own probability of
achieving success in the various conditions of
the risk-taking task. Since the task involves
skill, the probability of achieving success is
assumed to vary from subject to subject. In
previous research, although the probability of
achieving success was assumed to be an important
variable, it was calculated on a general
basis rather than on a basis specific to the
individual subject. In the present study, subjects
were alone with the experimenter rather
than in a group situation as in previous
studies. This was an attempt to emphasize
individual achievement and reduce other factors
such as affiliation motivation.
The model presented by Atkinson for the
relationship between motivation, incentives,
and the probability of success is assumed in
most details here. It implies that subjects with
a strong motive to achieve and a weak fear of
failure will tend to take moderate risks, while
subjects motivated primarily by fear of failure
will take either extreme risks or low risks.
Theoretically, the definition of risk hinges on
the subject’s expectations concerning his own
skill. Subjects motivated to achieve should
take risks close to their own .5 probability of
succeeding, whereas subjects motivated by
fear of failure should avoid their own .5 probability
of succeeding.

Atkinson, Bastian, Earl, and Litwin (1960) 
present evidence, however, that the two groups 
approach a shuffleboard game with differential 
expectations about scoring. Subjects motivated 
to achieve expect to beat more of the group 
than do subjects motivated by fear of failure. 
They found no evidence, however, that these 
differential expectations were borne out in the 
actual shooting, although the data available 
to check on differential skill would demand 
large differences for significance. 

The present experiment attempts to assess 
skill and individual probabilities of success by 
recording successes in practice trials. This 
technique was tried by the aforementioned re- 
searchers, but failed since the skill of the 
subjects was so low that empirical probabili- 
ties of success could not be used. Subjects with 
strong achievement motivation have been 
found to excel in many laboratory tasks (Mc- 
Clelland et al., 1953). It therefore seems rea- 
sonable to predict that success in practice 
trials will be related to the achievement moti- 
vation of the subject. 

In addition to attempting to refine the ex- 
perimental technique, this study is designed to 
compare the use of n Achievement (n Ach) 
and anxiety scores (Atkinson’s measures) with 
a hope of success and fear of failure measure 
derived exclusively from fantasy productions. 
The preference for the fantasy measures results
from assuming that subjects with hope of
success or fear of failure are alike in stating
achievement goals in their stories. The content
and valence of these goals, however, indicate
either approach or avoidance reactions. Thus
a person with strong achievement motivation
with primarily avoidant responses, that is,
one to whom the goal is important and to be
avoided, seems to be a person motivated by
fear of failure.

Atkinson and Litwin (1960) and Birney,
Burdick, and Teevan (1964) base their
measures on the assumption that fear of
failure is a characteristic of subjects with low
achievement motivation. They use a general
measure of anxiety or of hostile press to assess
fear of failure. Our assumption is quite
different. We have assumed that achievement 
goals should be important for fear of failure 
subjects. Thus we have tried to separate hope- 
versus fear-oriented subjects from among 
those who show strong achievement motiva- 
tion. This approach has its roots in the dis- 
tinction made by French and Chadwick 
(1956) between goal-oriented and threat- 
oriented affiliation motivation. 

With the methodological refinement and a 
different measure of achievement orientation, 
the present study was intended to support the 
hypotheses presented by Atkinson with even 
greater clarity.

Hypothesis I. In a risk-taking task involving
motor skills, subjects who are oriented
toward achieving success will be more suc- 
cessful than subjects who are oriented toward 
avoiding failure. 

Hypothesis II. In a risk-taking task in- 
volving motor skills, subjects who are oriented 
toward achieving success will take more mod- 
erate risks than subjects who are oriented 
toward avoiding failure. This hypothesis is 
similar to that presented by Atkinson et al. 
(1960) and Atkinson and Litwin (1960). 


METHOD 


Subjects


Seventy-one boys from the fourth, fifth, and sixth 
grades of two predominantly middle-class schools in 
a large midwestern city served as subjects in the
experiment.


Measure of Achievement Orientation 


The Thematic Apperception Test (TAT) was administered
to all the subjects under so-called task
conditions in their own classrooms. Six TAT pictures
were presented to the subjects. The pictures were all
standardized pictures listed in Atkinson (1958) and
they were shown in the following sequence: the boy
in the checkered shirt, two men standing near a
machine, the architect, the boy in the operating
room, a youth talking to an elderly man beside a
farm fence, and the trapeze artists. After the subjects
had viewed each picture for 20 seconds, the experimenter
asked them to make up a story about it
and to devote approximately 1 minute to answering
each of four general questions in the booklet. As each
minute was up, the experimenter instructed them to
proceed to the next question. The total time required
to complete the test was approximately 30
minutes.
Scoring


First the six stories written by each subject were 
evaluated individually to determine whether or not 
they contained achievement imagery. Second, each 
story that contained achievement imagery was scored 
for seven subcategories. Third, following a procedure 
similar to that developed by Jordan and deCharms 
(1959) each story containing achievement imagery 
was further categorized either as indicating hope of 
success or as indicating fear of failure depending 
upon the outcome of the story. The scorers attempted
to distinguish between general approach and
avoidance reactions (optimism-pessimism) which 
were specifically related to achievement goals and 
not a generalized anxiety or hostile press. Thus if 
the achievement goals in the story were not achieved, 
it was scored fear of failure. If the outcome was 
questionable, the story was arbitrarily scored fear of 
failure. (It should be noted, however, that the out- 
come category is not necessarily determined by the 
outcome of stated instrumental activity in the story.) 

Each subject was assigned an n Ach score (here- 
after referred to as total n Ach score) on the basis of 
1 point for achievement imagery and 1 additional 
point for each of the following subcategories: need, 
instrumental activity, anticipatory goal states, affec- 
tive states, nurturant press, personal or environ- 
mental obstacles, and overall thema. The sum of 
the points assigned for the above subcategories in all 
of a subject’s stories containing achievement imagery 
constituted his total n Ach score. This is identical
to the procedure used by Atkinson and McClel-

The hope of success score and the fear of failure 
score for each subject were determined by dividing 
his stories into the two aforementioned categories, 
that is, those judged to be hope of success and those 
judged to be fear of failure. The sum of the points 
assigned for achievement imagery and the subcate- 
gories in those stories determined to have a positive 
outcome (or hope of success) constituted the subject’s
hope of success score, and the sum of the
points in those stories determined to have a nega- 
tive outcome (or fear of failure) constituted his fear 
of failure score. The sum of the hope of success 
score plus the fear of failure score for any subject 
equaled the total n Ach score for that subject. 
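
The scoring rules above can be summarized in a short Python sketch. The dictionary layout, field names, and outcome labels are hypothetical conveniences for illustration; only the point rules (1 point for achievement imagery, 1 point per subcategory present, questionable outcomes counted as fear of failure) are taken from the text.

```python
# Subcategories scored in addition to achievement imagery (names abbreviated).
SUBCATEGORIES = (
    "need", "instrumental_activity", "anticipatory_goal_states",
    "affective_states", "nurturant_press", "obstacles", "thema",
)


def story_points(story):
    """1 point for achievement imagery plus 1 per subcategory present."""
    if not story["achievement_imagery"]:
        return 0
    return 1 + sum(1 for name in SUBCATEGORIES if name in story["subcategories"])


def score_subject(stories):
    """Split a subject's points into hope of success and fear of failure."""
    hope = fear = 0
    for story in stories:
        points = story_points(story)
        if points == 0:
            continue  # stories without achievement imagery contribute nothing
        if story["outcome"] == "positive":
            hope += points
        else:  # negative or questionable outcomes count as fear of failure
            fear += points
    return {"hope_of_success": hope, "fear_of_failure": fear,
            "total_n_ach": hope + fear}


# Hypothetical stories for one subject (not data from the study).
stories = [
    {"achievement_imagery": True, "outcome": "positive",
     "subcategories": {"need", "instrumental_activity"}},
    {"achievement_imagery": True, "outcome": "questionable",
     "subcategories": {"obstacles"}},
    {"achievement_imagery": False, "outcome": "positive", "subcategories": set()},
]
scores = score_subject(stories)
print(scores)
```

Because every scored story falls into exactly one of the two outcome categories, the two component scores always sum to the total n Ach score, as the text notes.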

Interscorer reliability was checked by double scor- 
ing a one-third random sample from the total popu- 
lation of stories. The reliability coefficient for both 
percentage of agreement on achievement imagery 
and for all subcategories was 84%. Test Anxiety 
scores were obtained from the Mandler-Sarason Test 
Anxiety Questionnaire (Mandler & Cowen, 1958), 
which was administered immediately after the TAT. 


Risk-Taking Task 


A game of shooting a volleyball into a fixed 
bushel basket at floor level from varying distances 
was selected as the task situation after various pre- 
tests had indicated both the subjects’ interest in 
basketball and their inability to handle the standard 
basketball equipment. The following seven shooting 
distances also were determined in the
were selected to include both hard and ea 
each subject: 4, 7, 10, 13, 16, 19, and 22 

The risk-taking task was administered 
lated situation with one subject at a 
trasted with Atkinson’s group situation. 
was briefed fully about what he was supposed 
He was shown the basket and the distances 
told to shoot the ball in any way he wis 
try to hit the inside of the basket as many 
he could. It was made clear to him that 
counting for points were those in which 
the inside of the basket first, although it wi 
necessary for the ball to stay inside the 
the points to count. 

The risk-taking task was divided into 
warm-up period of 7 shots; 70 practice sho 
test period of 20 free-choice shots. i 

In Phase 1, or warm-up period, the subj 
7 shots, 1 from each distance. In P] 
practice period, the subject made 70 practice 
10 blocks to determine his probability of hi 
basket from each distance. During each of 
seven-shot blocks, he made one shot from ead 
seven distances in the following order: 4, 10) 1 
7, 13, and 19 feet. This procedure was foll 
all 10 practice blocks. After 5 seven-shot bl 
completed, the subject was allowed to rest 
2 minutes, and then he was asked to sho 
shots in the remaining 5 blocks. 

When he had completed Phase 2, the st 
given a 5-minute rest period during wi 
the experimenter computed his probability” 
from each distance and then gave the 
paper-and-pencil analog of the risk-taking 
which we will discuss later. 

During Phase 3, the final testing period 
choice trials, placards on the floor next to 
tance mark confronted the subject with th 
of times he had succeeded in hitting the bi 
that distance in the 10 practice trials in P 
addition, the placard showed the number © 
which he would receive if he hit the 


in reverse order of the subject’s measur 
of success from any distance, 1 point for 
probability, 2 points for 90%, down to 10 
10% or 0%.
It should be noted that these probal 
incentives were actually computed from 
of each subject in Phase 2 and therefore V 
subject to subject. For example, one sub; 
have successfully hit the basket in 5 oul 
practice trials from the 10-foot distance. 
placard confronting him at 10 feet would 
of that fact plus the fact that he wí 
points if he succeeded in hitting the } 
that distance during these trials. Anotl 
however, who had had 5 successes from 7 
be confronted with this same placard at 
stead of 10 feet. Thus the placard appro 
each subject was placed at each of the 
tances, and their placement varied from 
subject. 




The pencil-and-paper analog of the risk-taking 
situation which was given immediately prior to the 
20 free-choice test trials gave assurance that each 
subject had a full understanding of what he was 
supposed to do in the final testing period. The ana- 
log test was a miniature drawing of the exact game 
situation, showing distances, the subject’s computed 
probability of success, and the corresponding incentives.
The experimenter emphasized to the subject
that during the final test he did not have to
conform to what he put down on this paper-and- 
pencil analog. 


Definition of Moderate Risk 


Moderate risk will be defined in terms of a group 
norm—namely, the distance where most shots are 
taken by the entire group. This distance is the em- 
pirically established most preferred point—the group 
mean distance. Nonmoderate risk is defined by shots 
taken either much closer to or much farther from 
the target than this point. 

A moderate risk in terms of the subject’s estab- 
lished probability of success is defined in terms of 
the empirically established most preferred probabil- 
ity of success—that is, the group mean of free- 
choice shots expressed in terms of their probabilities 
of success. This mean tells us that most often the 
subjects choose this level of probability of success in 
their free-choice shots. A point for each subject 
which corresponds to this group mean probability 
for him may be established and deviations around 
this may be established. Such a measure tells how 
the subject varies around his own point of proba- 
bility of success which corresponds to the group’s 
most preferred probability. The distance of this 
point may vary from subject to subject according to 
skill. However, it is the point at which each subject
has an equal and moderate probability of success.
Thus the measure assures a test of moderate
risk taking which is equated for skill.


Analysis of the Data 


The data may be analyzed in four different ways. 
This is because success and failure orientation may 
be derived from either the total n Ach scores and 
Test Anxiety scores (Atkinson’s measures) or from 
our analysis of the TAT resulting in a hope of success
or fear of failure score. The dependent variable
may also be derived in two ways. Following Atkin- 
son’s procedure the deviation of actual distances 
around the group mean may be used. The measure- 
ment of actual probabilities allows a second score 
comprised of the deviation of probabilities around an 
individual’s point corresponding to the average prob- 
ability. The latter appears to conform more closely 
to Atkinson’s model which assumes that the choice 
of a place to shoot from is made with the probability
of success in mind and is only secondarily related
to the actual distance. In addition, this latter
measure equates subjects for skill.


The independent variables of total n Ach, test 
anxiety, hope of success, and fear of failure were 
split at the median score. Using total n Ach and test
anxiety, a 2 × 2 division into four groups resulted in
a group high on both variables, a group low on both 
variables, and two groups high on one variable and 
low on the other. A similar division was made using 
hope of success and fear of failure. Both of these 
divisions resulted in unequal cell entries, but the 
inequalities were especially marked in the second 
division of hope of success and fear of failure be- 
cause 52 subjects scored 0 on fear of failure. 
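
The median split and the resulting 2 × 2 division can be sketched as follows. The text does not say how scores falling exactly at the median were assigned, so the rule used here (strictly above the median is "high") is an assumption, as are the function names and the sample scores.

```python
from statistics import median


def median_split(scores):
    """Label each subject 'high' or 'low' relative to the median score.

    Scores equal to the median are labeled 'low'; the source does not
    state its tie rule, so this is an assumption of the sketch.
    """
    cut = median(scores.values())
    return {sid: ("high" if value > cut else "low") for sid, value in scores.items()}


def two_by_two(first, second):
    """Cross two median splits into four cells keyed by (first, second) level."""
    split_a, split_b = median_split(first), median_split(second)
    cells = {(a, b): [] for a in ("high", "low") for b in ("high", "low")}
    for sid in first:
        cells[(split_a[sid], split_b[sid])].append(sid)
    return cells


# Hypothetical scores for four subjects (not data from the study).
hope = {"s1": 5, "s2": 0, "s3": 3, "s4": 8}
fear = {"s1": 0, "s2": 0, "s3": 2, "s4": 0}

cells = two_by_two(hope, fear)
for key, members in cells.items():
    print(key, members)
```

When most subjects score 0 on one variable, the median is 0 and only subjects with a nonzero score land in the "high" half, which illustrates how a distribution like the one reported for fear of failure produces markedly unequal cells.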

Two scores, comprising two measures of the de- 
pendent variable, were computed from the same data. 
These scores are the average deviation distance score 
and the average deviation probability score. These 
names will be shortened to the distance deviation 
score and the probability deviation score. The method 
used to derive the distance deviation score was as 
follows: First, the group mean of the distances for
the 20 free-choice shots obtained from all 71 sub-
jects was computed. Then the deviation of each shot
distance from the group mean for each individual
was computed disregarding sign, and the mean of these
deviations comprised his distance deviation score.

The probability deviation score is similar to the 
distance deviation score except that it is based on 
the empirically determined probability of success 
from each distance. For each distance from which a 
subject might shoot, the probability of success was 
available from the 10 practice shots at that distance 
for each subject. Thus each distance was also char-
acterized by a probability for any individual sub-
ject.³ Therefore, a deviation score could be calcu-
lated using the probabilities just as it was with the
distances. First, a group mean was computed for all 
subjects using their individual probabilities of hitting 
from each point from which they took a shot during 
the free-choice trials (group M = .34). Then the 
deviation of each of the 20 shot probabilities for
each individual from his own point of .34 probabil- 
ity of success was calculated. Finally, the mean of 
the 20 deviations of each individual was computed 
disregarding sign and this comprised his probability 
deviation score. A very skillful person may have a 
moderate risk probability (.34) far from the target, 
whereas a less skillful one may have this probability 
close to the target. Using deviations about this
group-defined norm but individually determined point com-
pletely eliminates the effect of skill on the dependent
variable. It shows how an individual's choice of


³ In some instances, the empirical probabilities of
success in the practice trials were not monotonically
decreasing with increased distance. If this were the
case (for example, if a subject made only three suc-
cesses from 7 feet and made five from 10 feet), the
experimenter quickly graphed the results, roughly
plotted a smooth curve, and assigned probabilities
which were communicated to the subjects on the
placards. The graphing was only done when the
probability went up .20 or more with increasing dis-
tance. The procedure was used in approximately
one-third of the 71 subjects. These smoothed proba-
bilities were used both for the placards and for
computing the probability deviation score.
TABLE 1

COMPUTATION OF DISTANCE DEVIATION SCORE AND
PROBABILITY DEVIATION SCORE FOR
SUBJECTS 36 AND 37

                                      Distance from target in feet
                                    4     7    10    13    16    19    22
Subject 36
  Empirical probabilityᵃ           .70   .60   .20   .00   .10   .00   .00
  Frequency of free-choice shots    10     7     3     0     0     0     0
Subject 37
  Empirical probabilityᵃ           .90   .50   .40   .50   .20   .30   .10
  Frequency of free-choice shots     0     0     9    11     0     0     0

ᵃ These represent the probabilities determined in the practice

TABLE 2

RANK-ORDER CORRELATIONS BETWEEN HOPE OF
SUCCESS, FEAR OF FAILURE, n ACHIEVEMENT, AND
MANDLER-SARASON TEST ANXIETY QUESTIONNAIRE

shots varies around a point where he is equally
as successful as all other subjects. 

Since the distinction between the distance deviation 
score and the probability deviation score is critical, 
Table 1 is presented to help clarify how the scores 
were derived. Table 1 shows that Subject 36 took all 
of his free-choice shots from the distances of 4, 7, or 
10 feet. At these distances his probabilities of success
determined from practice were .70, .60, and .20, re- 
spectively. The average deviation of this subject’s 
shots around the overall mean distance of free-choice 
shots for all subjects (12.1 feet) can be determined 
using the concept and formula for average deviation,
$\sum |X - \bar{X}| / N$, where X equals the distance of each of the
subject’s shots, $\bar{X}$ equals the overall mean distance, and N
equals the number of shots he took, that is, 20. This gives him a
distance deviation score of 6.15.

It should be clear that the same procedure may be 
used substituting the probabilities for the distances 
and the overall mean probability of the free-choice 
shots for all subjects (.34). This results in the
probability deviation score; for Subject 36 it equals
.292. For Subject 37, the distance deviation score is
1.44 and the probability deviation score is .115.
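
The two scores for Subjects 36 and 37 can be reproduced with a short sketch. It is written in Python purely for illustration and is not the procedure used in the study; the function and variable names are hypothetical, and the input values are those read from Table 1 together with the group means of 12.1 feet and .34 given in the text.

```python
# Illustrative sketch only; names are hypothetical, data are taken from Table 1.

GROUP_MEAN_DISTANCE = 12.1      # overall mean distance of free-choice shots (feet)
GROUP_MEAN_PROBABILITY = 0.34   # overall mean probability of free-choice shots

DISTANCES = [4, 7, 10, 13, 16, 19, 22]  # feet from target

SUBJECTS = {
    36: {"probability": [.70, .60, .20, .00, .10, .00, .00],
         "shots":       [10, 7, 3, 0, 0, 0, 0]},
    37: {"probability": [.90, .50, .40, .50, .20, .30, .10],
         "shots":       [0, 0, 9, 11, 0, 0, 0]},
}


def average_deviation(values, shots, center):
    """Mean absolute deviation of the per-shot values around `center`.

    `values[i]` is the value attached to distance i (a distance or a
    probability) and `shots[i]` is the number of free-choice shots
    taken from that distance.
    """
    n_shots = sum(shots)
    total = sum(n * abs(v - center) for v, n in zip(values, shots))
    return total / n_shots


def deviation_scores(subject):
    """Return (distance deviation score, probability deviation score)."""
    distance = average_deviation(
        DISTANCES, subject["shots"], GROUP_MEAN_DISTANCE)
    probability = average_deviation(
        subject["probability"], subject["shots"], GROUP_MEAN_PROBABILITY)
    return distance, probability


for number, subject in SUBJECTS.items():
    d, p = deviation_scores(subject)
    print(f"Subject {number}: distance {d:.2f}, probability {p:.3f}")
```

Rounded to the precision used in the text, the sketch gives 6.15 and .292 for Subject 36 and 1.44 and .115 for Subject 37.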

The advantages of the probability deviation score 


                     Fear of    n Achieve-     Test
                     failure       ment       anxiety
Hope of success       .38*         .84*        -.009
Fear of failure        -           .72*         .001
n Achievement          -            -          -.133

* p < .01.


may now become apparent. Such a score equates 
subjects on skill and compares only the type of risk 
they are taking with their own skill. It should be 
remembered that when the subject decided from 
what distance to take his free-choice shots, he was 
confronted with placards on which were the proba- 
bilities that he knew were the results of his practice 
trials and were then used in computing his score. The 
probability deviation score, therefore, should give a 
better indication of risk-taking behavior than the 
distance deviation score.⁴


RESULTS 


Relationships between the Independent Variables


Table 2 presents the correlations between 
the hope of success, fear of failure, total n 
Ach, and anxiety measures. Since the hope of 
success and fear of failure scores are com- 
ponents of the total n Ach score, the high cor- 
relations with n Ach are to be expected. 

⁴ The probability deviation score is a variant of a
score which might be derived from Atkinson’s (1957)
formula: Behavior = f (Motive × Expectancy × In-
centive). An analysis was performed using such a
multiplicative score with results similar to those
with the probability deviation score.


TABLE 3
MEAN PROBABILITIES OF SUCCESS AND MEAN NUMBER OF SUCCESSES IN PRACTICE TRIALS

                                   Feet from target                  Mean
                          4     7    10    13    16    19    22     number
Hope of success and fear of failure
  High-high              .89   .72   .47   .33   .24    ?     ?     28.5*
  High-low               .89   .58   .38   .25   .14   .10    ?     23.8
  Low-high               .88    ?     ?    .19   .14   .08   .04    23.4
n Achievement and anxiety
  High-high              .90   .64   .39   .24    ?     ?     ?     24.5ᵃ
  High-low                ?     ?     ?     ?     ?     ?     ?      ?
  Low-high                ?     ?     ?     ?     ?     ?     ?      ?
  Low-low                .90   .61   .36   .25   .16    ?    .05     ?

ᵃ No significant differences between these
* Analysis of variance shows this mean differs, p < .001.
TABLE 4
MEANS OF DISTANCE DEVIATION AND PROBABILITY DEVIATION SCORES BY
TOTAL n ACHIEVEMENT AND ANXIETY

                        Distance deviation scores           Probability deviation scores
                            Level of anxiety                      Level of anxiety
                        High         Low         Total        High     Low     Total
High n Achievement      3.78 (19)ᵃ   4.34 (16)   4.04 (35)    .291     .207    .214
Low n Achievement       3.81 (19)    3.90 (17)   3.85 (36)    .193     .199    .196
Total                   3.80 (38)    4.11 (33)   3.94 (71)    .206     .203    .205

ᵃ Number of subjects.


Tests of Hypotheses 


Hypothesis I predicted that subjects ori- 
ented by hope of success would be more suc- 
cessful in the 70 practice trials than subjects 
oriented by fear of failure. This hypothesis 
was checked by computing the mean number 
of successful shots out of the 10 shots which 
each subject took from each distance during 
the practice trials. These data are reported in 
probabilities of success, that is, if a subject 
had 5 successes out of 10, his probability 
from that distance was assumed to be .50. 
The average probabilities for subjects classi- 
fied by hope of success and fear of failure and 
by total n Ach and anxiety are presented in 
Table 3. 

The mean number of successful shots out of 
70 is also presented. These means were treated
by analysis of variance. Table 3 shows higher
mean probabilities of success from all except
the 4-foot distance for subjects with high hope
of success and high fear of failure. Analysis of
variance shows that the mean number of suc-
cesses is significantly higher for the high hope
of success, high fear of failure group than the
other groups (p < .001). These results con-
stitute an interaction rather than the main
effect of hope of success as predicted by the 
hypothesis. The results indicate no significant 
differences in probabilities or mean number of 
successes related to the total n Ach or anxiety 
scales. 

Before testing Hypothesis II using the devi-
ation measures, the mean distance of free-
choice shots and the associated probabilities
for the motivational subgroups were treated
by analysis of variance. These tests indicated
no significant relationships between the inde- 
pendent variables (n Ach, anxiety, hope of 
success, fear of failure) and either mean dis- 
tance of free-choice shots or mean probability 
of free-choice shots. 


TABLE 5
MEANS OF DISTANCE DEVIATION AND PROBABILITY DEVIATION SCORES BY
HOPE OF SUCCESS AND FEAR OF FAILURE

                         Distance deviation scores          Probability deviation scores
                          Level of fear of failure            Level of fear of failure
                         High        Low         Total        High     Low     Total
High hope of success     3.68 (12)ᵃ  3.76 (27)   3.74 (39)    .151      ?      .200
Low hope of success      5.46 (7)    3.84 (25)   4.19 (32)    .262     .197    .211
Total                    4.34 (19)   3.80 (52)   3.94 (71)    .192     .209    .205

ᵃ Number of subjects.
TABLE 6
ANALYSES OF VARIANCE FOR DISTANCE AND
PROBABILITY DEVIATION SCORES

                                Distance de-         Probability de-
                                viation scores       viation scores
Source                  df      MS        F          MSᵃ       F
n Achievement (A)        1      1.87      0.93       3.025     0.74
Anxiety (B)              1       .74      0.37       0.12      0.03
A × B                    1      0.98      0.49       0.685     0.17
Residual                67      2.01                 4.085
Hope of success (A)      1      7.75      4.35*      18.06     3.76
Fear of failure (B)      1      11.34     6.34**      .06       ?
A × B                    1      ?.50      5.30**     45.56     9.49***
Residual                67      1.80                 4.80

ᵃ These mean squares were computed dropping the decimal points in the
original probabilities used to compute the probability deviation score.
* p < .05.
** p < .025.
*** p < .005.
Fig. 1. Percentage of shots taken from each level of probability of success;
subjects classified by hope of success (HS) and fear of failure (FF).
Hypothesis II predicts that subjects ori-
ented toward achieving success will take more
moderate risks than subjects who are oriented
toward avoiding failure. Operationally, this
hypothesis predicts a relationship between the 
hope of success measure and the distance and/ 
or probability deviation scores. Table 4 pre- 
sents the means of both the distance and prob- 
ability deviation scores for the subjects cate- 
gorized by high and low total n Ach and anxi- 
ety. Table 5 presents the means of both the 
distance and probability deviation scores for 
the subjects categorized by high and low hope 
of success and fear of failure. Table 6 presents 
the results of the four separate 2 X 2 analyses 
of variance of the means presented in Tables 4 
and 5. 
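
As a consistency check on Table 6, each F ratio in an analysis of this kind is the mean square for the effect divided by the residual mean square. The following Python sketch recomputes the ratios from the mean squares that are legible in Table 6. It is an illustration with hypothetical names, not the authors' procedure, and it ignores any adjustment that may have been made for the unequal cell entries.

```python
# Illustrative check only; values are those printed in Table 6,
# and all names are hypothetical.

def f_ratio(ms_effect, ms_residual):
    """F ratio: effect mean square over residual mean square."""
    return ms_effect / ms_residual


# (label, effect MS, residual MS, F as printed)
ROWS = [
    ("n Ach, distance",             1.87,  2.01,  0.93),
    ("Anxiety, distance",           0.74,  2.01,  0.37),
    ("n Ach x Anxiety, distance",   0.98,  2.01,  0.49),
    ("n Ach, probability",          3.025, 4.085, 0.74),
    ("Anxiety, probability",        0.12,  4.085, 0.03),
    ("n Ach x Anxiety, probability", 0.685, 4.085, 0.17),
    ("Hope of success, distance",   7.75,  1.80,  4.35),
    ("Fear of failure, distance",   11.34, 1.80,  6.34),
    ("Hope of success, probability", 18.06, 4.80, 3.76),
    ("HS x FF, probability",        45.56, 4.80,  9.49),
]


def max_discrepancy(rows):
    """Largest absolute gap between a recomputed and a printed F."""
    return max(abs(f_ratio(ms, res) - printed)
               for _, ms, res, printed in rows)


for label, ms, res, printed in ROWS:
    print(f"{label:30s} computed F = {f_ratio(ms, res):5.2f}"
          f"   printed F = {printed:5.2f}")
```

The recomputed ratios agree with the printed values to within .05. The small gaps in the two hope of success and fear of failure rows for distance may reflect rounding of the residual mean square before printing.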
Fig. 2. Percentage of shots taken from each level of probability of success;
subjects classified by total n Achievement (n Ach) and anxiety.
(Horizontal axis: empirical probability of success.)

The results indicate that the distance devi-
ation and probability deviation means in Ta-
ble 4 are such that they may be considered to
be all drawn from the same population;
that is, none of the tests presented in Table 6
using total n Ach and anxiety is signifi-
cant. Thus Hypothesis II derives no support
when the independent variables are opera-
tionalized using the total n Ach and anxiety
scores.

The means presented in Table 5 cannot be
considered to be drawn from the same popula-
tion. The results with distance deviation
scores and probability deviation scores re-
lated to hope of success and fear of failure
indicate significant interaction between the
two variables. Although the main effects with 
distance deviation scores appear to be signifi- 
cant, over and above the interaction they are 
not significant. These results do not confirm 
the simple main effect relationship predicted 
by Hypothesis II. They must be interpreted 
as more complex in view of the interactive 
effects of hope of success and fear of failure. 
As an aid to the interpretation of the inter- 
action effect on the probability deviation 
scores, Figures 1 and 2 are presented. These 
figures show the percentages for shots taken 
from various distances representing different 
points of probability of success. Since subjects 
were free to choose the distances from which
they shot, the figures represent a composite— 
some subjects shooting from some of the 
points but not all points representing all sub- 
jects or vice versa. Therefore, there is no way 
to analyze these data in this form statistically. 
They do present us with a picture of the devi- 
ations which we have analyzed statistically, 
however. 

Figure 1 presents these results comparing 
subjects categorized as to high and low hope 
of success and fear of failure. It therefore 
presents data deriving from the situation
where the significant interaction was found
(p < .005). Figure 2 presents for comparison
purposes the results with subjects categorized 
as to high and low total n Ach and anxiety, 
where no significant differences were found. 


Discussion 


At the outset it was expected that the two 
major innovations in the present experiment 
would merely heighten the significance of the 
results from previous experiments. However, 
it is apparent that the innovations affected 
the previously found relationship between risk 
taking and overall achievement motivation 
and anxiety. These two variables previously 
found to relate to risk taking showed no sig-
nificant relationship with either dependent
measure under the present conditions.


Contrasts with Previous Studies


There are several unique aspects of this ex- 
periment which may account for the failure to 
find an effect of n Ach and anxiety on risk- 
taking behavior. The most obvious unique as- 
pects are the clear establishment of probabili- 
ties of success prior to the free-choice trials, 
the attempt to isolate the subject from spec-
tators, the physical setup using a volleyball
and basket, and the age of the subjects.

There is no reason to believe that age is 
critical. McClelland (1958) used 5- and
9-year-olds and Atkinson and Litwin (1960) 
used college students in risk-taking studies. 
The relevant aspects of the volleyball situa- 
tion seem to compare with the ringtoss and 
shuffleboard games used in the previous experi- 
ments. 

The alone versus group presentation may 
have made a difference since our subjects had 
no way of comparing their performance with
that of others. They could only compete with 
their own standard of excellence. If this is the 
crucial difference, it would suggest that n Ach 
and anxiety are related to risk taking when 
the subject is comparing his performance with 
that of others. Hope of success and fear of 
failure may be needed to predict risk taking 
with regard to individual standards. There is 
no way of testing this suggestion in the pres- 
ent data. 

Probably the major difference between this 
and previous studies is the fact that subjects 
in this experiment went into the risk-taking 
free-choice trials knowing their probabilities of 
success at each distance whereas subjects in 
previous experiments approached the situation 
with vague subjective probabilities. There is 
some evidence (Atkinson et al., 1960; Kaus- 
ler & Trapp, 1958) that subjects with high n 
Ach approach tasks with higher subjective 
probabilities of success when probabilities are 
not explicit. In the present experiment, the 
subjects were confronted with empirical prob- 
abilities which did not differ as a function of 
n Ach or anxiety. These results suggest that 
subjects with high n Ach may take more mod- 
erate risks only when they have little evidence 
about actual probabilities. 

This latter effect may also interact with the 
alone versus group conditions. High n Ach 
subjects with little evidence of their own 
probabilities may assume that they can do 
better than others in the group that they have
been watching. This may induce them to con-
centrate their shots slightly above where the
group appears to be having moderate success.
This may appear to them the greatest test of
their skill. Anxious subjects who wish to avoid
such a test tend to avoid this critical region.
These factors would account for results of
previous experiments.


Hope of Success and Fear of Failure Results 


Turning to the results with our measures,
the data clearly indicate a significant dif-
ference in skill between subjects with high
hope of success and high fear of failure and
the other three classifications of subjects. The
high-high group had an average of 5 more
successes in the 70 practice trials than did the
other three groups, a difference which was sig-
nificant beyond the .001 level. These successes
resulted in higher probability of success in the 
middle ranges for this group of subjects. The 
distance deviation score which is similar to the 
one used by Atkinson et al. (1960) does not 
equate subjects on skill. Results using this 
score may be contaminated by the fact that 
the high-high group was found to be more 
skillful. Using a distance deviation score, a 
significant interaction was found between hope 
of success and fear of failure indicating that 
the one group of subjects with high fear of 
failure and low hope of success stood out in 
that they avoided the moderate risks more 
than the other three groups. This is the way 
Atkinson would predict the subjects with anxi- 
ety or fear of failure would act. 

The interaction is even more significant us-
ing the probability deviation score. The ra-
tionale for developing the probability devia-
tion score takes into account the probability 
of success assessed for each individual at each 
shooting distance. Therefore, this score can 
be taken as the most theoretically valid test 
of the hypothesis since it equates the subjects 
as to their probability of success from the 
various distances. Results from this score can 
be unequivocally attributed to differences in 
level of motivation over and above any dif-
ferences in probability of success. Apparently,
the measure of hope of success and fear of
failure has succeeded in tapping a motiva-
tional variable with an effect on risk-taking
behavior that is independent of skill.

The most striking results in Figure 1 are
for the subjects with high hope of success and
high fear of failure who concentrate almost
one-third of their shots at the .30 probability
point. This represents the calculated risk.

Subjects with high fear of failure and low
hope of success definitely avoided this point
of calculated risk. Their results are bimodal.
Their highest percentage of shots occurred
where their probability of success was only
.10. They show a deep trough in the middle
ranges. Only 7% of their shots were taken at
the point of .30 probability of success. An-
other secondary mode occurs at the .70 point.

One can interpret these data as attempts to
avoid any real test of their skill. Having been
placed into a situation where they fear failure,
they tend to choose positions where they have
a relatively high probability of success (but
will receive fewer points) or to choose posi- 
tions where they have a relatively low prob- 
ability of success (but will receive many points 
if successful). This behavior may be seen as 
the best way to resolve an avoidance-avoid- 
ance conflict. Fear of failure would dictate 
avoidance of the entire situation, but they 
cannot withdraw without incurring the dis- 
pleasure of the experimenter. The only solu- 
tion is to take impossible shots where failure 
is expected but a chance success would be re- 
warded handsomely or to settle for relatively 
easy shots with low incentives.

The subjects with high hope of success and 
low fear of failure show a pattern quite simi-
lar to those with high fear of failure and low
hope of success. They tend to stress the very 
high risk more than any other group, but they 
also have a minor mode at .60 probability of 
success. 

Apparently, subjects with both hope of suc- 
cess and fear of failure resolve an approach- 
avoidance conflict by concentrating their ef- 
forts at a point of moderately high risk, but 
one which may be considered a calculated risk. 
Subjects motivated primarily by fear would 
be expected to leave the situation, but under 
restraint avoid the points where their skill 
might receive the most rigorous test. Subjects
motivated primarily by hope of success would 
not be expected to attempt to avoid the situ- 
ation, and they apparently will take the most 
extreme risks of any group. Why they tend to 
avoid the middle range and have another 
minor mode at a relatively easy point is un- 
clear. 

In general, the results of the present experi- 
ment seem to indicate that when alone and 
when the level of probability of success is em- 
pirically assessed and impressed upon the sub- 
jects as they enter the risk-taking situation 
there is no effect of overall n Ach and/or 
anxiety upon the risk-taking behavior of the 
subject. However, it has been demonstrated 
that subjects with high hope of success and 
high fear of failure as indicated in their TAT 
protocols are more skillful in the risk-taking 
task. In addition to identifying subjects with 
greater skill, the measures of hope of success 
and fear of failure succeeded in predicting 
differential risk-taking behavior over and 
above the differences found in probabilities of
success.
A FACTOR-ANALYTIC STUDY OF RELIGIOUS
BELIEF AND BEHAVIOR


VICTOR B. CLINE
University of Utah

AND JAMES M. RICHARDS, Jr.
American College Testing Program, Iowa City, Iowa


A sample of 154 adult male and female Ss was studied regarding religious
beliefs and behavior using 3 procedures: (a) a TAT-type projective device,
(b) an intensive depth interview, and (c) a 67-item questionnaire. 58 variables
were derived from these procedures which were (for the sexes separately)
intercorrelated and factored. 12 factors emerged for the females and 14 for the
males. Some evidence suggested that these factors were stable across religious
denominations. The average intercorrelation of the “overall religious commit-
ment” measure from each of the above 3 devices was .66, suggesting a fairly
good cross-method validity. Major sex differences emerged between the males
and females. One of the most striking findings was the close to 0-order relation-
ship between the major religiosity factors (as measured by church at-
tendance, frequency of prayer, contribution of monies, etc.) and such variables
as “having love and compassion for one’s fellow man,” “being a Good
Samaritan,” “possessing humility,” etc.


In surveying the psychology of religion 
after 50 years, Page (1951) commented, 


The psychological investigation of religion in America 
beginning about 1900 with Starbuck’s studies of 
conversion and James’ famous classic, ran a fairly
lively course of productive activity during the first 
quarter century. But since then interest and output 
have steadily declined until today it would not 
perhaps be untrue to say that the subject is regarded 
by many psychologists with almost complete indif-
ference and by some with positive suspicion and even
disfavor. Thus one studies tribal ceremonies of primi-
tive cultures, religious delusions of the psychotics,
conversion experiences of adolescents, but not the
religious behavior of normal adults of our own 
culture [p. 60]. 


In a review of all studies in the area of
religion reported in the Psychological Ab-
stracts for the quarter century 1927-53,
Michaels (1956) found that only 12% were
of an empirical nature, which averaged less
than six per year in the literature.

… interviewer, as well as Richard Porter, Guy
piesman, Kenneth Bennion, Dorothe Weinberg,
Joyce Stavros, Elaine Davies, and Robert Hawes,
who all contributed much to the project.

In reviewing Meissner’s (1961) Annotated
Bibliography in Religion and Psychology,
McDowell (1963) found that the bulk of the
published material in such journals as Pasto-
ral Psychology, for the decade 1950-60 was
neither empirical nor “experimental” in na- 
ture and consisted in great part of “discus- 
sions of issues,” reviews of a single case study, 
etc. He concluded that Thouless’ (1938) 
comment two decades prior was still applicable, 
“There is a marked poverty of scientific 
knowledge in the psychology of religion.” 
In fact, in the Fifth Mental Measurements
Yearbook (Buros, 1959) only five tests were
listed under a religious heading. Four of these
were knowledge tests published by specific 
denominational groups for use in parochial 
school systems while the fifth was a general 
test of Bible information which was given a 
particularly critical and devastating review. 
The evidence would suggest that significant 
empirical studies of the psychology of religion 
are a real rarity and that this has certainly 
not been a popular area of study for psycholo- 
gists. And yet religion in one form or another 
is still important to most United States citi- 
zens and whether they identify with it or 
rebel against it, it still affects their lives in 
a multitude of ways. This has been impres- 
sively demonstrated by Lenski (1961), a 
sociologist, who probably has made the most 
important major recent contribution to this 
area in his The Religious Factor. His research 
grew out of the Detroit Area Study, where he 
investigated the consequences of religious 
belief and practice on the “everyday life” of
his subjects. Lenski sharply distinguished be-
tween types of religious commitment or in-
volvement, the major classification being “as-
sociational involvement” (based on frequency
of church attendance, etc.) and “communal 
involvement” (based on choice of marriage 
partners, friends, etc., of the same religion). 
In his study he found very little relationship 
between devotionalism (prayer life, actually 
living the precepts of one’s religion in every- 
day life, etc.) and doctrinal orthodoxy (how 
orthodox one is in religious belief). Probably 
his major finding was that religious organiza- 
tions not only are remaining vigorous and 
influential in contemporary American society, 
but that most evidence pointed to gains in 
“associational” vigor and vitality for the fore- 
seeable future. A great body of evidence sug- 
gested that religion in various ways was 
constantly influencing the daily lives of the 
masses of men and women in modern America 
and, more than that, through its impact on 
individuals, religions make an impact on all 
other institutional systems of the commu- 
nity in which these individuals participate. 
Depending upon the socioreligious group to 
which a person belongs, the probabilities are 
increased or decreased, for example, that he 
will enjoy his occupation, indulge in install- 
ment buying, save to achieve objectives for 
the future, vote Republican, favor the welfare 
state, oppose racial integration in the schools, 
migrate to another community, develop a 
commitment to the principle of intellectual 
autonomy, have a large family, complete a 
given unit of education, or rise in the class 
system. The evidence Lenski presented 
strongly suggested that religion acted in a 
causal way and was not merely correlated 
with certain kinds of behaviors and events. 
The authors of the study presented in this 
paper have likewise been interested in study- 
ing the impact of religion and religious belief 
on behavior, but have been concerned about 
the looseness and lack of rigor of most of the 
“research” in this area. In an effort to make 
greater use of the tools of empiricism and 
scientific methodology the researchers pro-
ceeded with variable success to study a most
complex phenomenon as developed below.
METHOD
Selection of Sample 


A random sample of 155 adult males and females 
was drawn from the City Directory listing every 
adult residing in the greater metropolitan Salt Lake 
area. The method used in selection was to take the 
tenth name down in the left-hand column on every 
tenth page, alternately choosing male and female 
subjects. If the tenth person down were the “wrong” 
sex, then the next individual of the appropriate sex 
down the list was chosen. This meant that every 
resident adult male and female in the Salt Lake Val- 
ley had an “equal” opportunity of being included in 
this sample. The only individuals not included were 
nonresident transients and tourists staying briefly in 
motels or hotels. Ten percent of the sample drawn 
could not participate in this study: 7% had
moved, and 3% refused for such reasons as “being
too busy,” etc. In these cases the researchers
used as a substitute another adult of the appropriate
sex who lived in or adjacent to the apartment or 
residence in question. 
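
The selection rule described above amounts to a systematic sample with alternation by sex. A minimal Python sketch follows, assuming a hypothetical representation of the directory as a list of pages, each holding its left-hand column as (name, sex) pairs. Starting with a male subject, and skipping a page that has no eligible name at or below the tenth line, are assumptions not stated in the text.

```python
# Illustrative sketch only; the data layout and all names are hypothetical.

def select_sample(pages, every_nth_page=10, nth_name=10, first_sex="M"):
    """Systematic selection from a directory, alternating by sex.

    `pages` is a list of pages; each page is the left-hand column as a
    list of (name, sex) tuples, with sex given as "M" or "F".
    """
    wanted = first_sex
    sample = []
    # Visit pages 10, 20, 30, ... (1-based), i.e. indices 9, 19, 29, ...
    for page_index in range(every_nth_page - 1, len(pages), every_nth_page):
        column = pages[page_index]
        # Start at the tenth name and move down the list until a person
        # of the wanted sex is found.
        for name, sex in column[nth_name - 1:]:
            if sex == wanted:
                sample.append((name, sex))
                wanted = "F" if wanted == "M" else "M"
                break
        # Assumption: if no eligible name is found, the page is skipped
        # and the same sex is sought on the next selected page.
    return sample
```

Substitution for subjects who had moved or refused is not modelled here, since it depended on who lived in or adjacent to the residence in question.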


Method of Data Collection 


A single trained interviewer collected all of the test 
data and other information in the following manner. 
First, individuals selected for the study were sent a
letter indicating that the University of Utah was 
conducting a research project. They were told that 
the researchers would like to survey their opinions 
and attitudes about a number of questions. No hint 
or information was given as to the nature of the 
questions or the purpose of the study. Later a phone 
call or personal visit was made in order to secure 
a specific appointment time with the individual. 
When the interviewer arrived for the appointment 
he asked permission to record the interview, “So
I’ll not make any mistakes in taking down what
you say.” Permission was granted by the partici- 
pants without exception. A special transistorized 
battery-operated recording device was used. This was 
built into a small innocuous “business-type” leather 
briefcase. There were no visible wires, microphone,
or anything else noticeable that might tend to make 
the respondent nervous or give him “mike fright” 
about having his conversation and words recorded. 
Later, stenographers made typescripts of each of the
154 interviews. During the session with the subject
three types of procedures were administered: a modi-
fied TAT-type test, a depth interview, and a 67-item
Religious Belief-Behavior Questionnaire. 


Tests Used and Information Collected 


Modified TAT. After securing permission to record 
the interview, the interviewer without any further 
explanation presented, serially, a set of seven TAT- 
type projective pictures. The instructions required 
that the respondent look at each picture and tell 
what was happening, what the people were thinking 
and feeling, and what would be the outcome. After 
the initial response to each picture the interviewer 
probed considerably as to the full implications of the
response and inquired about features in the card 
that might have gone uncommented on. In doing 
this, great care was taken not to suggest what the 
nature of the study was. The pictures, described 
below, were presented in this order: 


1. A man is putting fishing tackle in his car, 
while across the street people are entering a church 
or synagogue. 

2. A man is speaking in front of a group of 
people inside a building. This could be a clergyman 
in a church, but the cues are somewhat ambiguous. 

3. A man is placing some money in what appears 
to be a collection plate, but again the scene is 
somewhat ambiguous and could be interpreted 
variously. 

4. A young boy is kneeling at his bedside in a 
posture suggesting prayer. 

5. A physician tells a woman sitting at the 
bedside of a man who is apparently very ill, 
“There is nothing more I can do.” 

6. The scene is a graveside funeral. A casket is 
being lowered down into the grave. 

7. A group of shrouded figures are gathered 
together in a depressing and dismal scene. Hover- 
ing above is a “Satanic and evil” looking figure 
faintly seen. 


In commencing the interview the researchers did 
not structure the nature of the study or give any 
suggestion that it dealt with attitudes, beliefs, and 
behavior in the area of religion for the following 
reasons. Such researches as the Detroit Area Study 
(Lenski, 1961) suggest that approximately 90-97% 
of the adult United States population say they 
believe in a deity. In other words nearly everybody 
says they believe in God, or are “for God.” How- 
ever, in prior unpublished pilot studies by the 
authors some evidence was obtained suggesting that 
many individuals typically give pat, stereotyped, and 
conventional responses to questions about religion 
(especially to strangers) which do not reflect accu- 
rately the real complexities or conflicts they have 
about what they believe. To get at more basic and, 
hopefully, pervasive and deep-seated attitudes and 
feelings about religion and religious belief, it was 
decided to start with a mildly ambiguous projective 
device (such as the one described above). This, at 
least initially, would not give the individual any 
special “set,” with less likelihood of conscious distor- 
tion of responses. In other words, it was felt that 
the subjects’ reactions to these projective pictures 
might provide more honest and valid information 
than a more direct approach where faking, dis- 
simulation, and distortion might be more easily 
possible. 

Responses to the TAT-type pictures were con- 
verted to scores in the following way. Three trained 
raters reviewed each subject’s protocol and made six 
ratings using a 1-5 scale. These six variables 
were: overall religious commitment, feelings 
of guilt, religious conflict, defensiveness about re- 
ligion, sheer number of religious themes, and number 
of themes suggesting hostility toward religion. These 
variables were chosen following several pilot studies 
where a larger number of potential variables were 
tried out using other samples of subjects. In making 
these ratings, the raters considered the entire set of 
responses a subject gave to all seven pictures rather 
than considering or scoring each picture separately. 
The score each subject received on each of the six 
variables was the average of the three raters’ ratings. 

Depth interview. Following the administration of 
the modified TAT the researcher proceeded to con- 
duct a depth interview aimed at searching out the 
quality, nature, and intensity of the subjects’ belief 
about deity, church, and what impact, if any, these 
had upon his life and his relationship with parents, 
spouse, associates, etc. A great deal of specific in- 
formation about both one’s religious behavior and 
beliefs was secured. In addition, data on age, sex, 
marital status, religious and political preference were 
obtained. Typescripts of these interviews were read, 
studied, and rated by five graduate psychologists- 
sociologists, including the original interviewer. They 
made 38 ratings on each subject using for the most 
part a 1-5 scale. Ratings were based on a total 
analysis of the interview material, not just isolated 
comments or bits of information out of context. A 
subject’s final score for a particular variable was 
the mean of these ratings. These raters were selected 
so that their own attitudes and orientation toward 
religion were heterogeneous. Interrater reliability was 
calculated by correlating the ratings on each separate 
item for each rater, against each other rater. Thus 
for a particular variable, there was a matrix of 10 
correlations for the five raters, or 380 correlations 
for all 38 variables. The average correlation (or 
measure of agreement) between the individual raters 
was .52, with a range of from -.20 to +1.00. 
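
The reliability procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the example ratings are hypothetical, not taken from the study, and the original computation was of course not done this way. It shows why five raters yield 10 rater pairs per variable (and hence 380 correlations over 38 variables), and how a subject's final score is the mean of the raters' ratings.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def pairwise_rater_correlations(ratings_by_rater):
    """Correlate every rater with every other rater on one variable.

    ratings_by_rater holds one list per rater; each list contains that
    rater's score for every subject, in the same subject order.
    """
    return [pearson_r(a, b) for a, b in combinations(ratings_by_rater, 2)]


def subject_scores(ratings_by_rater):
    """Final score per subject: the mean of the raters' ratings."""
    return [sum(column) / len(column) for column in zip(*ratings_by_rater)]


# Hypothetical 1-5 ratings of six subjects by five raters (invented data).
example = [
    [1, 2, 3, 4, 5, 3],
    [2, 2, 3, 5, 5, 3],
    [1, 3, 3, 4, 4, 2],
    [1, 2, 4, 4, 5, 3],
    [2, 1, 3, 4, 5, 4],
]
pairs = pairwise_rater_correlations(example)
mean_agreement = sum(pairs) / len(pairs)  # average interrater correlation
final_scores = subject_scores(example)
```

Repeating `pairwise_rater_correlations` for each rated variable and averaging all the resulting coefficients gives the kind of overall agreement figure reported above.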

Religious Belief-Behavior Questionnaire.² Follow- 
ing the interview, the subject was given a question- 
naire developed by the authors consisting of 8 scales 
and 67 Likert-type items. This questionnaire assessed 
both religious belief and behavior. The names of the 
8 scales suggest something of the content of the 
items: Public Religious Behavior, Private Religious 
Behavior, Personal Religious “Experience,” Beliefs 
about God, Beliefs about Good versus Evil, Beliefs 
about Organized Religion, Beliefs about Immortality, 
and Beliefs about the Scriptures. An example of the 
type of item included is: 


Within the last two years I have prayed: 
___ 1. Frequently 
___ 2. Occasionally 
___ 3. Rarely 
___ 4. Never 


This item belonged to the second scale, Private 
Religious Behavior. 

It proved difficult to avoid the common error of 
previous studies of religiosity, which typically have 
used questionnaires which measured primarily 
“orthodoxy” or “conventionality” of belief, and 


2 Copies of this questionnaire may be secured 
from the senior author without charge upon request. 
TABLE 1 
RELIGIOSITY INTERVIEW RATING VARIABLES 

 1. Relationship with spouse?    Positive  Neutral  Negative 
 2. Belief in God?    Yes  Uncertain  No 
 3. How active in religion?    1 2 3 4 5 (low, average, high) 
 4. Overall importance of religion in life: degree of psychic (religious) commitment?    1 2 3 4 5 
 5. Father’s overall religiosity?    1 2 3 4 5 
 6. Mother’s overall religiosity?    1 2 3 4 5 
 7. Relationship with mother?    Positive  Neutral  Negative 
 8. Relationship with father?    Positive  Neutral  Negative 

About subject’s spouse 
 9. Belief in God?    Yes  Uncertain  No 
10. How active in religion?    1 2 3 4 5 (low, average, high) 
11. Overall importance of religion in life: degree of psychic (religious) commitment?    1 2 3 4 5 
12. Father’s overall religiosity?    1 2 3 4 5 
13. Mother’s overall religiosity?    1 2 3 4 5 
14. Relationship with mother?    Positive  Neutral  Negative 
15. Relationship with father?    Positive  Neutral  Negative 

About the subject 
16. Humbleness (meek, modest, lacking arrogance, pride, etc.).    1 2 3 4 5 
17. Genuine love-compassion-sympathy for others.    1 2 3 4 5 
18. Deep, profound, abiding faith, and belief in God.    1 2 3 4 5 
19. Good Samaritan, Do-er, who really “helps others.”    1 2 3 4 5 
20. Intensity, amount, and importance of “prayer life.”    1 2 3 4 5 
21. Degree to which he lives up to teachings of his religion (or personal ethical philosophy).    1 2 3 4 5 
22. Frequency of attendance at religious services.    1 2 3 4 5 
23. Degree to which he lives a good moral life (personal morality, ethics, etc.).    1 2 3 4 5 
24. Degree to which he is conflicted, doubtful, and uncertain about basic religious questions.    1 2 3 4 5 
25. Degree of tragedy, suffering, and misfortune in his life.    1 2 3 4 5 
26. Degree to which he is a neurotic or disturbed person.    1 2 3 4 5 
27. Degree to which he “uses or exploits” religion to his advantage (for business, social, status, or similar reasons).    1 2 3 4 5 
28. Degree to which he is an “authoritarian, power-centered, manipulative” type of person (in all areas).    1 2 3 4 5 
29. Degree to which he is self-centered, selfish, self-concerned.    1 2 3 4 5 
30. Degree to which he is a “religious phony,” hypocritical, insincere, cynical, “two-faced” about religion.    1 2 3 4 5 
31. Degree to which he is a “religious literalist,” accepts everything in Bible exactly as it is.    1 2 3 4 5 
32. Degree to which he is dogmatic, extremely sure that he is right and those who disagree are completely wrong.    1 2 3 4 5 
33. Degree to which he is pragmatic . . . demands absolute tangible proof before “he'll believe” anything religious.    1 2 3 4 5 
34. Degree to which he approaches religion from a theoretical and philosophical point of view rather than by “faith.”    1 2 3 4 5 
35. How much does he know about religion (especially his own), has he read and studied much, knowledge of Bible, etc.?    1 2 3 4 5 
36. Degree to which he is antagonistic to religion in general.    1 2 3 4 5 
37. How much do you “personally like” this person (aside from any religious considerations)?    1 2 3 4 5 
38. Extent of change (religiously) in his life.    Little/None  Some  Great Deal 
39. Direction of change (if indicated)?    More Religious  Less Religious 


therefore penalized a member of the more liberal 
religious groups or persons having a less orthodox 
kind of religious orientation. For example, a question 
about the existence of Satan would, perhaps, unduly 
discriminate against some Protestants who view 
Satan as a symbolic or mythical figure but (these 
Protestants) in other ways could be considered 
deeply religious. 

Similarly, a belief in immortality is an important 
part of the faith of most Christians, but not of the 
faith of many liberal Jews. On the other hand, a 
questionnaire devoid of any items dealing with 
specific dogma would tap nothing more than vague 
unspecified attitudes toward religion. As a com- 
promise, the researchers attempted to solve these 
problems by grouping items into 8 subareas, so that, 
for example, if a person disclaimed belief in the 
personal existence of Satan, this would not neces- 
sarily be combined or lumped together numerically 
with his beliefs about the personal existence of God. 
In order to permit comparisons with other studies, 
however, scores on the 8 subscales were also 
combined into a total score. 
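
The compromise described above (separate subscale scores plus one combined total) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration: the item identifiers, the item-to-scale key, the numeric coding of the Likert responses, and the use of simple sums are hypothetical, since the source does not give the scoring key.

```python
def score_questionnaire(responses, scale_items):
    """Return (subscale scores, total score).

    responses:   dict mapping item id -> numeric Likert response
    scale_items: dict mapping scale name -> list of item ids
    Summing items is an assumed scoring rule, not one stated in the paper.
    """
    subscales = {
        scale: sum(responses[item] for item in items)
        for scale, items in scale_items.items()
    }
    return subscales, sum(subscales.values())


# Hypothetical key and answers: strong private practice, no belief in a
# personal Satan. The two areas stay separate in the subscale scores.
scale_items = {
    "Private Religious Behavior": ["q1", "q2"],
    "Beliefs about Good versus Evil": ["q3", "q4"],
}
responses = {"q1": 4, "q2": 3, "q3": 1, "q4": 1}
subscales, total = score_questionnaire(responses, scale_items)
```

Keeping the subscales apart means a low score in one belief area does not mask a high score in another, while the total remains available for comparison with single-score studies.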

Thus in summary this study included 58 separate 
measures relevant to the religious beliefs and prac- 
tices of the subjects: 6 ratings of responses to the 
TAT-type pictures, 39 ratings based upon the depth 
interview, 9 subscale scores as well as a total score 
from the Religious Belief-Behavior Questionnaire, 
and four demographic variables (age, religious and 
political preference, and sex). All of the variables 
were intercorrelated and factor analyzed, using a 
principal components (eigen value and eigen vector) 
solution with a varimax rotation. 


RESULTS 


Information about and Characteristics of the 
Sample 


The sample was exactly divided (50% 
versus 50%) between male and female sub- 
jects as was provided for in the original 
research design. With regard to political 
preferences, 33% unequivocally listed Demo- 
crat and 28% Republican as their choice, 
while another 39% listed themselves as Inde- 
pendent, or “I vote for the man, not the 
party.” With regard to age range, 14% were 
in the 20-29 category, 39% were 30-45, 
another 27% were 46-60, and 20% were 60 
or over. With regard to religious preference, 
72% listed themselves as LDS (Mormon) 
while 11% gave no preference, 9% Protes- 
tant, 4% Catholic, and 4% other (Jewish, 
Christian Science, etc.). 


Relationship between the Three Measures of 
Religiosity 

A most important question to be answered 
involves the relationship of the overall re- 
ligiosity or religious commitment index from 
the projective tests, interview data, and ques- 
tionnaire procedure. Their average intercor- 
relation was .66 with a range of from .50 
to .84, which is remarkably high, especially 
for the projective TAT-like device. Many 


3 Through the courtesy of Western Data Processing 
Center, University of California, Los Angeles. 
previous studies using fantasy materials have 
found them monotonously disappointing in 
their yield and of questionable validity. It 
may be that projective tests in some cases 
need to be custom-made (as was done in this 
study) to tap specific dimensions germane 
to the research. The relatively high correlations 
between the three quite different procedures 
assessing overall religious commitment amount 
to a kind of cross-method validity. 


Intercorrelational and Factor Analysis 


In analyzing these data, the next step was 
to intercorrelate the 58 test and rating vari- 
ables for the sexes separately. Results are 
presented in Table A,* with correlations for 
males appearing above the diagonal and 
results for females below the diagonal. Fol- 
lowing this, a principal components (eigen 
value and eigen vector) factor analysis was 
performed for each sex, using varimax solu- 
tions for the rotations (this is an orthogonal 
rotation). All factors having an eigen value 
greater than 1.00 were retained in the rota- 
tion. Results for males are presented in 
Table B* and for females in Table C.* 
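
For readers who want to see the analytic steps concretely, the following Python sketch (using NumPy) walks through the same sequence: intercorrelate the variables, extract principal components, retain those with eigenvalues greater than 1.00, and apply an orthogonal varimax rotation. It is a modern illustration, not the program used in the study; the function names are hypothetical, the varimax routine is the commonly used SVD-based iteration, and the data are synthetic.

```python
import numpy as np


def principal_component_loadings(data):
    """Loadings of the components whose eigenvalue exceeds 1.00."""
    corr = np.corrcoef(data, rowvar=False)           # variable intercorrelations
    eigenvalues, eigenvectors = np.linalg.eigh(corr)
    order = np.argsort(eigenvalues)[::-1]            # largest eigenvalue first
    eigenvalues, eigenvectors = eigenvalues[order], eigenvectors[:, order]
    keep = eigenvalues > 1.0                         # eigenvalue-greater-than-one rule
    return eigenvectors[:, keep] * np.sqrt(eigenvalues[keep])


def varimax(loadings, max_iter=100, tol=1e-8):
    """Orthogonal varimax rotation via the usual SVD-based iteration."""
    n_vars, n_factors = loadings.shape
    rotation = np.eye(n_factors)
    criterion = 0.0
    for _ in range(max_iter):
        rotated = loadings @ rotation
        gradient = loadings.T @ (
            rotated ** 3
            - rotated @ np.diag(np.sum(rotated ** 2, axis=0)) / n_vars
        )
        u, s, vt = np.linalg.svd(gradient)
        rotation = u @ vt                            # stays orthogonal
        new_criterion = s.sum()
        if new_criterion < criterion * (1 + tol):    # no further improvement
            break
        criterion = new_criterion
    return loadings @ rotation


# Synthetic data: 200 cases, six variables driven by two latent factors.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 2))
signal = np.hstack([latent[:, [0]].repeat(3, axis=1),
                    latent[:, [1]].repeat(3, axis=1)])
data = signal + rng.normal(scale=0.5, size=(200, 6))

loadings = principal_component_loadings(data)
rotated = varimax(loadings)
```

Because the rotation is orthogonal, each variable's communality (the sum of its squared loadings) is the same before and after rotation; only the distribution of loadings across factors changes, which is what makes the rotated factors easier to name.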

The results presented above reveal con- 
siderable complexity within the religiosity 
domain, a finding contrasting with most 
theoretical formulations which imply a single 
global factor or at most 2 or 3 factors. In 
the present study, 12 factors were found for 
men and 11 factors for women. On the other 
hand, it should also be noted that there is 
a high intercorrelation among the total re- 
ligiosity measures from the three separate 
experimental procedures (projective test, 
questionnaire, and interview), with correla- 
tion values ranging from .56 to .81. The factors 
obtained are named and/or described below. 

Female factors. Factor I is entitled Re- 
ligious Belief and Behavior with high load- 
ings on all questionnaire scale variables, the 
religious-commitment variable from the pro- 
jective test, and interview variables reflecting 

4This table has been deposited with the American 
Documentation Institute. Order Document No. 
$319 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress, 
Washington, D. C. 20540. Remit in advance $1.25 
for microfilm or $1.25 for photocopies and make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 
belief in God, importance of religion, and 
participation in organized religion. It thus is 
highly similar to the common meaning of 
“religiosity.” 

Factor II involves having a spouse who had 
good relations with religious parents, as 
measured by the interview. 

Factor III might most appropriately be 
termed the Compassionate Samaritan factor. 
It has high loadings on the interview ratings 
of “love and compassion,” “being a good 
Samaritan,” and being “liked” by the raters 
of the interview protocol, and a high negative 
loading on being self-centered. The most sig- 
nificant thing about this factor is undoubt- 
edly its relative independence of acceptance 
of conventional religious dogma. 

Factor IV involves Projected Guilt, since 
its highest loading is on the projective test 
guilt variable. Secondary loadings suggest that 
persons scoring high on this variable tend to 
believe in God, but do not live up to the 
teachings of their church nor are they par- 
ticularly moral in their personal life. The 
projected guilt almost certainly results from 
the “cognitive dissonance” arising from this 
beliefs-actions inconsistency. 

Factor V involves having good relations 
with a religious father as reported in the 
interview. 

Factor VI involves primarily Projective 
Test Religiosity, with high positive loadings 
on the projective test variables of “number 
of religious themes” and “religious commit- 
ment,” and negative loadings on “hostile 
themes,” “defensiveness about religion,” and 
“religious conflict.” 

Factor VII has high loadings on age, 
tragedy, and suffering, as revealed in the 
interview. The high loading on marital status, 
because of the peculiarities of the scoring 
system for this variable, indicates that per- 
sons scoring high on this variable tend to 
be widows. Thus this factor involves tragedy 
and suffering in later life alone. The tend- 
ency toward rated “neuroticism” of persons 
scoring high appears to be a result of this 
tragedy and suffering, and an understandable 
reaction to their life situation. 

Factor VIII should be reflected when inter- 
preting it, and involves having good relations 
with a religious mother, and a tendency not 
to change one’s own religious views, whatever 
those views might be. 

Factor IX involves Religious Hypocrisy 
with high loadings on interview ratings as 
being a religious “phony,” as exploiting re- 
ligion, as having conflict and doubts about 
religion, and on projective measures of 
defensiveness and religious conflict. 

Factor X has a high loading on Political 
Preference which, due to the scoring of this 
variable, indicates a tendency to conservative 
political beliefs. It is interesting to note that, 
contrary to common opinion, this is not neces- 
sarily accompanied by religious conservatism. 

Factor XI appears to be Dogmatic Authori- 
tarianism with high loadings on the inter- 
view variables of authoritarian and dogmatic. 
Again it should be noted that contrary to 
some current theories of authoritarianism, this 
factor does not involve any tendency toward 
conventional or conservative religious belief. 
These results suggest therefore, that an un- 
believer is just as likely to be authoritarian 
about his unbelief as a believer is to be 
authoritarian about the dogmas of his faith, 
which is entirely consistent with Rokeach’s 
(1960) reformulation of the problem of 
authoritarianism. 

Male factors. Factor I appears to involve 
Religious Behavior with its highest loadings 
on the church attendance, prayer life, and 
religious activity scales from the interview, 
and the public and private religious behavior 
from the questionnaire scales. There is a 
tendency for men scoring high on this scale 
to have spouses who view religion as impor- 
tant, and who are active in church activities. 
It should be noted that there is a fairly strong 
tendency for high scorers on this factor to be 
rated as “exploiting” their religion. 

Factor II is most appropriately called the 
Compassionate Samaritan factor, with high 
loadings on the interview scales of being a 
good Samaritan, being loving and compas- 
sionate, leading a good moral life, and being 
liked by the raters. This factor, of course, 
is virtually identical with Factor III in the 
female sample. 

Factor III is similar to Female Factor XI, 
and involves a tendency toward Dogmatic 
Authoritarianism. High positive loadings were 
obtained on the interview scales of authori- 
tarian and dogmatic and a high negative 
loading on humbleness. 

Factor IV involves having a spouse who 
had good relations with religious parents. 

Factor V involves having good relations 
with a spouse who is herself religious. 

Factor VI, like Female Factor VI, involves 
Projective Test Religiosity. If reflected, it 
has high positive loadings on the number 
of religious themes given in response to the 
projective material and the estimate of re- 
ligious commitment based on these themes, 
and a moderate negative loading on the de- 
gree of defensiveness about religion aroused 
by the pictures. 

High scorers on Factor VII are men who 
had good relations with their own religious 
parents. 

Factor VIII has some similarities to Female 
Factor VII, and thus might be termed 
Tragedy and Suffering in Later Life since its 
highest loadings are on age and the amount 
of tragedy and suffering revealed in the inter- 
view. In contrast to the corresponding female 
sample, however, there was no strong tend- 
ency in the male sample for high scorers to 
have survived their spouse nor for them to 
display neurotic symptoms. 

Factor IX appears to be Loss of Faith, 
since its two highest loadings indicate that 
high scorers have changed their religious be- 
liefs toward a less religious position. Second- 
ary loadings indicate that high scorers are 
more pragmatic and theoretical, less dogmatic 
and less literal in their religious views and 
have more knowledge about religion. 

Factor X appears to involve Religious 
Belief since its highest loadings are on the 
questionnaire belief scales and the interview 
scale concerning belief in God. This factor 
is sharply different from Male Factor I 
which emphasized religious behavior with a 
minimum of religious belief. 

Factor XI involves Neuroticism. High 
scorers tend to be viewed by the interview 
raters as being neurotic and a “religious 
phony,” as exploiting their religion, as lack- 
ing humbleness, and as being disliked by the 
raters themselves. 

Factor XII might be termed Projective 
Religious Conflict with high loadings on 
the projective test scales involving religious 
conflict, defensiveness about religion, and 
number of hostile themes given. High scorers 
on this factor also revealed more than average 
conflict and doubt about religion in the inter- 
view, and tended to reject, in the question- 
naire, conventional beliefs about God, good 
versus evil, and the church. 

Factor XIII is similar to Female Factor X, 
involving Political Preference. Again, due to 
the scoring system of this variable, this 
factor involves primarily a tendency toward 
conservative political views. 

The results described above and presented 
in Tables A, B, and C,* provide considerable 
evidence of important sex differences in pat- 
terns of religiosity. For example, the first 
factor which emerged for both men and 
women (and which accounted for the lion’s 
share of variance in both factor solutions) 
was religious in content but with some very 
interesting differences. In the case of the 
females we would call it Factor I: Religious 
Belief and Behavior because it rather neatly 
integrates belief variables from the question- 
naire, projective test, as well as interview, 
together with behavior variables from both 
questionnaire and interview. Thus with our 
female, the deeper and more intense her 
religious beliefs and convictions are, the more 
apt she is to behave and do things in her 
everyday life consistent with these beliefs. 
And, likewise, the more she participates be- 
haviorally in religious activities the greater 
the likelihood that her beliefs and convictions 
will be strengthened. 

However with our males the picture is 
somewhat different. The most accurate title 
or description we can give to their primary 
Factor I is Religious Behavior which is char- 
acterized by intense “activity” (high attend- 
ance, holding a church job, frequent prayer, 
etc.) but there is a greater degree of doubt, 
fuzziness, and ambiguity in the “belief area” 
than for women, about the nature of God, 
the validity and accuracy of the scriptures, 
immortality, etc. In fact, for men we find 
another somewhat distant and independent 
Factor X which we would title Religious 
Belief. What this means, apparently is that 
with men, some can be very high in their 
religious activity, yet at the same time come 
close to “failing” a belief test. And contrari- 
wise, some men can be relatively inactive yet 
at the belief level be deeply “religious” (for 
example, have a completely committed belief 
in God’s existence, in a life after death, 
and/or in the validity and accuracy of the 
scriptures, etc.). Of course there are many 
men high in religious activity as well as 
belief, or low in both—but, overall, in com- 
parison with the females, belief and behavior 
for men, in the religious area, are not in- 
extricably linked, or at least not to the same 
degree. 

Religious commitment factors emerge from 
the projective test material for both men and 
women. Since these factors are relatively un- 
related to each other (i.e., how you score on 
one is to some extent unrelated to how you 
will score on another factor) this could 
mean that this is tapping an “unconscious” 
kind of religious variable. However, further 
research will be required before one can 
conjecture more on the exact nature of this 
factor. 

With the males one finds another interest- 
ing factor, Factor IX: Loss of Faith, which 
is not present in the female sample. This sug- 
gests that with at least some of the males 
there exists a certain pattern of becoming 
“unconverted” or rejecting the religion of 
one’s youth. This is characterized by a tend- 
ency to view one’s religion from a “theoretical 
or philosophical point of view” rather than 
by “faith.” This coincides with doubt and 
inner conflict, considerable guilt, and an in- 
creasingly rejecting attitude toward a “literal 
interpretation” of the scriptures. Finally 
there is a pragmatic demand for tangible 
absolute proof for the religionists’ claims, 
which when not met leads to further estrange- 
ment and loss of faith but not without some 
inner turmoil. 

However, while a “loss of faith” factor is 
absent in the female sample, one finds for 
them two very interesting other factors, not 
present with the males. The first is Factor 
IX: The Religious Hypocrite. And while this 
is not a major religious dimension among 
women it does suggest that there are a small 
cluster who apparently are insincere and 
“two-faced” about their religiousness and 
who, to some extent, exploit religion for their 
own purposes. We might infer that possibly 
men do not have to be hypocrites (at least 
consciously)—if they do not like their re- 
ligion they just become inactive or leave it. 
The other female factor is Factor IV: Pro- 
jected Guilt. This possibly suggests that 
“guilt” has a greater psychic impact with 
women than with men, or that women tend 
to internalize the values and conscience of 
their religious culture with greater effective- 
ness. Associated with feelings of guilt for 
women are tendencies to accept the scriptures 
literally and by faith, and to be rejecting of 
a philosophical, theoretical, or pragmatic view 
of religion which might allow one to ra- 
tionalize their “sins.” There are also some 
evidences that this “Factor IV woman” views 
herself as not living up to the tenets of her 
religion as she should. 

For both men and women there are factors 
related to political preference (roughly along 
a liberalism-conservatism continuum) which 
appear not to be related to religiosity. There 
are also factors (for both men and women) 
which might be called Dogmatic Authori- 
tarianism which, again, are not related to 
whether one is religious or not. 

In both samples, we find that the factors 
in the middle group (IV, V, and VII for 
men and II, V, and VIII for women) have 
in common something to do with family rela- 
tionships and the religiosity of other family 
members. These factors for the most part are 
unrelated to the person’s own religiousness in 
any of its senses. An analysis of their content 
and relationships suggests the following: (a) 
The degree of religiousness in one’s parents 
was significantly related to the existence of 
better family relationships. This was particu- 
larly true in the case where the father was 
religious. The relationship most benefited by 
his being religious was with his daughter, 
though his sons also benefited to a lesser 
degree. The father’s religiousness was also 
related to better husband-wife relationships. 
(b) It was found that the religiousness of 
the mother had a greater influence in another 
area (though it also contributed to better 
relationships with the offspring) and that was 
in increasing the likelihood that her children 
(especially her son) would also be religious. 
However, while the relationship between the 
religiousness of the parents and their off- 
spring was positive and significant it was 
still not very great and, in fact, considering 
the potential influence parents have on their 
children, it was remarkably low (with r’s 
primarily in the .20s). The relationship be- 
tween the “religiosity” of husbands and wives 
produced correlations in the .50s suggesting 
a moderate correspondence as might be 
expected. 

Thus, when the two primarily “religious” 
factors for men (for example, I: Religious 
Behavior, and X: Religious Belief) are cor- 
related with such variables as “being a good 
Samaritan,” “having love and compassion for 
one’s fellow man,” and being “humble,” cer- 
tainly cardinal virtues in the Christian ethic 
(to which at least 85% of the sample sub- 
scribe) we find almost no relationship. In 
the case of our female sample the relationship 
is very slight with r’s of .21, .20, and .15 
between these variables and the main female 
religiosity factor. What this means essentially 
is that the “irreligious” in our sample are 
nearly as frequently rated as being a good 
Samaritan, having love and compassion for 
their fellow man, and being humble as the 
most devout and religious of our group 
studied. Or to put it another way, there are 
a lot of devout, religious, churchgoing “non- 
Christians” in the sample studied if the 
Sermon on the Mount and the Four Gospels 
are considered relevant to Christian belief 
and practice. Thus it would appear that the 
churches (primarily the Christian denomina- 
tions) while being able to have considerable 
impact on their membership, seem limited to 
inducing frequent prayer, high attendance, 
the giving of money, and concern about per- 
sonal salvation, but somehow have appar- 
ently failed to induce much sense of responsi- 
bility toward one’s fellow man. However, one 
might argue that the behavior of the irre- 
ligious (i.e., nonattenders, noncontributors, 
nonbelievers) may in many indirect and 
subtle ways still be influenced and impacted 
far more than one realizes by the Judeo- 
Christian “folk culture” in America. 

The question should be raised as to what 
extent one might generalize these findings, 
since the Salt Lake population with its many 
Mormons is not representative of the country 
as a whole with respect to religious prefer- 
ence. However, while Mormon theology has 
some relatively unique aspects, Mormon 
values correspond fairly closely to the 
“Protestant ethic” which is still predominant 
in the United States. As another approach 
to this problem a separate factor analysis 
of all variables was run for the non-Mormon 
sample, with factor structure emerging which 
was strikingly similar to the total sample for 
10 mutual factors and with fair similarity 
for 4 other pairs (not only on which variables 
loaded on which factor, but on the actual 
size of the loadings). However, the non- 
Mormon group produced one extra factor 
(VIII) which did not show up in the total 
sample which had to do with being a tolerant 
open-minded female who lives a good moral 
life (this was not related to being religious 
in the conventional sense). In a secondary 
way these findings also suggest that these 
factors may have a certain universality across 
religions and even, possibly, including persons 
without specific religious preference. 

In conclusion, it would be important for 
the reader not to overgeneralize these findings 
since the Salt Lake population is not com- 
pletely representative of the country as a 
whole with respect to religion. However, one 
should not undergeneralize these results either 
because, as noted, these findings, if anything, 
come through with even sharper clarity and 
consistency for the non-Mormon sample, thus 
making it difficult to conclude that they were 
only due to sample peculiarities or bias. 

With the sample limitation noted, this 
study has suggested that religiosity or re- 
ligious commitment is no simple, single 
dimension, but is factorially extremely com- 
plex, or more simply, there are many ways 
to be religious and to express this behav- 
iorally. 

The differences in the way men versus 
women approach religion and integrate it in 
their lives were marked. These sex differences,
it should be noted, are much more than mean 
differences in variables, but involve quite 
divergent patterning of correlations and fac- 
tors. Some commonly held stereotypes such 
as “religious people are more authoritarian 
than nonreligious” did not hold up. Additionally,
it was found that the three quite different
measures of overall religious commitment
(i.e., projective test, coded depth
interview, and questionnaire) had intercorrelations
averaging .66, suggesting stability and
a kind of cross-method validity.

Ss (N = 104) who scored either high or low in test anxiety were assigned to
groups to discuss a pending classroom examination. Each group comprised 2
high- and 2 low-anxious members and 1 moderately anxious member who was a
leader confederate. By prearrangement, the leader fostered task-irrelevant
commentary in ½ of the groups and task-relevant in the remaining ½. Ss’
changes in level of aspiration and their discrepancies between goals and
achievement on examinations clearly supported the assumption that
task-irrelevant responses mediate test anxiety. These results, together with
findings from a pegboard performance, supported both generality and
specificity hypotheses concerning the nature of test anxiety.


In the present study, two individuals who
were high in test anxiety and two who were
low in test anxiety joined with a moderately
anxious leader to discuss a forthcoming classroom
examination. The discussion leader fostered
either task-irrelevant or task-relevant
commentary. The consequences of such discussion
were studied by measuring changes in
the subjects’ level of aspiration on the classroom
examination and also on an unrelated
performance task.

This study was designed to examine the assumption
that the effects of anxiety on behavior
result from the arousal of task-irrelevant
response tendencies in the subject. In
addition, it was hoped that the results would
help to clarify further the question of whether
anxiety is a generalized state which affects the
subject’s behavior in an overall way, or is specific
to a particular type of stimulus situation.

Empirical studies of the effects of anxiety
on behavior have generally taken one of two
forms which have led to both overlapping and
disparate findings. In their summaries of the
literature in which subjects were selected for
high and low scores on a number of different
anxiety questionnaires, Sarason, Davidson,
Lighthall, Waite, and Ruebush (1960) and
Sarason (1960) have pointed out that differences
in performance of high- and low-anxious
subjects are consistent on a variety of tasks.
Similarly, attempts to directly manipulate experimental
conditions in such a way as to
arouse varying levels of anxiety (for example,
using “ego-threatening” instructions or “failing”
the subject with respect to a desired performance
level) have also resulted in systematic
differences in performance (Iverson &
Reuder, 1956). These two approaches have
been combined by various investigators, for
example, in earlier work by Mandler and
Sarason (1952) and more recently by Sarason
and Palola (1960). In ascertaining the impact
of a number of situational variables such as
type and difficulty of task, they have produced
similar effects with measured as compared
with manipulated anxiety and also
found significant interactions.


1 This study is partially based on a dissertation
submitted by the senior author to the faculty of the
Graduate School of Adelphi University, in partial
fulfillment of the requirements for the degree of Doctor
of Philosophy.
2 Now at Post-Graduate Center for Mental Health,
New York City.

Following the lead of Malmo and Amsel 
(1948), and a more explicit formulation by 
Mandler and Sarason, most authors have at- 
tributed their findings on the interfering ef- 
fects of anxiety to the arousal of a tendency 
on the part of the subject to make task-irrele- 
vant responses. By directing attention to him- 
self and his subjective feelings, the high-anx- 
ious subject makes responses which interfere 
with efficient task performance. This assump- 
tion has been made by those who believe anx- 
iety to be a firmly established, deep-seated 
personality characteristic (May, 1950), by
those who view it as a motivational state sub- 
ject to arousal by external stimulation (Alpert 
& Haber, 1960; Mandler & Sarason, 1952), 
and by those who consider it as both (Cattell 
& Scheier, 1960). The assumption has also 
been supported by findings which indicate 
that subjects who score high on anxiety scales 
also reveal by means of other measures such 
tendencies as self-preoccupation, self-depreca- 
tion, and a perspective on achievement that 
would lead them to view tests and examina- 
tions as a form of personal threat (Sarason, 
1960).

Yet, as far as could be determined, there 
are no available studies wherein the task rele- 
vance or irrelevance of a response has itself 
been directly manipulated. The effect of such 
manipulation on subjects who scored either 
high or low on a test-anxiety questionnaire 
was a major focus of the present investigation. 

It was assumed that high or low test-anx- 
iety questionnaire scores would reflect high or 
low tendencies on the part of the subject to 
make task-irrelevant (or conversely, task-rele- 
vant) responses toward examinations. Thus, 
it was expected that high- and low-anxious 
subjects would react differently to a group 
discussion in which task-irrelevant aspects of 
a common classroom examination were salient 
than to a discussion which was task relevant
in its orientation.

A test-anxiety scale was chosen for study 
because, despite the relationships reported for 
more general scales, the most precise findings 
appeared to be those in which the content of 
the questionnaire items was highly specific to 
the type of behavioral situation studied (see 
summaries by Alpert & Haber, 1960; Sarason, 
1960). At the same time, the possibility that 
such scales also measure more general aspects 
of anxiety could not be ignored. Task rele- 
vance or irrelevance has routinely been ap- 
plied as an explanatory concept for either type 
of factor. For this reason, it was considered 
essential to include in the present study a 
nonexamination-type task which would serve 
as a basis for comparison with course-exami- 
nation oriented behavior. Performance of a 
nonexamination-type task would supposedly 
disclose effects of a general factor, but fail to 
show effects of a specific factor of test anxiety. 

Traditional studies of test anxiety have
been so designed that the task-relevant or 
task-irrelevant tendencies of the subject have 
necessarily functioned covertly. In order to 
make such responses more overt, a procedure 
was necessary wherein overt expression of 
such responses would be both natural and ap- 
propriate. A group discussion of a pending 
important classroom examination appeared to 
meet these requirements. Moreover, investiga- 
tions have repeatedly shown that attitudes, 
judgments, and motives are modifiable by such 
group participation (Cartwright & Zander, 
1960). This effect is particularly evident if 
the discussion leader is actively participatory 
(Kipnis, 1958; Levine & Butler, 1952; Pres- 
ton & Heintz, 1949) and if the group per- 
ceives itself as comprising a collective unit 
in sharing some common interest (Festinger, 
1953). Thus, it was assumed that if high and 
low test-anxious subjects truly differed in 
tendency to make task-irrelevant responses, 
they would be affected differently by active 
participation in a group discussion wherein 
a participatory leader reiterated such re- 
sponses. With a discussion explicitly con- 
cerned with a pending examination, it was 
expected that a task-relevant discussion would 
enhance task-relevant tendencies of low-anx- 
ious subjects. Likewise, a task-irrelevant dis- 
cussion would be expected to enhance task- 
irrelevant tendencies for high-anxious sub- 
jects. However, if the relationship between 
test anxiety and task-irrelevant responding is 
not as posited, then the effect of task-irrele- 
vant and task-relevant discussion would be 
either independent of the level of measured 
test anxiety or show a different type of functional
relationship.

Results from a number of studies suggested
that goal-setting behavior would provide a
more sensitive index of the effects of anxiety
and task relevance than would a gross measure
of overt performance. Trapp and Kausler
(1958), Sarason, Mandler, and Craighill
(1952), and Lewin, Dembo, Festinger, and
Sears (1944), all present findings which suggest
a greater sensitivity for the goal-setting
measure. Sherif (1956, pp. 202, 254) has
noted that goal-setting behavior can be altered
by group experience. In addition, goal-setting
behavior provided a common measure
on which comparisons of reaction to otherwise
unlike tasks could readily be made.

It was thus specifically hypothesized that:
subjects with high test-anxiety scores would
lower their goals after participating in a discussion
group characterized by the insertion
of task-irrelevant stimuli; such subjects would
raise their aspirations following a discussion
in which task-relevant stimuli were introduced;
and the goals of subjects with low
test-anxiety scores would be affected by the
group atmospheres in a similar manner, but
to a much lesser degree.


METHOD


By prescreening psychology classes on the Mandler-Sarason
Test Anxiety Questionnaire (TAQ), 52
high-anxious (scores 20-35) and 52 low-anxious
(scores 0-12) subjects were obtained. Thirteen more
students with scores in the middle of the TAQ were
used as leaders. The Sarason and Gordon (1953)
scoring system was used. All subjects and leaders
were women aged 17-25 and enrolled in undergraduate
psychology courses at either of two liberal arts
colleges in Metropolitan New York.


Groups


Subjects were assigned to 26 groups of five members
each. However, because one subject withdrew
from school before the data were complete, a final
analysis had to be based upon 24 groups. Each group
consisted of two high-anxious subjects, two low-anxious
subjects, and a leader, all from the same
[course] and section in psychology. Half of the groups
were assigned to a task-relevant (TR) condition and
half to a task-irrelevant (TI) condition. Each leader
served once in each type of group. All leaders were
instructed to promote a permissive atmosphere, to
encourage every subject to express opinions, and not
to impose their own opinions on the group.

TR leaders were instructed to introduce into the
group discussion a minimum of five task-relevant
statements, for example, “Oh sure, it may be a hard
[tricky] test, but look, what could the instructor
[...] on the test that you wouldn't know something
about?” These statements were to be distributed at
intervals throughout the discussion period.

Likewise, the TI leader was to introduce a minimum
of five such task-irrelevant statements as, “The
trouble is my feelings always interfere with my taking
tests.” All TR and TI statements were contained
in a standardized list prepared by the experimenter
and memorized in advance by each leader.3 By conducting
the group discussions at a time when important
examinations were imminent, the comments of
the leaders were expected to have an optimal effect
in eliciting similar TR (or TI) comments from the
subjects. Tape recordings of the discussions substantiated
this assumption.


3 The complete list of comments and verbatim instructions
are given in Beckwith (1959).


Measures of Level of Aspiration 


Subjects were asked to make two estimates (hopes 
and goals) of their future performance on each of 
two familiar tasks. Only the goal data were ana- 
lyzed, the hope measures serving as a control on the 
type of level of aspiration (LOA) elicited (Lewin 
et al., 1944). 

Two of the LOA measures were estimates of per- 
formance on a forthcoming major objective exami- 
nation (either a midterm or a final). One of these 
estimates was a free measure in the sense that the 
experimenter indicated the maximum possible score 
on the instructor’s examination and the subject wrote 
down her LOA. A second estimate of this same test 
performance was made on a self-anchored rating 
scale. The subject was asked to fill in a scaled con- 
tinuum with the names of courses in which she had 
in the past performed “best,” “average,” and “poor- 
est.” She was then asked to check this scale at a 
point which corresponded to the performance ex- 
pected on the forthcoming test. 

The third LOA was obtained from a nontest-type 
of task—a pegboard devised by Jacobs (1955). This 
measure served to indicate the degree to which test 
anxiety (and task relevance) would generalize to a 
nonexamination-type situation. To produce famili- 
arity, a practice trial on the pegboard was given be- 
fore any LOA measures for this task were obtained. 


Control Measures 


To preclude group differences based on friendship
factors, an acquaintanceship rating was obtained
prior to administration of the experimental procedure.
The results indicated that within any one
group neither leader nor subjects were more than
“slightly” acquainted.

A postdiscussion questionnaire was administered to 
check on the leaders’ effectiveness in role playing, to 
determine whether individual subjects suspected that 
a leader had been assigned by the experimenter, and 
to obtain subjects’ impressions of the group atmos- 
phere. Only one subject suspected that the leader was 
an instructed confederate. Questionnaire results gen- 
erally verified the assumption that the subjects identi- 
fied the confederate leader as a natural leader of the 
group and that the group atmosphere (TR or TI) 
conformed to experimental design. 


Procedure 


Subjects, identified only by code letters placed be- 
fore them, were seated around a table before a one- 
way vision screen (of which they were aware). A 
tape recorder was visibly present. After obtaining 
acquaintanceship ratings, the experimenter explained 
that the group task was to discuss a specified forthcoming
psychology examination and to reach some
group conclusions as to how this instructor’s examination
(and particularly his marking system) affected
them and their feelings. The three types of
LOA (free estimate, self-anchored rating, and pegboard)
were then obtained individually (and privately)
in writing. The experimenter, having collected
these papers, left the room. The discussion proceeded
for 20 minutes during which time the instructed
leader inserted her TR or TI comments and
attempted to behave in a manner appropriate to the
desired relevance-type for that group.

Following the discussion, the experimenter returned 
and distributed to each subject a copy of the “aver- 
age goals” for the group. These “averages” were 
actually the subjects’ own LOAs slightly disguised 
by the addition or subtraction of 2 points (counter- 
balanced among all tasks and conditions). Subjects 
were then asked to reconsider their previous esti- 
mates and to provide again a free estimate and self- 
anchored rating on the examination and a goal for 
pegboard performance. A third trial of the pegboard
(tested immediately) and administration of the postdiscussion
questionnaire completed the procedure. Before
leaving, all subjects were given an explanation
of the study. Actual examination scores were later 
obtained from the instructors in the subjects’ courses. 


Scores 


Raw-score LOAs and actual performance scores 
were obtained for each subject on both the course 
examination and the pegboard. Two types of con- 
verted scores were also obtained for each subject. 
Change scores which indicated amount and direction 
of shift in LOA and attainment-discrepancy scores 
which, by comparing actual achievement with LOA, 
were intended to reflect the subject’s “realism” 
(Lewin et al., 1944) were computed.


J. BECKWITH, M. A. IVERSON, AND M. E. REUDER


Three change scores were obtained—one for free- 
estimate LOA, one for self-anchored LOA, and one 
for pegboard LOA. Because the free-estimate LOAs 
were obtained for examinations which varied in 
length, these change scores were based on a differ- 
ence in percentage of total test items. All change 
scores were computed by subtracting initial LOA 
from postdiscussion LOA retaining algebraic signs to 
indicate direction of change. 

Attainment-discrepancy scores were obtained by 
subtracting the postdiscussion LOA from the actual 
measured performance for both the free estimate of 
the course examination (in percentage units) and 
for the pegboard task. Again, algebraic signs were 
retained. 
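
The two converted scores described above reduce to signed differences. The following is a minimal Python sketch of that arithmetic; the function names and the sample subject are illustrative assumptions and are not taken from the study's records.

```python
def change_score(initial_loa, post_loa, total_items=None):
    """Shift in level of aspiration: postdiscussion LOA minus initial LOA.

    The algebraic sign is retained, so a negative value means the goal was
    lowered. When total_items is given, both LOAs are first expressed as a
    percentage of total test items, as was done for the free estimates.
    """
    if total_items is not None:
        initial_loa = 100.0 * initial_loa / total_items
        post_loa = 100.0 * post_loa / total_items
    return post_loa - initial_loa


def attainment_discrepancy(performance, post_loa):
    """Actual performance minus postdiscussion LOA, sign retained.

    A negative value means the goal exceeded the achievement, that is,
    the subject overestimated her performance.
    """
    return performance - post_loa


# Hypothetical subject: her goal moves from 40 to 44 items on a 50-item
# examination (80% to 88%), and she later scores 82%.
shift = change_score(40, 44, total_items=50)
gap = attainment_discrepancy(82.0, 88.0)
print(shift, gap)
```

Both scores share one sign convention, so a lowered goal and an overestimated performance each appear as negative numbers.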


RESULTS 


An analysis of variance of the test-anxiety 
and task-relevance variables was carried out
for each of the raw scores, the change scores, 
and for each of the attainment-discrepancy 
scores. Variance due to replications was also 
analyzed in order not to confound the effects 
of the task-relevance variable with variance 
attributable to individual differences in effec- 
tiveness in role playing of the various leaders. 
In some of the analyses (Tables 1 and 2), 
replications were significant whereas in others 
they were not. This fact appeared to warrant 
their isolation and removal as a variable in 
evaluating variance due to main effects and 
interactions. 


TABLE 1


SUMMARY OF ANALYSIS OF VARIANCE OF CHANGE SCORES FOR LEVELS OF ASPIRATION
(POST-GROUP-DISCUSSION MINUS INITIAL LEVEL)


                                            Course examination            Pegboard task (a)
                                     Free estimate       Self-anchored     Free estimate
Source                               df     MS      F         MS     F      df     MS     F
Test anxiety (A)                      1     6.7     …         …      …      …      …      …
Task relevance (type of group) (B)    1   213.4   12.60***   5.34    …      …      …      …
Replication (C)                      11    33.3    2.00*      …      …      …     167    .36
A × B                                 1    66.8    3.94      1.35   6.1**    1    722    .38
A × C                                11    23.7    1.40       .31   1.4     10   1159    .60
B × C                                11    32.1    1.88       .22   1.0     10   3030   1.58
A × B × C                             …     …       …         …      …      …      …      …
Error (within cells)                  …     …                 .22           …    1918

a df on pegboard differed because scores were not obtained for the first two groups.
** p = .025.
*** p = .001.


TABLE 2


SUMMARY OF ANALYSIS OF VARIANCE OF ATTAINMENT-DISCREPANCY SCORES
(ACHIEVEMENT MINUS LOA)


                                      Course examination
                                       (free estimate)           Pegboard (a)
Source                               df      MS        F        df      MS      F
Test anxiety (A)                      1      3.58     .02        1       27     …
Task relevance (type of group) (B)    1       .06     .00        1      128     …
Replication (C)                      11    915.20    5.80***    10     2447    1.14
A × B                                 1   1227.40    7.80*       1    16407    8.01*
A × C                                11    156.20     .99       10     1078     .53
B × C                                11     33.70     .21       10     1818     .89
A × B × C                            11    455.10    2.90**     10     1384     .68
Error (within cells)                 48    158.00               44     2048
Total                                95                         87

a df on pegboard differed because scores were not obtained for the first two groups.
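
The F values for the course examination in Table 2 can be checked against the mean squares printed in the same table. The sketch below assumes that every effect was tested against the within-cells error term, which is an inference from the printed numbers rather than a statement in the text; the dictionary layout and function name are illustrative.

```python
# Mean squares for the course examination (free estimate), as printed in Table 2.
ERROR_MS = 158.00
EFFECT_MS = {
    "Replication (C)": 915.20,
    "A x B": 1227.40,
    "A x C": 156.20,
    "A x B x C": 455.10,
}


def f_ratio(effect_ms, error_ms):
    """F statistic: effect mean square divided by error mean square."""
    return effect_ms / error_ms


for source, ms in EFFECT_MS.items():
    print(f"{source}: F = {f_ratio(ms, ERROR_MS):.2f}")
```

Under that assumption the recomputed ratios fall within a few hundredths of the printed F values, a gap that rounding of the printed error mean square would be enough to produce.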
Tests for homogeneity of variance yielded
statistically significant chi-square values for
most sets of measures. When conventional
transformations failed to eliminate this heterogeneity,
the original untransformed values
were analyzed and extreme caution used in
interpreting marginal results.


Raw Scores


The advantage of using change and discrepancy
scores was borne out by a comparison of
the absolute raw-score measures. As can be
seen in Tables 3 and 4, there were differences
among the groups in initial LOA before the
experimental procedure was begun. Although
these differences were not statistically significant
in terms of the main effects, an analysis
of variance suggested that some replication
groups had higher LOAs than others. Because
subjects were randomly assigned to groups
(within limits of the test-anxiety variable)
and groups randomly assigned to conditions,
these differences were attributed to sampling
error. However, they were great enough to
mask changes in behavior when LOAs obtained
after the group discussion were analyzed
independently. The final raw-score LOAs
for course examination showed significant differences
only for the test-anxiety variable.
High-anxious subjects set lower goals than
did low-anxious subjects.


TABLE 3


MEANS AND SDs OF COURSE EXAMINATION LOAs AND PERFORMANCE IN UNITS OF
PERCENTAGE OF TOTAL POSSIBLE TEST ITEMS CORRECT
(N = 24 PER GROUP)


                       Examination level of aspiration
                                          Goal after group         Examination
                     Initial goal            discussion            performance
Type of group      High-      Low-        High-      Low-        High-      Low-
                  anxious    anxious     anxious    anxious     anxious    anxious
Task irrelevant
  M                77.42      79.71       75.38      79.71       65.08      62.61
  SD                1.52       1.43        1.83       1.53        3.05       2.98
Task relevant
  M                75.21      80.54       77.67      81.38       60.17      71.83
  SD                 …         2.21        1.93       2.23        3.07       3.09


TABLE 4


MEANS AND SDs OF PEGBOARD LOAs AND PERFORMANCE IN RAW-SCORE UNITS
(N = 22 PER GROUP)


                        Pegboard level of aspiration
                                          Goal after group          Pegboard
                     Initial goal            discussion            performance
Type of group      High-      Low-        High-      Low-        High-      Low-
                  anxious    anxious     anxious    anxious     anxious    anxious
Task irrelevant
  M               240.59     248.86      258.55     288.77      305.55     297.36
  SD               11.46      14.83       12.84      11.00        7.03       6.88
Task relevant
  M               249.05     245.32      278.64     265.14      295.77     298.73
  SD                9.64       9.16       13.05      11.22        6.24       8.80


Change Scores


The effects of the group experience became
apparent only when the change from initial to
post-group-experience LOA for course examination
was examined. The apparent difference
in the interaction patterns, which was evident
from examining the means in Table 3, was
shown to be significant by the analysis of
variance of the change scores (see Table 1).
In addition, the main effect of task relevance
(which was not obtained in the raw-score
analysis) was significant at the .001 level.
These results clearly supported the main experimental
hypotheses.

Since both the free-estimate measures and
the self-anchored ratings of examination performance
showed almost identical patterns in
the analyses of variance (Table 1) and in the
pattern of mean distribution, only the former
will be discussed further.


FIG. 1. Mean change scores (examination) of high-
and low-anxious subjects after task-relevant or task-
irrelevant discussion. (Ordinate: mean change score;
abscissa: task, irrelevant or relevant; curves: high
anxious, low anxious.)


The absence of any significant change-score 
effects on the pegboard LOA scores indicated 
that the anxiety and relevance variables be- 
ing studied were specific to examination-type 
tasks and did not generalize to a nonexamina- 
tion-type competitive task. 

Figure 1 also indicates that the direction of 
change for the various experimental groups 
supported the hypotheses. In comparison with 
initial, prediscussion level, both high- and 
low-anxious subjects raised their LOA after 
exposure to task-relevant discussions, high- 
anxious being more affected. In contrast, 
task-irrelevant activity noticeably lowered the 
LOAs of high-anxious subjects from initial 
level whereas low-anxious individuals were 
relatively unaffected by this experience. As 
indicated in the analysis of variance, test 
anxiety alone did not account for any significant
main effects, but, in combination with
task relevance, produced an interaction which
was statistically significant (p < .05 on both
measures of examination LOAs).

The pegboard change scores showed simply 
that practically all subjects raised their LOA 
regardless of measured test anxiety or the 
task-relevance condition (see Table 4). 
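
The direction of these changes can be recovered from the cell means in Tables 3 and 4. The sketch below assumes that each cell mean before and after discussion is based on the same subjects, so that the difference of means equals the mean change; the variable and function names are illustrative.

```python
# Cell means (initial goal, goal after discussion) copied from Tables 3 and 4.
EXAM = {
    ("TI", "high"): (77.42, 75.38),
    ("TI", "low"): (79.71, 79.71),
    ("TR", "high"): (75.21, 77.67),
    ("TR", "low"): (80.54, 81.38),
}
PEGBOARD = {
    ("TI", "high"): (240.59, 258.55),
    ("TI", "low"): (248.86, 288.77),
    ("TR", "high"): (249.05, 278.64),
    ("TR", "low"): (245.32, 265.14),
}


def mean_changes(cells):
    """Postdiscussion mean minus initial mean for every cell."""
    return {key: round(post - initial, 2) for key, (initial, post) in cells.items()}


def interaction_contrast(changes):
    """TR-minus-TI difference for high-anxious minus that for low-anxious."""
    high = changes[("TR", "high")] - changes[("TI", "high")]
    low = changes[("TR", "low")] - changes[("TI", "low")]
    return round(high - low, 2)


exam_changes = mean_changes(EXAM)
pegboard_changes = mean_changes(PEGBOARD)
print(exam_changes)
print(interaction_contrast(exam_changes))
print(all(change > 0 for change in pegboard_changes.values()))
```

On the examination the high-anxious means fall after task-irrelevant discussion and rise after task-relevant discussion, while every pegboard cell rises, which matches the pattern reported in the text.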


Attainment-Discrepancy Scores 


There was no statistically significant difference
between groups in actual achievement on
either the course examination or the pegboard
performance. However, analysis of the attainment-discrepancy
scores indicated an effect of
the experimental conditions on both the degree
and direction of the realism of the subjects.
As can be seen in Table 2, LOAs for
both the psychology course examination and
the pegboard task showed a significant interaction
between the major variables, but no
significant main effects. A comparison of the
means (Figures 2 and 3) showed that on both
tasks, with task-relevant discussion, high-anxious
subjects were inclined to overestimate
their performance (large difference) much
more than the low-anxious subjects. Conversely,
the task-irrelevant experience resulted
in a greater tendency to overestimate for the
low-anxious subjects.


FIG. 2. Mean achievement discrepancy scores on
examinations for high- and low-anxious subjects after
task-relevant or task-irrelevant discussion. (Positive
score indicates that the subject underestimated his
performance.)

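
The crossover described in this paragraph can be illustrated with the cell means of Tables 3 and 4. The sketch assumes that mean performance minus mean postdiscussion goal approximates the mean attainment-discrepancy score in each cell; the published figures were presumably computed from individual scores, so this is an approximation, and all names are illustrative.

```python
# (goal after discussion, actual performance) cell means from Tables 3 and 4.
EXAM = {
    ("TI", "high"): (75.38, 65.08),
    ("TI", "low"): (79.71, 62.61),
    ("TR", "high"): (77.67, 60.17),
    ("TR", "low"): (81.38, 71.83),
}
PEGBOARD = {
    ("TI", "high"): (258.55, 305.55),
    ("TI", "low"): (288.77, 297.36),
    ("TR", "high"): (278.64, 295.77),
    ("TR", "low"): (265.14, 298.73),
}


def mean_discrepancies(cells):
    """Mean performance minus mean goal; negative means overestimation."""
    return {key: round(perf - goal, 2) for key, (goal, perf) in cells.items()}


def shows_crossover(disc):
    """True when high-anxious cells lie lower under TR and low-anxious lower under TI."""
    return (disc[("TR", "high")] < disc[("TR", "low")]
            and disc[("TI", "low")] < disc[("TI", "high")])


exam_disc = mean_discrepancies(EXAM)
pegboard_disc = mean_discrepancies(PEGBOARD)
print(exam_disc, shows_crossover(exam_disc))
print(pegboard_disc, shows_crossover(pegboard_disc))
```

The same crossover appears on both tasks even though the examination discrepancies are all negative and the pegboard discrepancies are all positive.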

In contrast with this consistency in the
pattern of the interaction, all subjects tended
to overestimate their examination performance
and underestimate their pegboard scores.
This finding suggested that no particular type
of realism (as reflected in a consistent tendency
to over- or underestimate performance)
was related to either test anxiety or task relevance.
Instead, type of realism appeared to
be specific to the nature of the task.

Further analyses were made to take into
account the fact that an individual’s feelings
of “success” or “failure” with respect to a
subsequent performance relate to the absolute
amount by which his achievement departs
from his LOA (Lewin et al., 1944). When
differences were treated algebraically in computing
achievement-discrepancy scores, a subject
with a high positive discrepancy could
cancel the effect of another subject with a
very low negative discrepancy in determination
of the group mean. Yet, an individual’s
personal reactions of success or failure are in
part a function of how close he came to
achieving his goal. For this reason, mean accuracy
scores were also determined. These
were the same as the attainment-discrepancy
scores with algebraic sign ignored. These additional
measures were examined to clarify
the nature of the interactions obtained in the
previous analyses of variance.


FIG. 3. Mean achievement discrepancy scores on
pegboard for high- and low-anxious subjects after
task-relevant or task-irrelevant discussion. (Negative
score indicates that the subject overestimated his
performance.)

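
The cancellation problem that motivates the accuracy scores can be shown with a small Python sketch. The four discrepancy values are invented for illustration and the function names are assumptions.

```python
from statistics import mean


def mean_discrepancy(discrepancies):
    """Group mean of signed attainment-discrepancy scores."""
    return mean(discrepancies)


def mean_accuracy(discrepancies):
    """Group mean of the same scores with algebraic sign ignored."""
    return mean([abs(d) for d in discrepancies])


# Hypothetical group: two subjects miss their goals by 12 points in opposite
# directions, and two miss by 3 points in opposite directions.
scores = [12.0, -12.0, 3.0, -3.0]
print(mean_discrepancy(scores))
print(mean_accuracy(scores))
```

The signed mean for this invented group is zero although no subject met her goal exactly, whereas the accuracy score records the average size of the miss.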

Figure 4 shows that these effects were not
the same for the two tasks and added support
to the relationships observed with the algebraic
scores. An analysis of variance of the
examination accuracy scores showed an interaction
effect between task relevance and test
anxiety which was significant at the .07 level.
Furthermore, a highly significant (p < .01)
triple interaction indicated that these effects
were significant for some leaders’ groups, but
not for others. Because of the greater variability
of the pegboard accuracy data, none
of the differences was statistically significant
(p’s > .20). The latter results, when considered
as trends, were consistent with the previously
noted interaction in achievement-discrepancy
scores on the pegboard.


FIG. 4. Mean accuracy scores (absolute discrepancy)
on course examination. (Low scores indicate
high accuracy.)



Control Measures


The significant findings which were obtained
on the LOA measures were also confirmed
by the several control measures and
by the qualitative tape-recorded remarks of
the different subjects. When confederate leaders
introduced TI comments into the group
discussion, other members tended to join in
by elaborating with remarks about their own
examination fears and self-preoccupations. A
similar pattern was found with task-relevant
confederates, whose group members joined in
with their own task-relevant comments.* Although
the exact number of such verbal elaborations
was not tabulated, impressions gleaned
from listening to the tapes were consistent
with the subjects’ written ratings. Subjects in
the TI groups rated the group atmosphere
(but not the leader) as relatively unfavorable
to successful examination performance
and to the development of feelings of self-confidence.
TR group members tended to rate
their groups in the opposite direction. These
ratings were statistically significant at the .01
level.


DISCUSSION


Equivalence of Task-Irrelevance and 
Test Anxiety 


The obtained results support the experimental
hypotheses and supply strong evidence
in favor of the assumptions made by previous
investigators. When task relevance is deliberately
manipulated, level of aspiration
changes in the same interactive way with respect
to test anxiety as does overt performance
when instructions or success-failure experiences
are manipulated prior to an experimental
task. The patterns of the obtained interactions
are almost duplicates of those reported by
Mandler and Sarason (1952, 1953), Sarason
and Mandler (1952), Sarason et al. (1952),
and Sarason and Palola (1960) using such
procedures. The findings are also in accord
with Cattell’s view that anxiety is a basic
personality dimension which, although affected
and modified by specific environmental
conditions, is not fundamentally altered by
individual experience (Cattell & Scheier,
1960).


* These qualitative differences in behavior also provide
experimental support for a “behavioral contagion”
effect of group leaders which is described by
Lippitt, Polansky, Redl, and Rosen (1958).

The same results comprise evidence contra- 
dictory to the viewpoint expressed by May 
(1950) as to both the nature of anxiety and 
its effects on behavior. According to May, 
high-anxious subjects should tend to become 
rigid and detached in response to external 
arousing stimuli whereas the high-anxious 
subjects in the present study showed the 
greatest amount of change. 

The results, however, also suggest that the 
two major variables of the current study are 
not completely equivalent dimensions. If the 
test-anxiety variable were solely a measure of 
the subject’s tendency to make task-irrelevant 
responses, one would expect the subject’s 
measured tendencies to be intensified by the 
group experience and such effects to be re- 
flected in LOA changes. Thus, high-anxious 
subjects would be expected to lower LOA after 
a task-irrelevant discussion and low-anxious 
subjects to raise LOA after a task-relevant 
discussion. The results in Figure 1 support 
this prediction fully.

However, one could also reason that the 
task-irrelevant tendencies of high-anxious sub- 
jects would be counterbalanced by the task- 
relevant tendencies elicited in a TR discus- 
sion and result in no overall LOA change. 
Similarly, no change would be expected for 
low-anxious subjects under the TI condition. 
This prediction was verified for low-anxious 
subjects, but not for subjects with high anxi- 
ety. The latter individuals, instead of holding 
to their initial goals, raised their LOA more 
than any other group as a result of a task- 
relevant group experience (Figure 1). With 
further consideration, this finding seems to
relate to results obtained by Alpert and Haber
(1960). They showed that a low score on a
scale which measures task irrelevance does not
necessarily require the subject to also possess
strong task-relevant tendencies. Using sepa- 
rate measuring scales, they found certain sub- 
jects to be low (or high) with respect to both 
tendencies whereas others showed the expected 
inversely related scores. 


Specific Effects of Test Anxiety 


The specificity of effects of different “types” 
of anxiety on different categories of behavior 
which is stressed by Alpert and Haber and 
others (Child, 1954; Sarason, 1960; Sarason 
et al., 1960) is also well supported by the 
present findings. The fact that levels of aspi- 
ration for classroom examination performance 
showed different changes for high and low test- 
anxious subjects after an experience which 
was centered around a discussion of classroom 
test-taking behavior is an internal validation 
of Mandler and Sarason’s questionnaire. The 
fact that for these same subjects, the experi- 
ence had no effect on their level of aspiration 
for the nontestlike, “game-type” pegboard 
task strongly supports the specificity hypothe- 
sis. 

A comparison of LOAs with actual achieve- 
ment on the two different tasks further clari- 
fies some of these specific effects. As can be 
seen in Figures 2 and 3, practically all sub- 
jects underestimated their examination per- 
formance (positive scores) and overestimated 
their pegboard achievement (negative scores). 
Furthermore, although the interactions be- 
tween task relevance and test anxiety are sta- 
tistically significant for both tasks (Table 2), 
the directions of these interactions are almost 
completely opposite. The accuracy scores show
a parallel difference (Figure 4).


High-anxious subjects after a TI discussion 
were both more accurate (Figure 4) and over- 
estimated their actual course examination per- 
formance to a lesser degree (Figure 2) than 
did high-anxious subjects who had had a TR 
discussion. For the low-anxious subjects the 
reverse tendency was true. Such subjects could 
be described as having had a group experience 
which was congruent with their anxiety tendencies,
i.e., high test anxiety (or strong task-irrelevant
tendencies) is paired with a task-irrelevant
discussion and low anxiety (task-relevant
tendencies) with a TR experience.
Thus, for the course examination, subjects
who had a group experience which was congruent
with their test-anxiety tendencies
showed greater realism than did those whose 
group experience was incongruent with their 
tendencies (i.e., high-anxious subjects with a 
TR experience and low-anxious subjects with 
a TI discussion). 

On the pegboard task, greatest accuracy and 
least tendency to underestimate performance 
(Figure 3) were related to an incongruous 
combination of conditions. High-anxious sub- 
jects with a TR experience and low-anxious 
subjects with a TI group discussion were the 
most realistic and accurate. Not only did the 
different conditions of task relevance affect
the groups of high- and low-anxious subjects 
differently, but the same subjects who showed 
facilitative effects on the one task showed 
opposite effects under the same conditions on 
the second task. 

The fact that significant interactions of the 
two anxiety dimensions were found for attain- 
ment-discrepancy scores on both tasks sup- 
ports the concept of anxiety as a generalized 
dimension which affects a range of different 
behavior. The fact that the direction of these 
effects was opposite for the two unlike tasks 
supports the view that the kind of effect re- 
lated to anxiety-arousing conditions will also 
be a function of the nature of the task, and
that anxiety is also highly specific in its impact
on behavior.

A limited parallel is suggested between 
these results and certain currently contro- 
versial aspects of psychotherapy. Task-irrele- 
vant tendencies are generally viewed as inter- 
fering with achievement. The process of air- 
ing such tendencies in a group discussion has 
some of the properties of catharsis. Catharsis, 
in turn, is expected to be followed by more 
productive responses. Conversely, a task-rele- 
vant discussion which ignores TI tendencies 
can be viewed as a reinforcement of positive 
response tendencies which are directly produc- 
tive and should thus enhance the subjects’ 
achievement. 

To the extent that this parallel is warranted, 
the results of the present investigation suggest 
that both the particular behavior under con- 
sideration and the subject’s own established 
response tendencies determine which type of 
experience will be most productive for a given 
subject in a given setting. Where the achievement
behavior desired was closely related in
content to the content area of task relevance,
for example, course examination and anxiety
connected therewith, a group experience which
was congruent with the subject’s established
tendencies was most effective. When the
achievement behavior was in a different content
area from the specific anxiety (for example,
the pegboard), an incongruent group
experience was most effective.


INFLUENCE OF MODELS’ REINFORCEMENT CONTINGENCIES
ON THE ACQUISITION OF IMITATIVE RESPONSES * 


ALBERT BANDURA
Stanford University 


In order to test the hypothesis that reinforcements administered to a model
influence the performance but not the acquisition of matching responses, groups
of children observed an aggressive film-mediated model either rewarded,
punished, or left without consequences. A postexposure test revealed that
response consequences to the model had produced differential amounts of
imitative behavior. Children in the model-punished condition performed significantly
fewer matching responses than children in both the model-rewarded
and the no-consequences groups. Children in all 3 treatment conditions were
then offered attractive reinforcers contingent on their reproducing the model’s
aggressive responses. The introduction of positive incentives completely wiped
out the previously observed performance differences, revealing an equivalent
amount of learning among children in the model-rewarded, model-punished,
and the no-consequences conditions.


It is widely assumed that the occurrence of
imitative or observational learning is con- 
tingent on the administration of reinforcing 
stimuli either to the model or to the observer. 
According to the theory propounded by Miller 
and Dollard (1941), for example, the necessary
conditions for learning through imitation
include a motivated subject who is positively 
reinforced for matching the rewarded behavior 
of a model during a series of initially random, 
trial-and-error responses. Since this concep- 
tualization of observational learning requires 
the subject to perform the imitative response 
before he can learn it, this theory evidently 
accounts more adequately for the emission of 
previously learned matching responses than
for their acquisition. 

Mowrer’s (1960) proprioceptive feedback 
theory similarly highlights the role of rein- 
forcement but, unlike Miller and Dollard who 
reduce imitation to a special case of instrumental
learning, Mowrer focuses on the classical
conditioning of positive and negative
emotions to matching response-correlated 
stimuli. Mowrer distinguishes two forms of 
imitative learning in terms of whether the ob- 
server is reinforced directly or vicariously. In 
the former case, the model performs a response
and simultaneously rewards the observer.
If the modeled responses are thus
paired repeatedly with positive reinforcement
they gradually acquire secondary reward
value. The observer can then administer positively
conditioned reinforcers to himself simply
by reproducing as closely as possible the
model’s positively valenced behavior. In the
second, or empathetic form of imitative learning,
the model not only exhibits the responses
but also experiences the reinforcing consequences.
It is assumed that the observer, in
turn, experiences empathetically both the response-correlated
stimuli and the response
consequences of the model’s behavior. As a result
of this higher-order vicarious conditioning,
the observer will be inclined to reproduce
the matching responses.

There is some recent evidence that imita- 
tive behavior can be enhanced by noncontin- 
gent social reinforcement from a model (Ban- 
dura & Huston, 1961), by response-contingent 
reinforcers administered to the model (Ban- 
dura, Ross, & Ross, 1963b; Walters, Leat, & 
Mezei, 1963), and by increasing the reinforc- 
ing value of matching responses per se through 
direct reinforcement of the participant ob- 
server (Baer & Sherman, 1964). Nevertheless, 
reinforcement theories of imitation fail to ex- 
plain the learning of matching responses when 
the observer does not perform the model’s re- 
sponses during the process of acquisition, and 
for which reinforcers are not delivered either 
to the model or to the observers (Bandura et
al., 1961, 1963a). 

The acquisition of imitative responses under 
the latter conditions appears to be accounted 
for more adequately by a contiguity theory 
of observational learning. According to the 
latter conceptualization (Bandura, in press; 
Sheffield, 1961), when an observer wit- 
nesses a model exhibit a sequence of responses 
the observer acquires, through contiguous as- 
sociation of sensory events, perceptual and 
symbolic responses possessing cue properties 
that are capable of eliciting, at some time 
after a demonstration, overt responses corresponding
to those that had been modeled.

Some suggestive evidence that the acquisition
of matching responses may take place
through contiguity, whereas reinforcements
administered to a model exert their major in- 
fluence on the performance of imitatively 
learned responses, is provided in a study in 
which models were rewarded or punished for 
exhibiting aggressive behavior (Bandura et 
al., 1963b). Although children who had ob- 
served aggressive responses rewarded subse- 
quently reproduced the model’s behavior while 
children in the model-punished condition 
failed to do so, a number of the subjects in 
the latter group described in postexperimental 
interviews the model’s repertoire of aggressive 
responses with considerable accuracy. Evidently,
they had learned the cognitive equivalents
of the model’s responses but they were
not translated into their motoric forms. These
findings highlighted both the importance of 
distinguishing between learning and perform- 
ance and the need for a systematic study of 
whether reinforcement is primarily a learning- 
related or a performance-related variable. 

In the present experiment children observed 
a film-mediated model who exhibited novel 
physical and verbal aggressive responses. In 
one treatment condition the model was se- 
verely punished; in a second, the model was 
generously rewarded; while the third condi- 
tion presented no response consequences to the 
model. Following a postexposure test of imi- 
tative behavior, children in all three groups 
were offered attractive incentives contingent 
on their reproducing the models’ responses so 
as to provide a more accurate index of learning.
It was predicted that reinforcing consequences
to the model would result in significant
differences in the performance of imitative
behavior with the model-rewarded group
displaying the highest number of different 
classes of matching responses, followed by 
the no-consequences and the model-punished 
groups, respectively. In accordance with previ- 
ous findings (Bandura et al., 1961, 1963a) it 
was also expected that boys would perform 
significantly more imitative aggression than 
girls. It was predicted, however, that the in- 
troduction of positive incentives would wipe 
out both reinforcement-produced and sex- 
linked performance differences, revealing an 
equivalent amount of learning among children 
in the three treatment conditions. 


METHOD 
Subjects 


The subjects were 33 boys and 33 girls enrolled in 
the Stanford University Nursery School. They ranged 
in age from 42 to 71 months, with a mean age of 
51 months. The children were assigned randomly to 
one of three treatment conditions of 11 boys and 11 
girls each. 
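
The assignment described above can be illustrated with a short sketch. It assumes, since the article does not describe the mechanics, that boys and girls were shuffled separately and dealt into the three conditions; the subject labels and the function name `assign_conditions` are hypothetical.

```python
import random

def assign_conditions(boys, girls, conditions, seed=None):
    """Randomly assign children to conditions, balancing sex within each condition.

    Hypothetical helper: the article reports only that assignment was random,
    with 11 boys and 11 girls per condition.
    """
    rng = random.Random(seed)
    assignment = {c: {"boys": [], "girls": []} for c in conditions}
    for label, children in (("boys", boys), ("girls", girls)):
        shuffled = list(children)
        rng.shuffle(shuffled)
        per_condition = len(shuffled) // len(conditions)
        for i, c in enumerate(conditions):
            start = i * per_condition
            assignment[c][label] = shuffled[start:start + per_condition]
    return assignment

CONDITIONS = ("model-rewarded", "model-punished", "no-consequences")
boys = [f"B{i:02d}" for i in range(1, 34)]   # 33 hypothetical subject labels
girls = [f"G{i:02d}" for i in range(1, 34)]  # 33 hypothetical subject labels
groups = assign_conditions(boys, girls, CONDITIONS, seed=0)
for c in CONDITIONS:
    print(c, len(groups[c]["boys"]), len(groups[c]["girls"]))
```

Shuffling each sex separately is one simple way to guarantee the 11-and-11 split reported for every condition.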

Two adult males served in the role of models, and 
one female experimenter conducted the study for all 
66 children. 


Exposure Procedure 


The children were brought individually to a semi- 
darkened room. The experimenter informed the child 
that she had some business to attend to before they 
could proceed to the “surprise playroom,” but that 
during the waiting period the child might watch a 
televised program. After the child was seated, the 
experimenter walked over to the television console, 
ostensibly tuned in a program and then departed. A 
film of approximately 5 minutes duration depicting 
the modeled responses was shown on a glass lenscreen 
in the television console by means of a rear projection
arrangement, screened from the child’s view by
large panels. The televised form of presentation was
utilized primarily because attending responses to
televised stimuli are strongly conditioned in children
and this procedure would therefore serve to enhance
observation which is a necessary condition for the
occurrence of imitative learning.

The film began with a scene in which the model 
walked up to an adult-size plastic Bobo doll and 
ordered him to clear the way. After glaring for a 
moment at the noncompliant antagonist the model 
exhibited four novel aggressive responses each ac- 
companied by a distinctive verbalization. 

First, the model laid the Bobo doll on its side, sat
on it, and punched it in the nose while remarking,
“Pow, right in the nose, boom, boom.” The model
then raised the doll and pommeled it on the head
with a mallet. Each response was accompanied by
the verbalization, “Sockeroo . . . stay down.” Following
the mallet aggression, the model kicked the
doll about the room, and these responses were interspersed
with the comment, “Fly away.” Finally, the
model threw rubber balls at the Bobo doll, each
strike punctuated with “Bang.” This sequence of
physically and verbally aggressive behavior was repeated
twice.

The component responses that enter into the development
of more complex novel patterns of behavior
are usually present in children’s behavioral
repertoires as products either of maturation or of
prior social learning. Thus, while most of the elements
in the modeled acts had undoubtedly been
previously learned, the particular pattern of components
in each response, and their evocation by specific
stimulus objects, were relatively unique. For
example, children can manipulate objects, sit on
them, punch them, and they can make vocal responses,
but the likelihood that a given child would
spontaneously place a Bobo doll on its side, sit on it,
punch it in the nose and remark, “Pow . . . boom,
boom,” is exceedingly remote. Indeed, a previous
study utilizing the same stimulus objects has shown
that the imitative responses selected for the present
experiment have virtually a zero probability of occurring
spontaneously among preschool children
(Bandura et al., 1961) and, therefore, meet the criterion
of novel responses.

The rewarding and punishing contingencies associated
with the model’s aggressive responses were introduced
in the closing scene of the film.

For children in the model-rewarded condition, a
second adult appeared with an abundant supply of
candies and soft drinks. He informed the model that
he was a “strong champion” and that his superb aggressive
performance clearly deserved a generous
treat. He then poured him a large glass of 7-Up, and
readily supplied additional energy-building nourishment
including chocolate bars, Cracker Jack popcorn,
and an assortment of candies. While the model
was rapidly consuming the delectable treats, his admirer
symbolically reinstated the modeled aggressive
responses and engaged in considerable positive social
reinforcement.

For children in the model-punished condition, the
reinforcing agent appeared on the scene shaking his
finger menacingly and commenting reprovingly, “Hey
there, you big bully. You quit picking on that clown.
I won't tolerate it.” As the model drew back he
tripped and fell, and the other adult sat on the model
and spanked him with a rolled-up magazine while
reminding him of his aggressive behavior. As the
model ran off cowering, the agent forewarned him,
“If I catch you doing that again, you big bully, I’ll
give you a hard spanking. You quit acting that way.”

Children in the no-consequences condition viewed
the same film as shown to the other two groups
except that no reinforcement ending was included.


Performance Measure


Immediately following the exposure session the
children were escorted to an experimental room that
contained a Bobo doll, three balls, a mallet and pegboard,
dart guns, cars, plastic farm animals, and a
doll house equipped with furniture and a doll family.
Because a variety of stimulus objects was provided, the
children were at liberty to exhibit imitative responses or
to engage in nonimitative forms of behavior.

After the experimenter instructed the child that he 
was free to play with the toys in the room, she 
excused herself supposedly to fetch additional play 
materials. Since many preschool children are reluc- 
tant to remain alone and tend to leave after a short 
period of time, the experimenter reentered the room 
midway through the session and reassured the child
that she would return shortly with the goods. 

Each child spent 10 minutes in the test room 
during which time his behavior was recorded every 
5 seconds in terms of predetermined imitative re- 
sponse categories by judges who observed the session 
through a one-way mirror in an adjoining observation
room.

Two observers shared the task of recording the 
occurrence of matching responses for all 66 children.
Neither of the raters had knowledge of the treat- 
ment conditions to which the children were as- 
signed. In order to provide an estimate of interscorer 
reliability, the responses of 10 children were scored 
independently by both observers. Since the imitative 
responses were highly distinctive and required no 
subjective interpretation, the raters were virtually in 
perfect agreement (99%) in scoring the matching 
responses. 
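
The agreement figure can be illustrated with a simple percent-agreement computation. This is only a sketch: the article does not state how the 99% value was calculated, so interval-by-interval matching is assumed here, and the category labels and the function name `percent_agreement` are hypothetical. A 10-minute session scored every 5 seconds yields 120 intervals per child.

```python
def percent_agreement(rater_a, rater_b):
    """Percentage of scoring intervals on which two raters recorded the same category."""
    if len(rater_a) != len(rater_b):
        raise ValueError("both raters must score the same number of intervals")
    if not rater_a:
        raise ValueError("at least one interval is required")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return 100.0 * matches / len(rater_a)

INTERVALS = (10 * 60) // 5  # 10 minutes scored every 5 seconds = 120 intervals

# Hypothetical records for one child: the raters disagree on a single interval.
rater_a = ["none"] * 100 + ["mallet"] * 20
rater_b = ["none"] * 100 + ["mallet"] * 19 + ["kick"]
agreement = percent_agreement(rater_a, rater_b)
print(f"{agreement:.1f}% agreement over {INTERVALS} intervals")
```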

The number of different physical and verbal imi- 
tative responses emitted spontaneously by the chil- 
dren constituted the performance measure. 


Acquisition Index 

At the end of the performance session the experimenter
entered the room with an assortment of fruit
juices in a colorful juice-dispensing fountain, and
booklets of sticker-pictures that were employed as
the positive incentives to activate into performance
what the children had learned through observation.

After a brief juice treat the children were informed
that for each physical or verbal imitative response
that they reproduced, they would receive a pretty
sticker-picture and additional juice treats. An achievement
incentive was also introduced in order to produce
further disinhibition and to increase the children’s
motivation to exhibit matching responses. The
experimenter attached a pastoral scene to the wall
and expressed an interest in seeing how many sticker-pictures
the child would be able to obtain to adorn
his picture.

The experimenter then asked the child, “Show me 
what Rocky did in the TV program,” “Tell me what 
he said,” and rewarded him immediately following 
each matching response. If a child simply described 
an imitative response he was asked to give a per- 
formance demonstration. 

Although learning must be inferred from performance,
it was assumed that the number of different
physical and verbal imitative responses reproduced
by the children under the positive-incentive conditions
would serve as a relatively accurate index of
learning.


FIG. 1. Mean number of different matching responses
reproduced by children as a function of positive
incentives and the model’s reinforcement contingencies.


RESULTS 


Figure 1 shows the mean number of dif- 
ferent matching responses reproduced by chil- 
dren in each of the three treatment conditions 
during the no-incentive and the positive-incentive
phases of the experiment. A square-root
transformation (y = √(f + ½)) was applied
to these data to make them amenable
to parametric statistical analyses. 
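
As an illustration of the transformation, the sketch below assumes the formula reads y = √(f + ½), the familiar square-root transformation for frequency counts; the added constant is an assumption recovered from a damaged scan, and the raw scores shown are hypothetical rather than data from the experiment.

```python
import math

def sqrt_transform(f, constant=0.5):
    """Square-root transformation for a frequency count f.

    The constant of 1/2 is an assumption about the article's formula.
    """
    if f < 0:
        raise ValueError("frequency counts cannot be negative")
    return math.sqrt(f + constant)

# Hypothetical counts of different matching responses for five children.
raw_scores = [0, 1, 2, 4, 8]
transformed = [sqrt_transform(f) for f in raw_scores]
for f, y in zip(raw_scores, transformed):
    print(f, round(y, 3))
```

The transformation compresses the upper end of the scale, so that a change from 4 to 8 responses counts for less than a change from 0 to 4.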


Performance Differences


A summary of the analysis of variance based
on the performance scores is presented in Ta- 
ble 1. The findings reveal that reinforcing con- 
sequences to the model had a significant effect 
on the number of matching responses that the 
children spontaneously reproduced. The main
effect of sex is also highly significant, confirming
the prediction that boys would perform
more imitative responses than girls.


TABLE 1

ANALYSIS OF VARIANCE OF IMITATIVE
PERFORMANCE SCORES

Source            df     MS       F
Treatments (T)     2    1.21     3.27*
Sex (S)            1    4.87    13.16**
T × S              2    0.12     <1
Within groups     60    0.37

* p < .05.
** p < .001.
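
The F ratios in Table 1 can be checked by dividing each mean square by the within-groups mean square. The sketch below is a consistency check under stated assumptions, not a reanalysis: the within-groups value (0.37) and the interaction value (0.12) are poorly legible in the scan and are taken here as the values that reproduce the printed F ratios.

```python
def f_ratio(ms_effect, ms_within):
    """F ratio for a fixed-effects analysis of variance: MS effect / MS within."""
    if ms_within <= 0:
        raise ValueError("within-groups mean square must be positive")
    return ms_effect / ms_within

MS_WITHIN = 0.37  # assumed; inferred from the printed F ratios
mean_squares = {"Treatments (T)": 1.21, "Sex (S)": 4.87, "T x S": 0.12}
f_values = {source: f_ratio(ms, MS_WITHIN) for source, ms in mean_squares.items()}
for source, f in f_values.items():
    print(f"{source}: F = {f:.2f}")
```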

Further comparisons of pairs of means by t
tests (Table 2) show that while the model-rewarded
and the no-consequences groups did
not differ from each other, subjects in both of
these conditions performed significantly more
matching responses than children who had observed
the model experience punishing consequences
following the display of aggression. It
is evident, however, from the differences reported
separately for boys and girls in Table
2, that the significant effect of the model’s
reinforcement contingencies is based predominantly
on differences among the girls’ subgroups.*


TABLE 2

COMPARISON OF PAIRS OF MEANS BETWEEN
TREATMENT CONDITIONS

                             Treatment conditions
Performance      Reward versus    Reward versus      Punishment versus
measure          punishment       no consequences    no consequences
                      t                t                  t
Total sample        2.20**           0.55               2.25**
Boys                1.05             0.19               1.24
Girls            [illegible]         0.12               2.02*

* p < .05.
** p < .025.


Differences in Acquisition 


An analysis of variance of the imitative 
learning scores is summarized in Table 3. The 
introduction of positive incentives completely 
wiped out the previously observed perform- 
ance differences, revealing an equivalent 
amount of imitative learning among the children
in the model-rewarded, model-punished,
and the no-consequences treatment groups. 
Although the initially large sex difference was 
substantially reduced in the positive-incentive 
condition, the girls nevertheless still displayed 
fewer matching responses than the boys. 


* Because of the skewness of the distribution of 
scores for the subgroup of girls in the model-pun- 
ished condition, differences involving this group 
were also evaluated by means of the Mann-Whitney 
U test. The nonparametric analyses yield probability 
values that are identical to those reported in Table 2. 


Acquisition-Performance Differences


In order to elucidate further the influence of
direct and vicariously experienced reinforce- 
ment on imitation, the differences in match- 
ing responses displayed under nonreward and 
positive-incentive conditions for each of the 
three experimental treatments were evaluated 
by the t-test procedure for correlated means.
Table 4 shows that boys who witnessed the 
model either rewarded or left without conse- 
quences performed all of the imitative re- 
sponses that they had learned through obser- 
vation and no new matching responses 
emerged when positive reinforcers were made 
available. On the other hand, boys who had 
observed the model punished and girls in all 
three treatment conditions showed significant 
increments in imitative behavior when re- 
sponse-contingent reinforcement was later in- 
troduced. 
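
The t test for correlated means used in these comparisons can be sketched as follows. Each child contributes a pair of scores (no incentive, positive incentive), and the test is carried out on the within-child differences. The scores below are hypothetical, and the function name `correlated_t` is illustrative only.

```python
import math

def correlated_t(before, after):
    """t statistic and degrees of freedom for correlated (paired) means."""
    n = len(before)
    if n != len(after) or n < 2:
        raise ValueError("need at least two matched pairs of scores")
    diffs = [a - b for a, b in zip(after, before)]
    mean_d = sum(diffs) / n
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)
    if var_d == 0:
        raise ValueError("differences have zero variance; t is undefined")
    return mean_d / math.sqrt(var_d / n), n - 1

# Hypothetical transformed scores for four children in one subgroup.
no_incentive = [1.0, 2.0, 3.0, 4.0]
positive_incentive = [2.0, 4.0, 3.0, 6.0]
t, df = correlated_t(no_incentive, positive_incentive)
print(f"t = {t:.3f}, df = {df}")
```

Because each child serves as his own control, stable differences among children drop out of the error term.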


DISCUSSION


The results of the present experiment lend 
support to a contiguity theory of imitative 
learning; reinforcements administered to the 
model influenced the observers’ performance 
but not the acquisition of matching responses. 

It is evident from the findings, however,
that mere exposure to modeling stimuli does
not provide the sufficient conditions for imitative
or observational learning. The fact that
most of the children in the experiment failed
to reproduce the entire repertoire of behavior
exhibited by the model, even under positive-incentive
conditions designed to disinhibit and
to elicit matching responses, indicates that
factors other than mere contiguity of sensory
stimulation undoubtedly influence imitative
response acquisition.


TABLE 3

ANALYSIS OF VARIANCE OF IMITATIVE LEARNING SCORES

Source            df     MS      F
Treatments (T)     2    0.02    <1
Sex (S)            1    0.56    6.22*
T × S              2    0.02    <1
Within groups     60    0.09


TABLE 4

SIGNIFICANCE OF THE ACQUISITION-PERFORMANCE
DIFFERENCES IN IMITATIVE RESPONSES

                         Treatment conditions
Group            Reward     Punishment    No consequences
Total sample     2.38*       5.00***         2.67**
Boys             0.74        2.26*           1.54
Girls            [?].33*     5.65***         2.18*

* p < .025.
** p < .01.
*** p < .001.


Exposing a person to a complex sequence of
stimulation is no guarantee that he will attend
to the entire range of cues, that he will necessarily
select from a total stimulus complex
only the most relevant stimuli, or that he will
even perceive accurately the cues to which his
attention is directed. Motivational variables, 
prior training in discriminative observation, 
and the anticipation of positive or negative 
reinforcements contingent on the emission of 
matching responses may be highly influential 
in channeling, augmenting, or reducing ob- 
serving responses, which is a necessary precon- 
dition for imitative learning (Bandura, 1962; 
Bandura & Walters, 1963). Procedures that 
increase the distinctiveness of the relevant 
modeling stimuli also greatly facilitate observational
learning (Sheffield & Maccoby, 1961).

In addition to attention-directing variables, 
the rate, amount, and complexity of stimuli 
presented to the observer may partly determine
the degree of imitative learning. The
acquisition of matching responses through ob- 
servation of a lengthy uninterrupted sequence 
of behavior is also likely to be governed by 
principles of associate learning such as fre- 
quency and recency, serial order effects, and 
other multiple sources of associative inter- 
ference (McGuire, 1961). 

Social responses are generally composed of 
a large number of different behavioral units 
combined in a particular manner. Responses 
of higher-order complexity are produced by 
combinations of previously learned com- 
ponents which may, in themselves, represent 
relatively complicated behavioral patterns. 
Consequently, the rate of acquisition of intricate
matching responses through observation
will be largely determined by the extent
to which the necessary components are contained
in the observer’s repertoire. A person
who possesses a very narrow repertoire of be- 
havior, for example, will, in all probability, 
display only fragmentary imitation of a model’s
behavior; on the other hand, a person who
has acquired most of the relevant components 
is likely to perform precisely matching re- 
sponses following several demonstrations. In 
the case of young preschool children their 
motor repertoires are more highly developed 
than their repertoires of verbal responses. It 
is, perhaps, for this reason that even in the 
positive-incentive condition, children repro- 
duced a substantially higher percentage 
(67%) of imitative motor responses than 
matching verbalizations (20%). A similar pat- 
tern of differential imitation was obtained in 
a previous experiment (Bandura & Huston,
1961) in which preschool children served as
subjects.

It is apparent from the foregoing discussion 
that considerably more research is needed in 
identifying variables that combine with con- 
tiguous stimulation in governing the process of 
imitative response acquisition. 

It is possible, of course, to interpret the 
present acquisition data as reflecting the op- 
eration of generalization from a prior history 
of reinforcement of imitative behavior. Within
any social group, models typically exhibit the
accumulated cultural repertoires that have
proved most successful for given stimulus situations;
consequently, matching the behavior
of other persons, particularly the superiors in 
an age-grade or prestige hierarchy, will maxi- 
mize positive reinforcement and minimize 
the frequency of aversive response conse- 
quences. Since both the occurrence and the 
positive reinforcement of matching responses, 
whether by accident or by intent, are inevita- 
ble during the course of social development, 
no definitive resolution of the reinforcement 
issue is possible, except through an experiment 
utilizing organisms that have experienced 
complete social isolation from birth. It is evi- 
dent, however, that contemporaneous rein- 
forcements are unnecessary for the acquisition 
of new matching responses.

The finding that boys perform more imitative
aggression than girls as a result of exposure
to an aggressive male model is in accord
with results from related experiments
(Bandura et al., 1961, 1963a). The additional
finding, however, that the introduction of posi- 
tive incentives practically wiped out the prior 
performance disparity strongly suggests that 
the frequently observed sex differences in 
aggression (Goodenough, 1931; Johnson, 
1951; Sears, 1951) may reflect primarily dif- 
ferences in willingness to exhibit aggressive 
responses, rather than deficits in learning or 
“masculine-role identification.” 

The subgroups of children who displayed 
significant increments in imitative behavior 
as a function of positive reinforcement were 
boys who had observed the aggressive model 
punished, and girls for whom physically ag- 
gressive behavior is typically labeled sex inap- 
propriate and nonrewarded or even negatively 
reinforced. The inhibitory effects of differing 
reinforcement histories for aggression were 
clearly reflected in the observation that boys 
were more easily disinhibited than girls in the 
reward phase of the experiment. This factor 
may account for the small sex difference that 
was obtained even in the positive-incentive 
condition. 

The present study provides further evidence 
that response inhibition and response disinhi- 
bition can be vicariously transmitted through 
observation of reinforcing consequences to a 
model’s behavior. It is interesting to note, 
however, that the performance by a model of 
socially disapproved or prohibited responses 
(for example, kicking, striking with objects) 
without the occurrence of any aversive conse- 
quences may produce disinhibitory effects 
analogous to a positive reinforcement opera-
tion. These findings are similar to results from
studies of direct reinforcement (Crandall,
Good, & Crandall, 1964) in which nonreward
functioned as a positive reinforcer to increase
the probability of the occurrence of formerly
punished responses.

Punishment administered to the model ap- 
parently further reinforced the girls’ existing 
inhibitions over aggression and produced re- 
markably little imitative behavior; the boys 
displayed a similar, though not significant, 
decrease in imitation. This difference may be 
partly a function of the relative dominance of 
aggressive responses in the repertoires of boys
and girls. It is also possible that vicarious re-
inforcement for boys, deriving from the mod-
el’s successful execution of aggressive behavior
(that is, overpowering the noncompliant ad- 
versary), may have reduced the effects of 
externally administered terminal punishment. 
These factors, as well as the model’s self-re- 
warding and self-punishing reactions follow- 
ing the display of aggression, will be investi- 
gated in a subsequent experiment. 
DYADS WITH HIGH AND LOW INTERPERSONAL
COMPARABILITY 1


LEONARD HABER 2 AND MARVIN A. IVERSON
Adelphi University 


College students constituted 56 dyads, half being high and half low in interpersonal
comparability. Each dyad composed a joint letter for a recipient with either 
subordinate or superior status. Certain kinds of verbal usage, particularly 
negative references to role and institution and verbalization designed to be 
self-protective, were more evident in upward than downward communication. 
The effects of vertical interpersonal distance were in most instances further 
influenced by the degree of horizontal distance (high and low comparability) 
present in communicating groups. Although some outcomes were puzzling, 
interpersonal disaffiliation through verbal usage was generally most noticeable 
for dyads with low comparability engaged in upward communication. The
selective flow of communication was interpreted as maintenance of status
differences.


Earlier reported findings in upward and 
downward communication in a status system 
have been largely confined to settings wherein 
the within-group, interpersonal comparability 
of members was held constant or ignored. It 
was the purpose of the present study to de- 
termine the effects of this within-group com- 
parability dimension on the vertical communi- 
cation in a hierarchical relationship. 


Of the earlier work, Kelley’s (1951) ob-
servations have been especially definitive in
clarifying relationships among variables perti-
nent to vertical communication. In terms of
his results, members of high-status groups had
noticeable latitude in offering personal criti-
cisms of members of cooperating low-status
groups but were less inclined to express nega- 
tive attitudes about themselves and their own 
positions. Although not expressly measured, 
Kelley’s findings also suggested some tend- 
ency among high-status individuals to reserve 
role-relevant comments for members of the 
same high-status level. Low-status groups in 
comparison tended to inhibit personal criti- 
cisms of their superiors and to communicate 
upward their negative views on their subordi-
nate role assignment. Some of these effects
were further dependent on whether status was
mobile or nonmobile.

1 This report contains a portion of the data pre-
sented in an unpublished doctoral dissertation by
the first author.

2 Now at Hunter College of the City University of
New York.

Although Kelley’s work was primarily ori- 
ented toward examining restraining forces and 
substitutive locomotion in communication, his 
findings could also be viewed as evidence that 
subjects responded in either an ego-protective 
or ego-enhancing manner. Such interpretation 
has been made by Cohen (1958) and sup- 
ported in an experiment wherein self-protec- 
tive needs were expressly aroused. In this 
sense, high ranking persons could be described 
as behaving ego protectively by inhibiting 
comments about their own role in downward 
communication. In so doing, they would pre- 
serve their social distance from low-status 
groups. By confronting subordinates with per- 
sonal criticisms, superiors would emphasize 
the privilege of higher rank and supposedly 
derive some degree of ego enhancement. Ego 
protection or enhancement in the case of low- 
status persons seemed to take the form of 
communicators striving to provide the defer- 
ential treatment expected from them in re- 
lating to superiors. At the same time, sub- 
ordinate communicators were trying through 
disparagement of their role to dissociate them- 
selves from their assigned inferior group 
status. In both instances, the very behaviors 
elicited by the assigned status served to em- 
phasize the vertical distance between groups 
and also to afford individuals greater security 
in their statuses. These so-called maintenance
effects taken in combination may be said 
to have resulted from status-maintaining be- 
havior. 

With this formulation in view, a major ob- 
jective in the present study was to determine 
the degree to which high or low interpersonal 
comparability within a dyad of given status 
affected its vertical communication. Compa- 
rable dyads were defined as partners who note 
similarities with, identify, move toward, or 
have trust in each other whereas noncom- 
parable partners were described by an op- 
posite tendency of moving away, noting dif- 
ferences, distrusting each other, or striving for 
separate identity.3 The underlying dimension
was regarded as one of horizontal social dis- 
tance to be contrasted in its effects on selec- 
tive flow of communication with the vertical 
distance dimension of varied status. 

Presumably high comparability among 
group members is a condition of positive va- 
lence and one that affords personal security. 
Members of such a dyad experience relatively 
little need to be ego defensive. As a conse- 
quence, their tendencies to emphasize status 
differences in communication, whether up- 
ward or downward, should also be reduced. 
Conversely, low comparability in a dyad is a 
condition of negative valence and would in- 
tensify ego-protective needs and therefore ac- 
centuate tendencies to maintain vertical dis- 
tance—the specific mode for doing so depend- 
ing on the direction of communication. The 
latter hypothesis became more plausible by 
taking into account the additional fact that
individuals in low or noncomparable (as op- 
posed to high comparable) groups derive less 
social support from their peers and therefore 
would conceivably become more sensitized to 
the reward/cost implications of differential 
status (see Stogdill, 1959, pp. 150-195, for a
review of relevant literature). 

The hypothesis regarding the effect of com-
parability in dyads on their vertical communi-
cation was necessarily exploratory. To re-
fer again to previous experimentation, Kelley
(1951) and particularly Cohen (1958) have
related the effects of mobile-nonmobile status
on vertical communication to the security
which group members felt in high- or low-
ranking positions. Therefore, in the sense that
status security generates personal security of
the same kind found in comparable groups,
one would suppose results of the present
study to be consistent with those already
established. Differences in results, however,
were also tenable in that personal security
which arises from an ingroup relationship
(comparability) could have less relevance for
vertical communication than security which
is more directly linked to an outgroup re-
lationship (mobile-nonmobile status). Thus
high and low comparability would have no
consequential effect.

3 A more detailed discussion of interpersonal com-
parability as a dyadic variable is contained in Iver-
son (1962).

The contention that ego-relevant behaviors 
function as status-maintaining behaviors led 
directly to some further specific hypotheses 
on the effects of differential status as well as 
content of communication. For example, sub-
ordinate individuals who are necessarily self-
protective (or required to be socially distant)
in the presence of higher status persons would 
reasonably be more restricted in overall ver- 
bal output, use more formal styles of ad- 
dress, employ more intellectualized, less col- 
loquial modes of expression, and would resort 
to more objective-type statements in com- 
municating upward. Their style of discourse 
should be generally impersonal. Brown and
Ford (1961) have noted the restraints in call- 
ing superiors by first name or in using other 
forms of social address which imply intimacy 
in relationship. This particular defensiveness 
in language usage by subordinates in upward 
communication becomes even more under- 
standable if one takes into account the greater 
control over rewards and punishments which 
superiors are perceived to have in interact- 
ing with low-ranking persons (Cohen, 1958; 
Heider, 1958; Thibaut & Kelley, 1959). 

Although low-status individuals can be ex- 
pected to refrain from verbal output which 
connotes self-reference and intimacy, they are 
conceivably more inclined to make evaluative 
observations which are specific to their in- 
ferior hierarchical position. By devaluing their 
particular role, they dissociate themselves 
from having low status and indirectly en- 
hance their self-esteem. Thus subordinates 
should be willing to verbalize negative atti-
tudes toward role-relevant matters such as
the atmosphere at the institution which pre- 
scribes their low-status ranking—more than 
individuals holding higher rank. Indeed, for 
high-status persons to derogate the institu- 
tion which accords them their superiority 
would only serve to downgrade their present 
status in the perceptions of others. 

When translated into operational terms, 
these hypotheses led to the predictions that 
college students would prepare shorter de- 
scriptions of the college when corresponding 
with a high-status official than with a lower 
ranking student. Also, students’ letters to a 
superior official (as opposed to a subordinate
high-school student) were expected to contain 
fewer value-oriented statements, words of less 
frequent usage, and, within the limits of such 
restrictions, more negative references to their 
own institution. These differences were pre- 
dicted to be greater in the case of communi- 
cating dyads with low as opposed to high 
comparability. 


METHOD

Subjects and Experimental Design


Students from introductory psychology courses 
were assembled to comprise 56 dyads. Half of these 
groups consisted of men partners, and in the remain- 
ing half, women students were paired. Assignment to 
dyads was based on nonacquaintance of members 
and availability for participation at the same hour. 

A 2 X 2 X 2 factorial design was employed to study
determinants of the content of written communica- 
tions. Two levels of interpersonal comparability in 
dyads, high and low, were defined. A second levels- 
type variable comprised subjects of superior rank 
communicating with a recipient of inferior rank (re- 
ferred to as a condition of “high status” hereafter) 
and subjects of inferior rank corresponding with a 
higher ranking official (condition of “low status”). 
The third main factor was the sex of the group 
members.


Procedure 


As a classroom routine, subjects were administered 
two questionnaires on interpersonal likes and dis- 
likes. To orient them to the comparability variable 
in the study, the investigator explained to the sub- 
jects that test scores would be used in the attempt to 
locate congenial partners for participation in a later
group procedure. No actual use, however, was made
of questionnaire data in scheduling individuals.

The experimental session was conducted in a one- 
way viewing room with each dyad taken one at a 
time. The experimenter began by informing partners
of their so-called high or low interpersonal com- 
parability (randomly assigned). For high compara- 
bility, the experimenter commented that on the basis 
of previous personality testing (referring to the class- 
room procedure), subjects were “remarkably similar 
in their general likes, dislikes, and dispositions and 
should, therefore, find each other very agreeable and 
pleasant as partners.” An opposite orientation was 
provided for low comparability. The experimenter 
warned the subjects that they could “expect to ex- 
perience disagreement and some degree of uncon- 
geniality” and explained that he was unable to sched- 
ule more suitable partners at the given hour. 

As an initial task, subjects were told that they 
would have 5 minutes to agree on a topic and to rep- 
resent it in a rough chalk drawing. The purpose was 
to provide subjects with an opportunity to interact 
and to experience the level of comparability to which 
the experimenter had oriented them. During this ex-
ercise, the experimenter left the room. He returned
immediately afterwards to administer a self-anchored 
rating scale and a scaled questionnaire. Items were 
designed to verify the fact that the subjects had ac- 
cepted the experimenter’s comparability orientation. 

The major experimental task required subjects to 
prepare a joint letter wherein they described the at- 
mosphere of the college to someone outside the insti- 
tution. Subjects were assigned high status (at ran- 
dom) by having them address the letter to a male 
high-school senior in an upstate city. Under the con- 
dition of low status, they prepared their letter for a 
professor in the state department of education and 
again located in the same upstate city. In order to 
make relevant the high power implications of the 
professor, the experimenter remarked that he would 
be joining the college faculty in the near future and 
would also hold the position of a new director of
student personnel. Likewise to focus the low power
value of the high-school senior, the experimenter
mentioned that the senior was scheduled to enter the
college in the coming year as a freshman. In order
to add to the plausibility of the task, the experi-
menter noted that the subjects’ letters would consti-
tute replies to specific personal requests which these
individuals had made and that letters would be sent 
as written. Names, positions, and addresses were 
actually fictitious. 

While the subjects prepared the letter, the experi- 
menter observed from the other side of the one-way 
window to ascertain that subjects understood the 
task and were involved in its prescribed purpose. 
After 15 minutes he returned, collected the letter by 
placing it in an appropriately addressed envelope,
and again administered the sociometric scales for de- 
termining the subjects’ comparability. As a final step, 
subjects recorded by means of another questionnaire 
their judgment of the relative power value of the 
recipient of their letter. 

To terminate the experimental session, the experi- 
menter explained to the subjects the variables of the 
study and cautioned them to refrain from discussing 
the experiment with other students. 
Dependent Measures


The following measures were applied to the sub- 
jects’ letters: 

1. The total number of words (verbal produc-
tivity) contained in the written communications of
each dyad.

2. The percentage of total number of words in
each letter having a frequency of occurrence of less
than 100 times per million according to the Thorn-
dike-Lorge (1944) word list. Scores are referred to
as “percentage low” and were intended to indicate
degree of usage of intellectualized language.

3. The total number of value-oriented statements
(value productivity) issued by each dyad. Value
statements were defined by the frequency of message
units with directional quality, either positive or nega-
tive, and were meant to provide an index of subjec-
tive expression. Reliability of scoring was indicated
by 99% agreement with an independent rater.

4. The percentage of negative references relating to
the college (percentage negative) provided by each
letter. Scoring was again checked by an independent
rater and yielded 99% agreement.
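
The first two measures are simple counts, and a minimal sketch may make their definitions concrete. The function name, the tokenizing rule, and the toy frequency table below are illustrative assumptions; the Thorndike-Lorge list itself is not reproduced, and the report does not say how words missing from the list were scored (the sketch counts them as low frequency).

```python
import re

def letter_measures(letter, per_million, cutoff=100):
    """Return (total_words, percentage_low) for one letter.

    per_million maps a lowercase word to its occurrences per million
    words; it stands in for the Thorndike-Lorge list. Words absent
    from the table are treated as low frequency (an assumption).
    """
    words = re.findall(r"[a-z']+", letter.lower())
    total = len(words)
    if total == 0:
        return 0, 0.0
    # A word is "low" when it occurs fewer than `cutoff` times per million.
    low = sum(1 for w in words if per_million.get(w, 0) < cutoff)
    return total, 100.0 * low / total

# Hypothetical frequencies, for illustration only.
toy_counts = {"the": 60000, "college": 150, "is": 10000, "a": 20000, "place": 500}
total, pct_low = letter_measures("The college is a congenial place", toy_counts)
```

Measures 3 and 4 depend on a rater's judgment of message units and are not reducible to a count of this kind.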


RESULTS 


All measures of verbal usage and all socio- 
metric data obtained at the end of the experi- 
mental procedure were subjected to a 2 X 2 
X 2 analysis of variance. F ratios were deter- 
mined for all main and interaction effects. 
Initial ratings of interpersonal comparability 
were analyzed in a 2 X 2 design inasmuch as 
status differentiation was introduced later in 
the procedure. 


Sociometric Results and Control Data 


Ratings of the subject’s partner for com- 
parability on the self-anchored and scaled 
questionnaires were combined in a single total 
score for each administration. Total mean rat-
ings in both the initial and final administra- 
tions were reliably different and corresponded 
to the orientation of high and low compara- 
bility as provided by the experimenter. With 
the level of the experimenter’s orientation as 
a main effect, the F ratio for initial ratings 
was 7.40 (p < .01) and 6.64 (p < .025) for
final ratings. 

Also present was a reliable difference in ini-
tial ratings associated with sex of dyadic mem-
bers (p < .01). Women seemed to accept the
comparability orientation more readily than 
did men subjects. This differentiation, how-
ever, diminished by the end of the experi-
mental procedure (p < .10). All other main
effects and the evidence for statistical inter-
actions in ratings of comparability were at-
tributed to chance (p > .20). Thus partners
defined by the experimenter as high in com- 
parability expressed more positive reactions 
toward each other throughout the course of 
the experimental procedure than did dyads 
defined as having low comparability. 

The fact that subjects accepted the differ- 
entiated status of the recipients of their let- 
ters as intended by the experimenter was 
also verified by sociometric data. The profes-
sor was rated as relatively important, seri-
ously intent on his work, and having an effect
on individual subjects whereas the high-school 
senior was assigned lower ratings (mean dif- 
ference significant at the .001 level). These 
findings in addition to the experimenter’s ob-
servations during the letter-writing procedure
indicated that the subjects regarded the task 
as meaningful and important. 

Because language behavior is related to ver-
bal ability, subjects in the various experi-
mental contingencies were compared in terms
of their verbal scores on the College Entrance 
Examination Boards. All mean differences 
were found to be statistically nonsignificant 
(ps > .10), indicating that experimental 
groupings were equivalent. 


Main Effects of the Subjects’ Status on Verbal
Usage

Predictions regarding differences in written
communications by high- and low-status sub-
jects were verified on all four dependent meas-
ures. As shown by a comparison of means in
Table 1, students wrote shorter letters, pre- 
ferred wording of low-frequency usage, re- 
stricted their overall number of value-ori- 
ented statements, but increased their percent- 
age of negative institutional references when 
communicating with a superior professor as 
opposed to a subordinate high-school senior 
(all mean differences were reliable at the .025 
level). Thus hypotheses relating to status 
maintenance in language usage were sup- 
ported. 
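
As a rough check on the size of these status effects, an F ratio for two groups can be computed from means and standard deviations alone. The sketch below is a simple one-way comparison, not the 2 X 2 X 2 analysis reported here; it assumes 28 dyads per status level (56 dyads split evenly) and sample standard deviations with N - 1 in the denominator, and the function name is hypothetical.

```python
def two_group_f(m1, sd1, n1, m2, sd2, n2):
    """One-way F ratio (1 and n1 + n2 - 2 df) from group means, SDs, and sizes."""
    grand = (n1 * m1 + n2 * m2) / (n1 + n2)
    # Between-groups sum of squares; with 1 df it equals the mean square.
    ms_between = n1 * (m1 - grand) ** 2 + n2 * (m2 - grand) ** 2
    # Pooled within-groups variance.
    ms_within = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    return ms_between / ms_within

# Total verbal productivity: low-status versus high-status dyads.
f_words = two_group_f(113.1, 22.6, 28, 136.6, 31.1, 28)
```

For total verbal productivity this yields an F of about 10.5 rather than the tabled 11.30. Some difference is expected, since the factorial analysis removes variance due to comparability, sex, and their interactions from the error term.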


Main Effects of Interpersonal Comparability 
on Verbal Usage 

Although no specific hypotheses were for-
mulated, the main effect of the variable of
comparability was evident with respect to
measures of value productivity (p < .01) and
possibly total verbal productivity (p < .10).
Mean scores were higher in both instances for
dyads with high as compared with low com-
parability. Mean differences for percentage of
low-frequency word usage and percentage of
negative institutional references were statisti-
cally nonsignificant (p > .20). Relevant sta-
tistics are given in Table 2.


TABLE 1
Comparisons of Measures of Verbal Usage by Dyads of High and Low Status

                                    Low-status dyads   High-status dyads
                                        (N = 28)           (N = 28)       F ratio of
Measure of verbal usage                M       SD         M       SD      M diff.
Total verbal productivity            113.1    22.6      136.6    31.1     11.30**
Percentage of low-frequency words     22.8     4.2       18.7     3.5     13.83**
Value productivity                     8.6     2.4       11.5     3.5     11.20**
Percentage negative references        34.3    25.3       17.9    17.8      6.77*

* p < .025.
** p < .01.


Interrelated Effects of the Subjects’ Status 
and Comparability 


Significant interactions were evident in 
three of the four dependent measures of 
verbal usage. However, only one variable, 
namely, percentage of low-frequency word 
usage for male subjects, clearly conformed to 
hypothesis (see Figure 1). Dyads of low in- 
ternal comparability and high status were 
more prone to use familiar-type words in ad- 
dressing a high-school senior than were low 
comparable partners of relatively low status 
whose communication was directed toward a 
college professor (mean difference, p < .05). 
As shown in Figure 1, dyads with high com-
parability failed to discriminate the recipients
of their letters in terms of the familiarity de-
noted by particular word preference. Scores
indicating percentage of low-frequency word 
usage for high- and low-status dyads with 
high comparability were similar and both 
mean values were intermediate to those ob- 
tained for high- and low-status dyads but 
having low internal comparability. The inter- 
related effect of subjects’ status and com- 
parability on “percentage low” for women 
students was statistically nonsignificant (p 
> .20). 

An interaction was obtained in measures of 
value productivity (p < .05) and was pos-
sibly present in total verbal productivity
(p < .10). The interrelated effects of the
major variables were very similar for the two 
productivity measures and consequently only 
the one interrelationship for value-oriented 
statements is represented in Figure 2. How-
ever, in both instances, the form of the rela-
tionship differed somewhat from that obtained
for percentage of low-frequency words and
also differed from prediction. Because of par-
ticular scoring methods, the curve for high-
status dyads in Figure 2 was expected to
slope in an opposite rather than the same
direction as the curve for high-status subjects
in Figure 1. Instead of being relatively im-
mune to the implications of addressing out-
group members with higher or lower status,
dyads with internal comparability seemed to
be more sensitized than dyads with low com-
parability. This finding was most evident
when subjects were higher status and com-
municating downward. High-status dyads
with high in comparison with low compara-
bility wrote longer letters and issued more
value-oriented statements. In the case of low-
status dyads, level of comparability did not
appear to mediate these particular effects on
word usage; results indicated a relatively
greater degree of verbal constriction in
general. Although the curve for low status
in Figure 2 was in the expected direction,
the mean difference between high and low
comparability was nonsignificant.

The fourth dependent measure of percent-
age of negative institutional references re-
vealed a nonsignificant interaction (p < .20),
indicating that status effects were independent
of the comparability conditions.


TABLE 2
Comparisons of Measures of Verbal Usage by Dyads with High and Low Comparability

                                    Low comparability   High comparability
                                        (N = 28)            (N = 28)        F ratio of
Measure of verbal usage                M       SD          M       SD       M diff.
Total verbal productivity            116.0    29.6     [remaining entries illegible]
Percentage of low-frequency words     20.9     3.3     [remaining entries illegible]
Value productivity                     8.4     3.2        11.7     2.8      14.86
Percentage negative references        22.3    20.7        29.9    23.0       1.45*

* p > .20.
** p < .10.
*** p < .01.


STATUS MAINTENANCE IN COMMUNICATIONS


Fig. 1. Comparison of effects of comparability and
status of male dyads on use of unfamiliar words.
(Plot of per cent low-frequency words against low and
high comparability, with separate curves for low-status
and high-status subjects.)


Interrelationship of Dependent Measures

Because of the discrepancy in findings on
the several indices of verbal usage, coefficients
of correlation (r’s) were computed between
the four measures. The fourth dependent vari-
able of percentage of negative references was
found to be independent of the other three
(intercorrelations did not differ significantly
from zero, p > .05) and consequently could
be said to represent a separate behavioral
effect. Total verbal productivity and scores 
on value productivity were positively related 
(r = .60, p < .01). This finding corresponded 
to the similarity of interaction effects obtained 
with the two measures. Percentage of low- 
frequency words showed negative correlation 
with verbal productivity scores (r = -.44,
p < .01). The presence of an inverse rela-
tionship was consistent with the definition of
decrements in verbal output and increments 
in percentage of low-frequency words as com- 
prising measures of inhibition of self-exposure. 
Inverse relationship between measures was 
also consistent with a negative correlation 
between the means contained in Figure 1 and
Figure 2 (rho = -.80).
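
The two coefficients used in this section can be sketched in a few lines. The product-moment r is computed on raw scores, and rho is the same coefficient computed on ranks. The cell means below are hypothetical placeholders, not the values plotted in Figures 1 and 2, and the ranking helper does not handle ties.

```python
from math import sqrt

def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def ranks(values):
    """Rank from 1 (smallest); ties are not handled in this sketch."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result

def spearman_rho(x, y):
    """Rank-order correlation: Pearson r applied to the ranks."""
    return pearson_r(ranks(x), ranks(y))

# Hypothetical cell means for four status-by-comparability cells.
pct_low_means = [24.0, 22.0, 21.0, 18.0]
value_means = [8.0, 9.0, 10.0, 13.0]
rho = spearman_rho(pct_low_means, value_means)
```

With only four cell means, rho can take very few distinct values, so a coefficient based on them is descriptive rather than a test of the relationship.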


Effects of the Sex Variable 


As is commonly found, women tended to 
write longer letters than men (p < .05), al- 
though the two groups were similar in verbal 
ability. Main effects with respect to other 
dependent measures, however, tended to be 
unreliable (p’s > .05). Also none of the
double interactions between sex and subjects’ 
status or sex and comparability was sta- 
tistically significant (p’s > .20). Only a 
significant triple interaction, as already cited, 
occurred with respect to percentage of low-
frequency word usage (p < .05).

Fig. 2. Comparison of effects of comparability and
status of dyads on value productivity.


DISCUSSION

Status-Maintaining Communication


The results, although limited in scope, bear
on the broader issues of instrumentally biased
flow of communication and role maintenance
within formal status hierarchies. The stability 
of role relationships is in large part dependent 
on the process of status maintenance. Such
process involves status-maintaining behavior 
which is designed to bulwark the security of 
an occupant of a social position in the power 
benefits and self-protections afforded by the 
position. To provide status security, status 
maintenance operations must comprise spe- 
cialized behaviors which conform to the ex- 
pectancies shared by other persons located in 
the social system.4 Status-maintaining behav-
ior must also afford position-occupants with
acceptable self-images.5 Thus it becomes
understandable that status-maintaining com- 
munication should be edited and that the 
editing should reflect differences in role- 
centered and self-centered concerns at various 
levels of a status hierarchy. 


Mediating Effects of Comparability of Dyads 


The fact that interpersonal comparability 
in dyads mediates the effects of subjects’ 
status on communication when directed to 
members of other groups was evident in the 
several interaction effects. However, these 
interrelated effects were somewhat puzzling in 
their inconsistency. On the measure of usage
of unfamiliar words, high horizontal distance
(low comparability) in dyads seemed to pro-
duce language behavior which affirmed vertical
distance between senders and recipient, that
is, showed increased differentiation of high 
and low status. On productivity measures, 
however, status maintenance became less evi- 
dent with the more horizontally distant 
condition. 

This divergence in interactions with meas-
ures which were otherwise significantly corre-
lated indicated the presence of some addi-
tional intervening factor. One possibility was
that such a variable relates to the particular
sex role of the subject. Women dyads ad-
dressed either superiors or subordinates of the
opposite sex whereas men communicated with
members of the same sex. However, because
women and men supplied similar status
ratings at the end of the experimental pro-
cedure, further explanation seemed in order.

4 This contention is derived from Tolman’s (1952)
position that role behavior of an individual which
conforms to expectancies of other members in an in-
teraction situation gains their approval, thus strength-
ening the individual’s expectancy for further reward
(or his sense of security) in repeated role perform-
ances.

5 This proposition is akin to Schein’s (1960) ob-
servations that role maintenance bears on an indi-
vidual’s self-concept and entails a selective flow of
interpersonal communication.

A further possibility stemmed from reexamina-
tion of certain effects of the variation in levels
of interpersonal comparability. With high 
comparability, partners were favorably pre- 
disposed toward one another and could readily 
agree on particular wording of messages. 
Therefore, when given a 15-minute time limit, 
they were relatively free, if they chose, to issue
longer and more value-oriented communica- 
tions to outsiders. Moreover, because they 
were supposedly compatible in interpersonal 
likes and dislikes, they had relatively little 
need to engage in ego-protective behavior with 
respect to each other. Instead they could 
concern themselves with ego protection as 
required in the vertical relationships con- 
fronting them, especially in upward com- 
munication. Partners who were provided an 
orientation of low comparability, however, 
were placed in an ambiguous, if not negative, 
relationship. Thus self-protective behaviors 
were called for both within the dyadic rela- 
tionship as well as in the external relation- 
ships imposed by the experimental procedure. 
Because subjects had to prepare a joint rather 
than individual communication, their letters
necessarily reflected disaffiliative tendencies 
arising from sources which could be either 
internal or external to the dyad. 


In terms of the findings of the present
study, the relatively low total verbal produc-
tivity and infrequent value-oriented state-
ments of low comparable dyads probably
represented the subjects’ concern with self-
protection from each other (internal secu-
rity). By keeping letters short and non-
opinionated, these dyads could avoid the
unpleasant disagreements which the experi-
menter said were likely to occur. To obtain
external security, these same dyads seemed to
use relatively formal, intellectualized words
and phrases when addressing superiors. In
writing subordinates, however, they exercised
their prerogative to initiate a more informal
type of discourse (and possibly communica-
tion which would be ego enhancing).

The presence of specific divergent modes 
of response, one of which relates to internal 
security and the other to external security, 
is plausible when one considers a group within 
its overall social context and takes into ac- 
count the necessary tendencies toward equi- 
librium between inner and outer forces in 
group functioning (see Bales, 1955). Also 
understandable is the contention that for 
initial short-term relationships such balancing 
effects would be made evident in ambiguously
structured dyads, namely, those which contain
members of low in comparison with high
comparability. A need for more definitive 
investigation along such lines is indicated. 

Evidence relating to interaction effects 
proved to be unreliable on the fourth measure 
of percentage of negative references. The ab- 
sence of a significant outcome on this measure 
as opposed to reliable outcomes on other 
measures implies that comparability in dyads 
is a less critical determinant when communi- 
cation relates to aspects of one’s position 
rather than to more central features of one’s 
personal self. This interpretation followed 
from the fact that negative institutional refer- 
ences failed to correlate with other verbal 
scores and from the assumption of disparate 
modes of status maintenance.


SOCIAL REINFORCER EFFECTIVENESS AS A FUNCTION OF THE
RELATIONSHIP BETWEEN CHILD AND ADULT¹


NORMA McCOY² AND EDWARD ZIGLER


Yale University


The hypothesis tested was that the child’s positive- and negative-reaction 
tendencies interact to determine an adult’s effectiveness as a reinforcer. Before 
administering a simple 2-part satiation task under conditions of positive rein- 
forcement, 36 grade-school boys were divided randomly into 3 conditions. 
In the stranger (St) condition Ss had no prior contact with E. In the familiar- 
neutral (FN) condition Ss participated in 3 sessions at weekly intervals during 
which E provided attractive art materials but interacted minimally. The 
familiar-positive (FP) was identical to the FN condition except that E inter- 
acted freely and positively with Ss. Results showed that the FP group elected 
to play the experimental game significantly longer than the FN group (p < .03) 
which played significantly longer than the St group (p<.001). The pattern 
of results in time spent in Part I vs. Part II suggested that the experimental 
conditions influenced both the child’s positive- and negative-reaction tendencies. 


Considerable evidence has now been pre- 
sented indicating that a number of verbal 
responses, for example, “good,” “right,” 
emitted by adults are effective in influencing 
children’s behavior. Attention has been given 
recently to the degree to which a variety of 
situational, subject, and experimenter varia- 
bles influence the effectiveness of these social 
reinforcers (Gewirtz, 1954; Gewirtz & Baer, 
1958a, 1958b; Gewirtz, Baer, & Roth, 1958;
Patterson, 1959; Patterson, Littman, &
Hinsey, in press; Patterson & Ludwig, 1961;
Stevenson, 1961; Stevenson, Hickman, &
Knights, 1963; Walters & Ray, 1960; Zigler,
1961, 1963b, in press; Zigler & Kanzer, 1962; 
Zigler & Williams, 1963). Although the effects 
of a number of variables have been demon- 
strated, the process or processes by which 
these verbal responses of an adult acquire 
their reinforcing properties are far from clear. 
The simplest and most commonly advanced 
view is that parents or caretakers frequently 
pair positive verbal responses with primary 


1 This research was supported by Research Grant 
MH-06809 from the National Institute of Mental 
Health, United States Public Health Service. The 
authors wish to express their thanks to Dora Levy, 
Principal of Edgewood Elementary School for her 
cooperation and assistance during the course of this 
study. The authors are also indebted to Irvin Child 
and Jacob Gewirtz for their insightful criticisms 
of an earlier version of this paper.

2 Now at San Francisco State College.

reinforcers, and these responses thus acquire
their reinforcing property. This view further 
holds that through generalization, these re- 
sponses are also reinforcing when employed 
by other adults. 

The difficulty in testing this position can 
be seen in recent studies which compare the 
effectiveness of social reinforcers dispensed by 
the mother or father of the child being re- 
inforced or by a strange adult (Patterson, 
1959; Patterson et al., in press; Patterson & 
Ludwig, 1961; Stevenson et al., 1963). The 
most logical prediction generated by the 
number-of-pairings position is that social
reinforcers dispensed by parents would be
more effective than social reinforcers dispensed
by strange adults. However, Patterson
(1959) found that although both fathers and
strangers were positively reinforcing, fathers
were not generally more reinforcing than were 
strange adults. Taking a somewhat different 
tack, Stevenson et al. (1963) predicted that 
parents would be less effective social rein- 
forcers than would strange adults. Although 
subscribing to the number-of-pairings posi- 
tion, these investigators felt that the continu- 
ous supportive role played by parents results 
in the child being relatively satiated on his 
parents’ supportive comments. Although 
Stevenson et al. did find that strangers were 
more effective reinforcing agents than were 
parents, their results remain surprising. These
investigators, employing a measure involving
the change in the rate of responding with the 
onset of reinforcement, found that the 
fathers’ supportive comments were negative 
reinforcers for both boys and girls and that 
the mothers’ supportive comments were nega- 
tive reinforcers for boys and positive rein- 
forcers for girls. These findings are hardly 
in keeping with the number-of-pairings hy- 
pothesis even when augmented by the satia- 
tion notion. In order to explain their findings, 
Stevenson et al. suggested that fathers in 
particular and men in general induced anxiety 
in children which inflated the rate of respond- 
ing during the base period. The view that 
greater identification exists between girls and 
their mothers than between boys and either 
their mothers or fathers was employed to 
explain the finding that mothers and female 
adults were effective positive reinforcers with 
girls. 

These explanations certainly suggest that 
it is inappropriate to view social reinforcers 
as operating in an automatic or mechanical 
manner or that the key variable is simply 
the number of pairings that these reinforcers 
have had with primary reinforcement. It 
would appear, rather, that the reinforcing 
agent and the responses he makes are complex 
stimuli which activate a variety of emotional, 
motivational, and cognitive responses. De- 
pending upon the particular measure of re- 
inforcer effectiveness employed, the responses 
of the child thus activated can result in either
a facilitation or an attenuation in the ef- 
fectiveness of the adults’ supportive com- 
ments. Although reporting certain findings not 
in keeping with those of Stevenson et al. 
(1963), Patterson et al. (in press) have also 
taken the view that the reinforcing adult is 
best conceptualized as being both a general 
reinforcer and a complex cue eliciting a wide
array of responses. 

The problem in conceptualizing the rein- 
forcing adult as the elicitor of responses is the 
difficulty in predicting just what responses 
he elicits and how such responses influence his 
effectiveness as a generalized reinforcer. The
complexity of this problem has been noted
by Patterson et al. (in press), and their effort
to unravel it by investigating the relationship
between parental practices in the home and
the effectiveness of parents as reinforcers has
been truly pioneering work.

Zigler and his colleagues (cf. Zigler, 1963b, 
in press) have also asserted that the typical 
experimental situation in which an adult ver- 
bally reinforces a child must be viewed as 
a complex interaction between adult and 
child. While also treating the adult as an 
elicitor, they have not been concerned with 
the entire spectrum of responses which are 
elicited in the child as a function of his total 
social conditioning history. Instead they have 
focused on those general tendencies elicited 
by the reinforcing adult which influence the 
adult’s effectiveness as a reinforcer independent
of the particular task being employed to
assess this effectiveness. The view here is that 
while the adult is both an elicitor and a 
reinforcer, it should be possible to deduce 
certain general principles which determine the 
adult’s reinforcer effectiveness. These investi- 
gators have argued that the history of every 
child is such that any adult elicits both a 
positive (approach) and negative (avoidance) 
reaction tendency. Thus, every interaction 
between an adult and child is viewed as a 
conflict situation for the child. 

Stating that the reinforcing adult elicits a 
positive-reaction tendency in the child is 
simply another way of asserting that the 
history of all children is such that adults 
have been paired with primary or secondary 
reinforcers frequently enough to make adults 
general-positive reinforcers. What has received 
minimal attention in the literature is that 
parents and adults are not only general-positive
reinforcers but, through their history of pairings
with punishing events, are general-negative
reinforcers as well. Thus, how reinforcing
the adult is for the child will depend
on the interaction between both these posi- 
tive and negative tendencies. Clearly, then, 
the relative magnitude of the tendencies will 
depend on the relative amount of positive 
and negative experiences the child has had 
with adults. Within such a framework the 
child minimally affected by social reinforcers 
would not necessarily be viewed as having low 
motivation for social reinforcers; he could 
be one whose negative-reaction tendency in- 
hibits him from freely responding in order 
to secure positive reinforcement.

In the present study, the hypothesis was
tested that the child’s positive- and negative- 
reaction tendencies interact in determining 
an adult’s effectiveness as a reinforcer. The 
assumption was made that the magnitude of 
these tendencies is affected by the general 
quality of the relationship existing between 
the reinforcing adult and the child. Three 
experimental conditions were employed. In 
the first, the reinforcing adult was a stranger; 
in the second, she was a familiar but neutral 
person; and in the third, she was a familiar 
and positive person. The general prediction 
tested was that the adult would be least
reinforcing in the first condition, more reinforcing
in the second condition, and most
reinforcing in the third condition. This pre- 
diction was derived from the position that the 
strange adult would elicit the child’s negative- 
reaction tendency which in turn would reduce 
the adult’s effectiveness as a social reinforcer. 
In the familiar-neutral condition, the child 
would have learned that this particular adult 
was not a punishing agent; and this knowl- 
edge would reduce the child’s negative- 
reaction tendency towards the adult prior to 
the reinforcing situation. The familiar-positive 
condition was viewed as one in which not only
the negative-reaction tendency would be reduced
but one in which the positive tendency
would be enhanced as well.

A test of this position demands independent
measures of both the child’s positive and
negative tendencies. As in previous studies 
(Zigler, 1961; Zigler, Hodgden, & Stevenson, 
1958), the total time the child elected to play 
a two-part satiation task was employed as 
the measure of his positive-reaction tendency. 
A cosatiation index, that is, a score reflecting 
the relative amount of time spent playing 
each part of the game, was employed to assess 
the child’s negative-reaction tendency. That 
this score is a valid measure of a child’s 
negative-reaction tendency is suggested by a 
number of earlier studies in which relatively 
low cosatiation scores were found for subjects 
whose life histories were characterized by a 
high incidence of negative social encounters 
(Kounin, 1941; Zigler, 1961; Zigler et al., 
1958). The rationale of this measure as ad- 
vanced by Zigler (1961) is as follows: If 
the child has no negative-reaction tendency,
he should play the first part of the game until
he is satiated on the social reinforcers being 
dispensed. Such a child should play the second 
part for a shorter period of time than the 
first. The greater the negative-reaction tend- 
ency of the child, the shorter should be the 
time he spends on the first part. However, 
during the first part he is socially reinforced; 
he learns that the adult is not punishing and 
furthermore, he discovers upon the termina- 
tion of the first part that he can indeed end 
the interaction whenever he likes. This should 
reduce the negative-reaction tendency with 
which he begins the second part. How long 
such a child plays the second part depends 
upon how large a negative-reaction tendency 
was present to be reduced during the first 
part. The greater the child’s initial negative- 
reaction tendency, the greater the likelihood 
that he will play the second part longer than 
the first. 

Further evidence that the cosatiation index 
is a valid measure of the child’s negative- 
reaction tendency was provided by Shallen- 
berger and Zigler (1961). These investigators 
found that both normal and retarded children 
who received the negative pretraining condi- 
tion prior to the two-part game had lower 
cosatiation scores than children in a positive
pretraining condition. For normal children, 
evidence to date suggests that the reduction 
in the negative-reaction tendency during 
Part I of the game is balanced by the satiation
effects. Such children play Part II for
about the same length of time as Part I.
Thus, employing the two-part cosatiation procedure,
the specific prediction made was that
children in the stranger condition should play
Part II about as long as Part I, while both
familiar groups should show a marked decrease 
in playing time from Part I to Part II. 


METHOD 
Subjects 


The sample consisted of 36 first- and second-grade 
boys attending the Edgewood Elementary School in 
New Haven, Connecticut. The school was located in
a middle-class neighborhood having a predominantly
Jewish population. Subjects were picked at random
from all first- and second-grade boys with the
restriction that no boy was included who was judged
a behavior problem by the teacher or principal.
The mean age of the group was 7.2 years. The
sample was restricted to boys in order to avoid the
complications of sex effects.


Experimenter 


The experimenter was a 28-year-old female PhD 
in child psychology who had had considerable 
experience testing subjects in studies investigating 
social reinforcement effects on children. The experi- 
menter was aware of the hypothesis under test, and 
any bias that this may have produced must be 
considered an uncontrolled variable in this experi- 
ment. In order to diminish the effect of any such 
bias, a highly structured procedure was employed 
during the administration of the experimental game. 
Considering the care exercised by the experimenter, 
it is the authors’ belief that any examiner bias had 
minimal effect on the findings. 


Experimental Manipulations 


The sample was divided randomly into three 
groups of 12 boys each with the restriction that the 
mean CA of the groups be approximately equal. The 
three experimental conditions were: 

Stranger (St). The experimenter had no contact 
with any of the subjects in this condition prior to 
the administration of the experimental task. 

Familiar-Neutral (FN). On three occasions the 12 
subjects in this condition were taken from the class- 
room in two groups of 6 subjects each. The three 
sessions were separated by intervals of 1 week. Each 
group of 6 subjects was taken to an empty class- 
room and given drawing paper and attractive art 
materials consisting of pastels in 12 colors in 
Session 1, felt-tip markers in 6 colors in Session 2,
and presto paints in 6 colors in Session 3. 

The instructions were as follows: 


Hello. As you remember [Sessions 2 and 3 only] 
my name is Miss and I’ve brought some 
things for you to have some fun with. I will 
give each of you some of these pastels [markers,
paints] and some paper and you can make a
picture of anything you like. You can begin as 
soon as you get some materials. 


Following the instructions the experimenter dis- 
tributed the art materials and told the subjects that 
she was going to be busy at her desk (located in the 
front of the room) and requested that they work 
quietly until it was time to return to their classroom. 
Any further comments by a child were reacted to 
minimally or not at all. Questions were answered
briefly and the child was reminded to work quietly 
because the experimenter was very busy. If the 
children attempted to interact among themselves, the 
experimenter again requested that they work quietly. 
Very few children made more than one attempt to 
interact with the experimenter, and very little effort 
was required to keep the children quiet.

Familiar-Positive (FP). The initial procedure for
this group was the same as for the FN group. Following
the instructions, however, the experimenter
responded at some length to all questions and comments
as she passed out the materials. As the
children began to work, the experimenter approached 
each boy individually and talked with him about 
what he was drawing. The experimenter then continued
to interact with each subject attempting to
establish a warm, positive relationship by being 
complimentary, helpful, and responsive. By the end 
of the three sessions the experimenter was employ- 
ing the subjects’ first names in her interactions with 
them. One boy in this group was absent for Session 3
but was retained in the study. Another boy was 
withdrawn from school during the course of the 
study; and this group, therefore, had 11 rather than 
12 subjects. 


Experimental Game (Marble-in-the-Hole)


The experimental game was a two-part satiation 
task consisting of a simple monotonous repetitive 
game called Marble-in-the-Hole which has been de- 
scribed previously (Zigler, 1961, 1963a). The game 
was made up of a wooden box having two holes 
on top. Inside the box was a chute connecting the 
holes with a single opening at the bottom of the 
box. The opening was filled with green and yellow 
marbles, thus insuring the subject a steady supply 
of marbles. The apparatus was automated so that 
it recorded the length of time the child played and 
the number of marbles he inserted. The subject’s 
task was to insert a marble of one color into one 
hole and a marble of another color into the other 
hole. 

Part I. The game was placed in front of the 
subject with the experimenter directly behind it 
and facing the child. The experimenter said:


Hello. As you remember [with FN and FP 
subjects only] my name is Miss , and we 
are going to play some games today. This is a game 
we call Marble-in-the-Hole. I’ll tell you how to 
play it. You see these marbles. Some of them are
green and some of them are yellow. They go in 
these holes. The green ones go in this hole [the 
experimenter’s left] and the yellow ones go in this 
hole [the experimenter’s right] [the experimenter 
pointed to the appropriate holes]. Now show me 
a green marble. Put it in the hole it goes in. Now 
show me a yellow marble. Put it in the hole it 
goes in. You can put as many marbles in the holes
as you want to. You tell me when you want to 
stop. Remember, when you want to stop just 
tell me. OK, ready? Begin. 


The subject then played the game until he indicated
that he wished to stop, either by telling the
experimenter he wanted to stop or by not inserting
a marble for 30 seconds. A 15-minute time limit was
used.

Part II. After the subject indicated that he wished 
to stop, the experimenter said: 


Now I'll tell you how to play this game. This
time we put the yellow marbles in this hole [the
experimenter’s left] and the green marbles in this
hole [the experimenter’s right]. Put a yellow
marble where it goes. You can put as many
marbles in the holes as you want to. You tell me
when you want to stop. Remember, when you
want to stop just tell me. OK, ready? Begin.


The subject again played the game until he indicated
that he wished to stop, either by telling the
experimenter he wanted to stop or by not inserting
a marble for 30 seconds. A 15-minute time limit
again was employed.


TABLE 1
PERFORMANCE OF THE THREE GROUPS ON THE EXPERIMENTAL GAME

                                             Mean time
                           Part I               Part II               Total           Part I −   (Part I − Part II)/
Group               N   Minutes  Log minutes  Minutes  Log minutes  Minutes  Log minutes  Part II    (Part I + Part II)

Stranger           12    1.22      .3095       1.31      .3193       2.53      .6288      −.10          .0071
Familiar-neutral   12    6.72      .7327       2.85      .4702       9.57     1.2029      3.87          .3203
Familiar-positive  11    7.96      .8725       5.39      .6815      13.35     1.5540      2.56          .2455


Procedure 


The St subjects were administered the Marble- 
in-the-Hole game prior to any contact between 
the experimenter and FN and FP subjects. This was
done so that none of the St subjects would become
familiar with the experimenter by seeing her in the
school or by hearing about her from subjects in
the other groups. The FN and FP subjects were
administered the Marble-in-the-Hole game 1 week
following their third experimental session.

FIG. 1. Mean log time spent by the three groups in
Part I and Part II of Marble-in-the-Hole.

The subjects were tested individually in a small 
conference room. The subject was verbally reinforced 
twice a minute for as long as he played either 
part of the game. (The decision to verbally reinforce 
the child rather than to employ attention alone 
as a reinforcer was made in light of studies which 
have indicated that an attention-only condition has 
differential effects depending upon the reinforcement 
condition that has preceded it—Crandall, 1963; 
Stevenson & Snyder, 1960. While it might appear 
that an attention-only condition would provide the 
purest measure of the child’s reaction tendencies, 
the particular contrast effects introduced by such 
a procedure militated against its use.) Reinforcements
were administered approximately at the 15-second
and 45-second points within each minute. Five
statements were used: “You're doing very well,” 
“That’s very good,” “You know how to play this 
game very well,” “That’s fine,” and “You're really 
good at this game.” These statements were made 
in a predetermined random order established sepa- 
rately for each subject. The experimenter was warm 
and friendly, smiling and nodding when administer- 
ing the praise; but she did not respond to any at- 
tempts by the subject to engage her in conversation. 
If the subject dropped a marble on the floor, he
was told not to pursue it but simply to continue 
playing the game. 
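
The reinforcement schedule described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the seed argument are hypothetical, and the rule that the five statements are reshuffled in blocks of five once exhausted is an assumption, since the text says only that the order was predetermined, random, and established separately for each subject.

```python
import random

# The five praise statements listed in the Procedure section.
STATEMENTS = [
    "You're doing very well",
    "That's very good",
    "You know how to play this game very well",
    "That's fine",
    "You're really good at this game",
]


def reinforcement_times(play_seconds):
    """Seconds at which praise is due: the 15- and 45-second points of each minute."""
    times = []
    for minute in range(play_seconds // 60 + 1):
        for offset in (15, 45):
            t = 60 * minute + offset
            if t <= play_seconds:
                times.append(t)
    return times


def schedule(play_seconds, seed):
    """Pair each reinforcement time with a statement.

    Assumption: statements are drawn in shuffled blocks of five, with a
    separate random stream (seed) for each subject.
    """
    rng = random.Random(seed)
    times = reinforcement_times(play_seconds)
    order = []
    while len(order) < len(times):
        block = STATEMENTS[:]
        rng.shuffle(block)
        order.extend(block)
    return list(zip(times, order))


# A subject who plays for two minutes is praised four times.
for second, statement in schedule(120, seed=1):
    print(second, statement)
```

A subject who reached the 15-minute limit on one part would, under this sketch, receive 30 reinforcements on that part.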


RESULTS 


The three groups’ time scores are presented 
in Table 1 and Figure 1. Since the means 
and variances of these scores were correlated, 
a logarithmic transformation of the time 
scores [log (X + 1)] was made. The log
times spent by each group on each part 
of the game were subjected to a Lindquist 
Type I analysis of variance (Lindquist, 
1956). The method of unweighted means was 
used to handle the unequal number of subjects
in the three groups. The results of this
analysis are presented in Table 2. 
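
As a rough illustration of the transformation, the sketch below applies log (X + 1) to a set of playing times. Base 10 is an assumption (the means in Table 1 are consistent with it), the function name is hypothetical, and the playing times are invented for the example and are not the study's data.

```python
import math
import statistics


def log_time(minutes):
    """Transform a playing time in minutes to log10(minutes + 1).

    Adding 1 keeps a time of zero defined and maps it to zero.
    """
    return math.log10(minutes + 1.0)


# Hypothetical, positively skewed playing times (minutes) for one group.
raw = [0.5, 1.0, 1.5, 2.0, 9.0, 15.0]
transformed = [log_time(x) for x in raw]

# The transformation pulls in the long right tail, which is what
# weakens the tie between group means and group variances.
print(statistics.mean(raw), statistics.variance(raw))
print(statistics.mean(transformed), statistics.variance(transformed))
```

Because the logarithm is concave, the mean of the transformed scores is always at or below the log of the mean raw score, which is why the mean log minutes in Table 1 cannot be recovered by transforming the mean minutes.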

A Pearson r of .977 was found between the 
subject’s total time scores and the total 
number of marbles inserted. An analysis of 
the number-of-marbles-inserted data resulted 
in findings highly similar to those found with 
the time scores. 

As can be seen in Figure 1, the significant 
groups effect supported the prediction con- 
cerning how the groups would differ in the 
total time spent on the game. Further analyses 
of the total log-time scores using the between- 
subjects mean square as the estimate of error 
variance revealed that each group differed 
significantly from the other two. The FP 
group played significantly longer than both
the FN (t = 2.23, p < .05) and St (t = 5.88,
p < .001) groups. The St group also differed
significantly from the FN group (t = 3.73,
p < .001). The significant Parts effect reflected
the overall tendency of the three
groups to play Part I (M log = .6315) longer 
than Part II (M log = .4849). However, the 
St group did not contribute to this effect; 
and it was the failure of this group to decrease 
from Part I to Part II that resulted in 
the significant Conditions × Parts interaction.
The performance pattern of the three groups 
on the two parts of the game supported the 
prediction that the two familiar groups would 
evidence a greater decrease from Part I to 
Part II than would the St group. 
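
The reported statistics can be checked against the tabled values with a short sketch. The figures below are transcribed from Tables 1 and 2, reading the Familiar-neutral Part I log mean as .7327 (the value implied by the Total column). The formulas are assumptions about how the reported values were obtained: each F is taken as effect mean square over its error mean square, each t as the difference in total log means over the square root of MS(between subjects) × (1/n1 + 1/n2), and the Part means as N-weighted averages. All names are hypothetical.

```python
import math

# Values transcribed from Tables 1 and 2 (log-time scores).
N = {"St": 12, "FN": 12, "FP": 11}
LOG_PART_1 = {"St": 0.3095, "FN": 0.7327, "FP": 0.8725}
LOG_PART_2 = {"St": 0.3193, "FN": 0.4702, "FP": 0.6815}
LOG_TOTAL = {"St": 0.6288, "FN": 1.2029, "FP": 1.5540}
MS_CONDITIONS, MS_BETWEEN = 1.2622, 0.1422
MS_PARTS, MS_INTERACTION, MS_WITHIN = 0.3761, 0.1190, 0.0278


def f_ratio(ms_effect, ms_error):
    """F as the ratio of an effect mean square to its error mean square."""
    return ms_effect / ms_error


def t_between(group_a, group_b):
    """t for two groups' total log means, using MS between subjects as error."""
    diff = LOG_TOTAL[group_a] - LOG_TOTAL[group_b]
    se = math.sqrt(MS_BETWEEN * (1 / N[group_a] + 1 / N[group_b]))
    return diff / se


def pooled_mean(values):
    """N-weighted mean of per-group means."""
    return sum(N[g] * values[g] for g in N) / sum(N.values())


# Conditions is tested against between-subjects error; Parts and the
# interaction are tested against within-subjects error.
print(f_ratio(MS_CONDITIONS, MS_BETWEEN))
print(f_ratio(MS_PARTS, MS_WITHIN), f_ratio(MS_INTERACTION, MS_WITHIN))
print(t_between("FP", "FN"), t_between("FP", "St"), t_between("FN", "St"))
print(pooled_mean(LOG_PART_1), pooled_mean(LOG_PART_2))
```

Under these assumptions the sketch returns values within rounding of the reported F ratios, the three t values, and the Part I and Part II log means quoted in the text.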

The prediction concerning the decrease 
from Part I to Part II was tested further 
through a direct examination of difference 
scores for each subject. A variation of 
Kounin’s (1941) cosatiation index was em- 
ployed to compute relative difference scores 
(Part I − Part II)/(Part I + Part II). Mean
relative difference scores for each of the three
groups are presented in Table 1. A simple
one-way analysis of variance of these scores
was significant (F = 3.64, df = 2/32, p < .05). Further
analyses revealed that the FN group had
a significantly (t = 2.59, p < .02) greater decrease
from Part I to Part II than did the
St group. The difference between the FP and
FN groups was not significant (t < 1), while
the difference between the FP and St groups
reached a borderline level of significance
(t = 1.93, p < .10).
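
The relative difference score lends itself to a brief sketch. The function names and the per-subject times below are hypothetical and serve only to show how the score behaves; whether the scores were computed on minutes or on transformed times is not stated, and minutes are assumed here.

```python
def relative_difference(part1, part2):
    """(Part I - Part II) / (Part I + Part II) for one subject.

    Positive when Part I was played longer, negative when Part II was
    played longer, and zero when the two parts were played equally long.
    """
    total = part1 + part2
    if total <= 0:
        raise ValueError("total playing time must be positive")
    return (part1 - part2) / total


def group_mean(scores):
    """Mean of the per-subject scores for one group."""
    return sum(scores) / len(scores)


# Hypothetical (Part I, Part II) playing times in minutes.
subjects = [(6.0, 2.0), (2.0, 6.0), (3.0, 3.0)]
scores = [relative_difference(p1, p2) for p1, p2 in subjects]
print(scores, group_mean(scores))
```

Because the score is computed for each subject before averaging, a group's mean score need not equal the same ratio formed from the group's mean times, which is why the last two columns of Table 1 are not simple functions of each other.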
An examination was made of the number
of subjects in each group who played Part II 
longer than Part I. As noted earlier, this 
pattern has been found in socially deprived 
children and in normal children who had 
negative experiences with an adult prior to 
being reinforced on the Marble-in-the-Hole 
game. The number of children playing Part II
longer than Part I in the St, FN, and FP
groups was seven, one, and two, respectively.
The familiar groups were combined and a
2 × 2 contingency table was set up. Employing
the Fisher exact test, this difference was
found to be significant (p < .01). However,
one subject in each of the familiar groups
played both parts of the game for the total
time allowed (15 minutes). Since it was possible
that these two subjects might have
played Part II longer than Part I had longer
playing times been permitted, the analysis
reported immediately above was recomputed
with these two subjects excluded. The resulting
difference remained significant (p < .02).
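
The contingency analysis can be sketched from the counts given in the text: 7 of 12 St subjects versus 3 of the 23 combined familiar subjects played Part II longer than Part I. A one-tailed test is assumed here, since the text does not state the directionality used, and the function name is hypothetical.

```python
from math import comb


def fisher_upper_tail(a, b, c, d):
    """One-tailed Fisher exact probability for the table [[a, b], [c, d]].

    Returns the hypergeometric probability, with all margins fixed, of a
    count in the first cell at least as large as the observed a.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    max_a = min(row1, col1)
    total = comb(n, row1)
    tail = sum(
        comb(col1, k) * comb(n - col1, row1 - k) for k in range(a, max_a + 1)
    )
    return tail / total


# Rows: St, familiar (FN + FP). Columns: Part II longer, Part II not longer.
p_all = fisher_upper_tail(7, 5, 3, 20)

# Dropping the two familiar subjects who played both parts to the time
# limit removes two cases from the "not longer" cell.
p_trimmed = fisher_upper_tail(7, 5, 3, 18)

print(round(p_all, 3), round(p_trimmed, 3))  # about .008 and .013
```

Under the one-tailed assumption both probabilities fall on the reported sides of .01 and .02, respectively.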


DISCUSSION 


The findings of the present study clearly 
indicate that the nature of the relationship 
between a child and an adult influences how
effective the adult will be in reinforcing the
child’s behavior. The experimenter was found
to be significantly more reinforcing in the
FP than in the FN condition. In turn, the 
experimenter was significantly more rein- 
forcing in the FN than in the St condition. 
The significant differences between the groups 
in the decrease in playing time from Part I 
to Part II lent support to the view that one 
factor resulting in an increase in an adult’s 
effectiveness as a social reinforcer is a decrease 
in the child’s negative-reaction tendency, that
is, wariness of and/or reluctance to interact
with an adult. However, the total pattern of
performance of the three groups on the two
parts of the game indicated that the enhanced
effectiveness of the adult cannot be attributed
solely to the reduction of the negative
tendency.


TABLE 2
ANALYSIS OF VARIANCE OF LOG-TIME SCORES

Source              df      MS        F

Conditions (A)       2    1.2622    8.87**
Between subjects    32     .1422
Parts (B)            1     .3761   13.51**
A × B                2     .1190    4.28*
Within subjects     32     .0278

*p < .025.
**p < .001.

If a reduction in the negative tendency 
were the only pertinent factor, then the FP 
group would show a greater decrease from 
Part I to Part II than would the FN group. 
However, the FP group not only played the 
total game longer than the FN group but also 
did not evidence any greater decrease from
Part I to Part II. The overall findings thus
suggest that the increased effectiveness of the
adult in the FP condition was due to both a
reduction in the negative-reaction tendency
and to an increase in the positive-reaction
tendency. It would appear that the reinforcers
dispensed by the experimenter take on increased
reinforcement value if the adult has
been associated with warm, positive experi- 
ences in the child’s recent past. This seems to 
be such a straightforward explanation that 
one is tempted to explain all the findings in 
terms of a differential increase in the positive 
tendency. 

Thus, one could argue that the differences
in total time found between the three groups
reflect differences in the positive-reaction
tendency and that the large decrease between
Part I and Part II found for the familiar
groups reflects nothing but fatigue and other
satiation effects caused by playing Part I of
the game so long. However, the finding that
over half of the subjects in the St group played
Part II longer than Part I constitutes strong
evidence against this simpler view. The
very nature of the game is such that one 
would expect every child to play the second 
part of the game for a shorter period than the 
first due to satiation and fatigue effects. No 
argument employing the positive tendency 
alone is capable of encompassing the finding 
that the majority of the children in the St 
group played Part II longer than Part I. As 
demonstrated by Shallenberger and Zigler 
(1961), playing the second part longer than 
the first is the clearest indication that a 
negative-reaction tendency is present. It 
would thus appear that the most appropriate
conclusion derivable from the findings is that
the treatment conditions differentially affected 
both the positive- and negative-reaction tend- 
encies of the groups. 

The results of the present study appear 
capable of shedding light on the effectiveness 
of parents versus strangers as social reinforc- 
ers. If it is assumed that parents have been 
more frequently associated with punishing 
events than have strange adults, then one 
would expect the negative-reaction tendency 
elicited by parents to be higher than that 
elicited by strange adults. It would be quite 
possible for this negative tendency to inter- 
fere with the parent’s effectiveness as a rein- 
forcer and given our usual experimental pro- 
cedures, make him appear less effective as a 
social reinforcer than he really is. Given 
differences in the age of their children, their 
procedures, and the actual parents employed, 
it is not surprising that investigators have 
found that parents can be either negative or 
positive reinforcers (Patterson et al., in press; 
Stevenson et al., 1963). Certainly the context 
in which the parent serves as a reinforcer for 
his child and the particular child-rearing prac- 
tices that have been employed by that parent 
would be factors influencing the magnitude of 
the child’s negative-reaction tendency during 
the experimental task. 

The findings of the present study also ap- 
pear to be related to those of Sacks (1952) 
who investigated the effect of treatment con- 
ditions very similar to those employed in the 
present study on intelligence test perform- 
ance. Three groups of children were tested. 
Following this, one group had positive inter- 
actions with the examiner, a second group had 
interactions similar to those employed in our 
neutral condition, and a third group had no 
interactions with the examiner. Upon retest, 
Sacks found that subjects in the first group 
showed the greatest increment; subjects in 
the second group the next largest increment; 
and those in the control group, the smallest 
increment in mean IQ. These findings suggest
that negative-reaction tendencies evoked by a 
strange examiner interfere with children’s test 
performance. Very high negative-reaction 
tendencies could explain the large discrepancies
that have been reported (Sarason & Gladwin,
1958) between the test performance of
institutionalized retardates, typically obtained
while the child is interacting with a stranger, 
and the retardate’s day-to-day capabilities, 
typically assessed through the child’s inter- 
actions with familiar adults. Further investi- 
gations of the relative magnitudes of the 
positive- and negative-reaction tendencies re- 
ferred to in this paper should also aid us in 
our understanding of the autistic child as well 
as illuminating the current controversy over 
whether extreme social deprivation leads to 
an increase in the interaction of the socially 
deprived individual or whether such depriva- 
tion leads to apathy and/or withdrawal (Cox, 
1953; Goldfarb, 1953; Irvine, 1952; Witten- 
born & Myers, 1957). 

One final note is in order. Throughout this
study longer playing time of the child has 
been interpreted as being due to the greater 
reinforcer effectiveness of the adult. The ar- 
gument can be made that while longer playing 
time may reflect a heightened positive tend- 
ency towards the adult, it does not neces- 
sarily indicate that the adult has taken on 
greater social reinforcer effectiveness. The 
central issue here is whether the adult is 
effective only in maintaining the child’s inter- 
action with him or whether the adult has 
acquired a generally enhanced reinforcer 
effectiveness in respect to specific responses of 
the child. The investigation of the relationship 
between the child’s positive or negative feelings
towards an adult and that adult’s effectiveness
in shaping various behaviors of
the child would appear to be an inviting area
of future research.

It was predicted that Ss facing the prospect of poor task performance would
attempt to bias the task supervisor through ingratiation (conforming to him,
emphasizing self-attributes admired by him, flattering him) if the supervisor
had some freedom to modify his standards or their application. The form of
this ingratiation was expected to hinge on the expressed values of the
supervisor. The results showed that opinion conformity was affected by the
supervisor’s stated values and this was true only when the supervisor was
potentially open to influence. Also, in the open-to-influence conditions Ss
tried to impress the supervisor by presenting themselves as strong and
competent. There was no evidence of differential flattery as a function of
experimental treatment.

The worker-supervisor relationship may be,
and fruitfully has been (for example, Thibaut
& Kelley, 1959), conceived as a power relationship
in which an exchange of behavior
products is involved. Put most simply, the
worker exchanges his productive effort for
such tangible rewards as wages or such intangible
ones as approval and promotion prospects.
The supervisor in various ways controls
the giving of these rewards so as to provide
optimal incentives for effective worker
performance. The supervisor is said to have
superior power essentially because he can help
or hurt the worker more than the latter can
help or hurt him. Unless his talents are in
great demand elsewhere and barriers to mobility
are minimal, the worker is more dependent
on the supervisor than vice versa.

Because of his relative dependence, the
worker is motivated to comply with the supervisor’s
directives and must learn and accept
the latter’s criteria of performance evaluation.
Having done so, he is then in a position
to maintain his outcomes at a high level by
performing in line with these criteria. By
mastering the task, then, the worker in effect
exerts counterpower on the supervisor and
may narrow the range of outcomes received to
those at the positive end of the potential
span of outcomes.

While the worker may thus effectively con- 
trol the supervisor through his own opera- 
tions within the task system, this means of 
control or counterpower is not always avail- 
able. A worker who lacks appropriate talent 
might be unable to meet the supervisor’s 
standards, or those standards might them- 
selves be so ambiguous that the route to 
positive outcomes is obscure. In such circum- 
stances, we might expect to find attempts to 
control the supervisor occurring outside the 
task system itself (cf. Blau, 1954). Thus the 
ineffective or bewildered worker might at- 
tempt to present himself in a positive manner 
to bias the supervisor in his favor. This would 
effectively increase the worker’s outcomes 
only when the supervisor was himself un- 
certain about good and bad performance or 
otherwise free to choose among competing 
standards in evaluating a complex task. 

The present investigation was directed to 
some of these alternative ways of influencing 
a supervisor. It attempted to explore some 
of the conditions affecting the adoption of 
attraction-seeking strategies in a perform- 
ance setting. For convenience these strategies 
are referred to under the rubric of ingratia- 
tion, which refers to those witting and un- 
witting attempts to manage an impression so 
as to increase one’s attractiveness to a particular
other (Jones, 1964). The basic
position of the present paper is that such 
strategies of ingratiation may be identified in 
work situations where the supervisor cannot 
be readily controlled through the task system, 
and the supervisor has some freedom to de- 
velop and modify his standards of performance 
evaluation. The specific form that ingratiation 
takes is expected to be a function of the 
cues provided by the supervisor concerning 
his own values and interpersonal orientation. 

The psychology of ingratiation has only 
recently begun to receive attention as an 
experimental problem. In a study closely re- 
lated in method to the present ones, Jones, 
Gergen, and Jones (1963) investigated the 
differential use of ingratiation tactics as a 
function of status in an undergraduate ROTC 
unit. In that study, experimentally induced 
pressures to make an attractive impression 
led all subjects to show greater opinion con- 
formity to the target, led the high-status sub- 
jects to become more modest in describing 
themselves, and the low-status subjects to 
become more favorable in their ratings of 
the target. These same three tactical classes 
—opinion conformity, self-presentation, and 
other enhancement—were included as depend- 
ent variables in the present study. While these 
types of communication do not exhaust the
tactical possibilities of ingratiation, they
cover an extensive proportion of the terrain
and provide convenient possibilities for 
quantitative measurement. 3 

Having thus set the stage and introduced 
the players, we may now consider in more
concrete detail the problems and opportunities 
created in the present experiment. Picture a 
worker who has just been introduced to a 
rather vague, complex, and ambiguous task. 
The task involves a series of similar problems 
which he is required to solve on a trial-by- 
trial basis. He quickly learns that the prob- 
lems are soluble, but his own prospects for 
solving them are clearly not good: a super- 
visor, whom he has only briefly met, provides 
a pattern of outcomes indicating poor per- 
formance over a series of practice problems. 
Through a plausible experimental arrange- 
ment, the worker is led into a controlled inter- 
action with the supervisor. They exchange 
opinions about important and trivial issues,
exchange self-ratings, and end up trading
evaluative ratings of each other. This con- 
trolled interaction provides an opportunity 
for the worker to try to impress the supervisor 
if he is so inclined. What conditions are likely 
to entice the worker to put this opportunity 
to use? 

As forecast by the foregoing discussion, the 
degree of supervisory freedom in developing 
and applying standards of evaluation seems 
critical. Thus in the experiment the worker 
either learns that the supervisor is free to 
decide after each trial whether a solution is 
correct, or that he is committed to a series 
of problem solutions which he has worked out 
in advance. In the former case there exists a 
possibility of biasing the supervisor’s decisions 
in the worker’s favor; no such possibility 
exists in the latter case. The general predic- 
tion would be, then, that ingratiation should 
occur when the supervisor is potentially open 
to influence but not when he is restricted to 
preestablished criteria in his decisions to 
reward or punish. 

But if ingratiation does occur, what form 
is it likely to take and how may it be identi- 
fied? Common experience suggests that in- 
gratiation attempts occur when the prospects 
for success (that is, creating an attractive 
impression) outweigh the risks of failure. 
These prospects are determined both by the 
surrounding conditions of interaction and the 
apparent characteristics of the target person.
Furthermore, the characteristics of the target
person should actually shape the form of
ingratiation once it occurs. In the present
experiment, the worker received one of two
patterns of preinformation about the supervisor’s
beliefs and values. The worker was
either led to believe that the supervisor, in 
his leader role, especially valued togetherness, 
accommodation, and worker solidarity or that 
he was only concerned with effective perform- 
ance and cared not about the frills of 
group cohesiveness and personal styles which 
contribute to it. 

Knowing this much, we can predict some 
of the more likely variations in impression 
management as a function of the supervisor's
openness to influence and his personal views
about the leadership role. The most confident
prediction is that opinion conformity or
agreement between worker and supervisor
should be greatest when the latter is open 
to influence and values accommodation, least 
when he is open to influence but competence 
and performance are admired. Under the 
latter circumstances, a show of independence 
provides some evidence of competence through 
the associative relationship which presumably 
links competence to assertiveness and self- 
confidence. When the supervisor is not open 
to influence, conformity should be moderate 
and more in the nature of cognitive readjust- 
ment than instrumental opinion change. 

The prediction with respect to self-presen- 
tation was not so clear. The most reasonable 
expectation was that those subjects exercising 
the ingratiation option would show that they 
too valued whatever personal attributes were 
stressed by the supervisor and attempt to 
convince the supervisor that they possessed 
these attributes in fair degree. While the first 
part of this prediction was open to straight- 
forward test (subjects were asked to check 
those attributes they considered important), 
the second part raises difficulties. It seems 
likely that blatant self-advertisement is not 
very winning as an ingratiation tactic, and it 
is difficult for a person to establish his affability
or his competence through self-declaration
alone. With trepidation, then, it was predicted
that subjects would describe themselves more 
positively on attributes admired by the super- 
visor than on attributes belittled by him, and 
that this tendency would be especially strong 
when the supervisor was potentially open to 
influence. 

With respect to the third dependent variable,
other enhancement, it was predicted that
subjects in the open-to-influence conditions 
would be more favorable in their supervisor 
ratings than those in the closed conditions. 
This was expected to be especially true on 
those personal attributes known to be prized 
by the supervisor. 


METHOD 
Subjects 


A total of 50 undergraduate male volunteers
participated as experimental subjects. Recruited from
the introductory psychology course at the University
of North Carolina, all but 7 of these subjects had
previously completed a Campus Opinion Survey in
a group testing session. This questionnaire served
as the base-line measure for inferring opinion conformity
in the experiment, and thus analysis on this
dependent variable was perforce restricted to those 
who had completed the survey. All data from 4 of 
the remaining subjects were discarded either because 
the subjects were clearly suspicious or because they 
obviously misunderstood experimental instructions. 
A total of 39 subjects were thus available for investi- 
gating opinion conformity, while 46 subjects provided 
usable data for the remaining analyses. 
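
The sample sizes reported above can be checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes (as the wording implies) that all 4 discarded subjects were among those who had completed the Campus Opinion Survey.

```python
# Illustrative bookkeeping for the sample sizes given in the Subjects section.
# Variable names are hypothetical; the figures are taken from the text.
total_recruited = 50
missing_survey = 7   # subjects with no base-line Campus Opinion Survey
discarded = 4        # suspicious, or misunderstood the instructions

completed_survey = total_recruited - missing_survey

# Assumption: every discarded subject had completed the survey.
n_conformity = completed_survey - discarded   # opinion-conformity analysis
n_other = total_recruited - discarded         # all remaining analyses

print(n_conformity, n_other)
```

Under that assumption the two totals agree with the 39 and 46 subjects reported in the text.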


Procedure 


Subjects appeared for the experiment in pairs and 
were introduced to an experimental accomplice 
identified as a graduate student in the School of 
Business Administration. The subjects were informed 
that they were to be players in a game designed to 
simulate the features of a real business concern. 
The graduate student was to serve as the supervisor 
in the game. He was to evaluate the performance of 
the two subjects and to give or take away points 
accordingly. 

The experimenter then explained that the period 
would be divided into three parts. First there was 
to be a practice session in which the subjects would 
learn how the game was played. Then there was 
to be a “get-acquainted” session in which the sub- 
jects and the supervisor would get to know each 
other a little better as persons. Finally, the game 
was to be played for points. While the final game 
was continually referred to as the climax of the 
experiment, the task toward which everything else 
was to lead, it was not actually played. 

Variation in conditions of supervisory judgment. 
All subject pairs were then told that the supervisor 
had previously looked over some relevant literature
in preparation for his judging task. However, for
approximately half of the pairs the experimenter
emphasized both to the subjects and to the supervisor
that the information was general in its implications
and, he said, turning to the supervisor,


these materials . . . won't give you the specific 
information you need in order to evaluate the 
other’s decisions in the game. You'll just have to 
wait until you hear each worker’s answer each 
time, and then use your best judgment in awarding 
points. 


Because of the implied freedom of the supervisor to 
make on-the-spot decisions, this instructional varia- 
tion will hereafter be called the open-judgment, or 
merely open condition. 

The remaining subject pairs were told, on the other 
hand, that the materials which the supervisor had 
been studying prior to their arrival provided much 
of the data on which the answers could be based. 
In addition, subjects in this, the closed-judgment or
closed condition were told, 


He has already recorded his answers to each of 
the practice problems and also to each of the 
problems in the game to be played for real... . 
These will serve as the correct answers and each
of your responses will be judged right or wrong
in terms of whether they match the supervisor’s 
sheet or not. 


After the open or closed variation had been induced, 
the supervisor was taken to an adjoining room to 
finish looking over the materials. 

Practice session. The experimenter then proceeded 
to outline the remainder of the procedure. The sub- 
jects would be placed in individual booths and each 
would be provided with a manual of practice prob- 
lems, a set of message forms, and a scoring sheet. 
The manual would contain 14 sets of four advertising 
slogans, each set consisting of slogans which con- 
cerned a different consumer product (for example, 
neckties, washing machines, skin cream). It would
be the task of each subject to rank the four slogans 
in each set in terms of their probable effectiveness 
in increasing sales for the product in question. This
ranking was to be recorded on a message blank to 
be carried by the experimenter to the supervisor. 
The latter would then communicate to each subject, 
via a microphone, the number of points won or lost 
on that trial. Although the subjects would not be
competing in any formal way, each would be allowed
to hear the number of points he won or lost as
well as the outcome of the other subject. 

In order to provide a meaningful incentive for 
doing well in the game, the subjects were further 
told that the research was sponsored by the American 
Advertising Council, and that each subject would 
be paid according to the number of points he re- 
ceived during the game. Each subject would be 
credited with $1 at the outset, but each could win 
up to $10 if the supervisor approved of his choices.
It was emphasized that the supervisor had not been
informed about the monetary significance of the
points he would award, allegedly because this might
cause him to be overgenerous in his ratings. As the
game proceeded, the subjects were to use the scoring
sheets to keep abreast of their gains and losses.

The subjects were then led to their respective 
booths and each was assigned a letter by which he 
was to be identified during the remainder of the 
procedure. Both subjects were assigned the letter A
and both were led to believe that the other subject 
was B. The sets of slogans in the practice session 
were of two distinct types, and the scoring sheets 
were organized to emphasize the fact: one group of 
slogans were for use in magazine and newspaper 
media, the remainder were for use with radio and 
television. As the subjects proceeded through the
sets of slogans, each found (by prearrangement) 
that he was earning many points on the radio and 
television slogans but actually losing on the magazine 
and newspaper items. In addition, each was led to
believe that the other subject was doing well on the 
magazine and newspaper slogans but doing poorly on 
the radio and television items. At the end of the 
practice session the experimenter announced that 
because of time limitations the actual game would 
concern only one of the two types of slogans. By 
a rigged coin toss he then indicated that in the real
game only magazine and newspaper items would
be used. 

This elaborately controlled sequence of events was 
designed to serve a number of purposes. The subject 
had to be led to expect that he would do poorly 
in the real game and thus could not effectively control
his outcomes by successful task performance. The
fact that the other subject would probably do well 
in the real game presumably added some edge to the 
subject’s motivation. And yet, each subject actually 
had done as well as the other on the average item 
of the practice game; while he was inferior on one 
sort of item, he was superior on the other sort. 
It was hoped that this arrangement would serve 
to minimize the personal failure implications of 
performance on the practice problems and inhibit 
such defensive reactions as autistic distortion of 
expected outcomes. Since his prospects were in the
end determined by an arbitrary coin toss, there was 
presumably no ready target for blame and the 
subject was free to make the best of a bad situation. 

Get-acquainted pseudointeraction. As indicated 
above, a get-acquainted message exchange was inter- 
posed between the practice game and the “real” game 
which was allegedly to climax the experiment. This 
exchange of personal information was introduced, 
according to the experimenter, in order to make the 
game a more valid simulation of a real business 
organization where 


performance . . . involves many complicated social 
factors. An organization can be set up to work 
in a certain way, but how it actually works 
is often affected by how the people who work 
together feel about each other, their mutual 
understanding, and so on. 


The experimenter attempted to justify the artificial 
arrangement for the interaction by arguing that a 
controlled exchange of messages would make it easier 
to keep track of the “acquaintance process,” and 
insure that each subject had an equal opportunity 
to give the supervisor information about himself.

To begin the get-acquainted session, the experi- 
menter told the subjects that they would be able to 
listen over the sound system as he asked the supervisor
a few questions about himself. After this, each
subject would be given detailed instructions about 
an exchange of written messages between himself 
and the supervisor. The experimenter then left the 
room and turned on a tape recording of an interview 
between himself and the supervisor. Cross-cutting the 
variation in conditions of supervisory judgment 
(open versus closed), a variation in the supervisor's 
expressed values about leadership was introduced at 
this point. The general format of the interview was 
the same in all conditions: the supervisor was asked 
to discuss his thoughts on the relationship between 
a supervisor and his worker, to state his views on
personnel selection in industry, and to comment on 
general criteria for determining salary differentials 
and promotion. The supervisor’s tape-recorded response
to these problems differed radically in the two
halves of the subject sample. 

Half of the subjects heard an interview in which
the supervisor continually emphasized the “human”
side of business. In his comments he stressed such
factors as the spirit of cooperation, the importance 
of getting along with others, considerateness, and 
understanding. Competence and performance motiva- 
tion were simply not mentioned in his discussion of 
relevant considerations, though they were not ex- 
plicitly belittled. In view of the emphasis on accom- 
modation and social solidarity pervading the inter- 
view, the supervisor in this condition is hereafter 
referred to as Sol. The remaining subjects were 
exposed to interview responses in which the super- 
visor stressed the quality and quantity of job per- 
formance above all else. The only important personal 
characteristics that should be considered in selecting 
and evaluating workers were those associated with 
talent and perseverance. A man’s thoughts and feel- 
ings and reactions off the job were clearly his own 
business. In view of the emphasis on performance 
and productivity characterizing the interview, the 
supervisor in this condition is hereafter referred to 
as Prod. The Sol-Prod variation was chosen to
reflect extreme versions of leader orientations often 
distinguished in the literature on role differentia- 
tion in small groups. Bales and Slater (1955), for 
example, contrast the 


person who symbolizes the demands of task accomplishment
and [the] person who symbolizes the
demands of social and emotional needs [p. 304] 


and suggest that this fundamental differentiation 
occurs in all social systems. 

The pseudointeraction which followed the inter- 
view actually involved an exchange of questionnaires 
between each subject and the supervisor. Since the 
questionnaire responses of the subject provided the 
main dependent variable data of the study, the 
nature of the questionnaires will be discussed in 
that context. 


Measures of Opinion Conformity, Self-Presentation,
and Other Enhancement


Each subject completed in the same sequence a
self-rating questionnaire to be transmitted to the
supervisor, a shortened version of the same opinion
questionnaire he had completed earlier in the
semester, and finally rated the supervisor on the
same questionnaire form used for the self-ratings.
He was led to believe that the supervisor was filling
out the opinion questionnaire while he, the subject,
was rating himself; the supervisor was filling out
his self-rating form while the subject was indicating
his opinions; and both were evaluating each other
during the final phase of the exchange. It is important
to remember that the subject was aware of the
supervisor’s opinions at the time he recorded his,
the subject was aware of how the supervisor had
rated himself before the subject rated him, and all
responses were made with the clear
understanding that they would be transmitted to
the supervisor.

The self-description rating form consisted of 20
pairs of attributive antonyms. Each personal attribute
was separated from its opposite by three
12-point scales. On the first of these scales the sub- 
ject was to rate himself, on the second he was to 
indicate his ideal along that dimension, and the 
third scale was to be left for the supervisor’s use. 
Subjects were also asked to check those antonym 
pairs which they considered “especially important 
personal characteristics.” While each item was quite 
clearly evaluative in connotation (that is, was com- 
posed of a favorable and an unfavorable antonym), 
the items fell in two predetermined content classes. 
Half of the items dealt with qualities connoting 
affability, warmth, or affectionateness (for example,
friendly versus aloof, likable versus not likable), 
while the other half were more obviously centered 
around judgments of respect (for example, efficient
versus inefficient, strong versus weak). Eighteen independent
raters had previously classified the items
as affability or respect attributes with near-unanimous
agreement on the items finally chosen.
The 20 items were presented in mixed order as far 
as content was concerned and the position of the 
more socially desirable attribute varied unsystemati- 
cally from left to right on the page. 

The opinion questionnaire consisted of 20 items 
taken from the 50-item Campus Opinion Survey 
which most of the subjects had completed earlier 
in the semester. From the initial battery of items, 
the 10 items judged by the class to be most 
“important” as a determiner of personal attractive- 
ness and the 10 judged to be least important were 
combined to comprise the final shortened opinion 
questionnaire. Each of the 20 items was separated 
by two 12-point scales along which agreement or 
disagreement could be expressed. When he received 
the opinion questionnaire, each subject found that 
the supervisor had already used the first of the two
scales to indicate his agreement or disagreement with
the item statement. The subject was thus to use 
the second scale and return the completed question- 
naire to the supervisor via the experimenter. 

The opinions presented as the supervisor’s had 
been especially prepared in advance for each sub- 
ject. Working from the subject’s own Campus 
Opinion Survey responses, the supervisor’s opinion
ratings were so arranged that: for half of the
important and half of the unimportant items the 
supervisor’s responses were the same as those which 
the subject had earlier given; on three of the impor- 
tant and three of the unimportant items, the super- 
visor disagreed with the subject’s previous opinions 
by 6 scale points; on two of the important and two 
of the unimportant items, the supervisor disagreed 
with the subject’s prior opinions by 3 scale points. 
The pattern of agreement and disagreement was the 
same for all subjects, so that the same item in the 
same position was always an agree, a 3-disagree, 
or a 6-disagree item. The overall effect of this pro- 
cedure was to confront the subject with opinions 
from the supervisor which agreed with his own on 
many of the items (assuming little change from the 
previous testing) but showed some or much dis- 
agreement on other statements varying in importance. 
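
The prearranged pattern of agreement and disagreement can be sketched in code. The following is a minimal illustration, not the authors' procedure: the item order in `PATTERN`, the direction in which the supervisor's rating departs from the subject's, and the randomly generated base-line ratings are all assumptions, since the text gives only the counts (5 agree, 3 at 6 points, 2 at 3 points per importance class) and the 12-point scale.

```python
import random

SCALE_MIN, SCALE_MAX = 1, 12   # 12-point agreement scale

# Hypothetical fixed pattern for the 10 items of one importance class:
# 5 agree (offset 0), 3 disagree by 6 points, 2 disagree by 3 points.
# The actual item-to-pattern assignment is not given in the text.
PATTERN = [0, 6, 0, 3, 0, 6, 0, 3, 6, 0]

def shift(rating, offset):
    """Move `offset` points away from the subject's rating, staying on the scale.

    The direction of disagreement is an assumption of this sketch.
    """
    if rating + offset <= SCALE_MAX:
        return rating + offset
    return rating - offset

def supervisor_ratings(important, unimportant):
    """Build prearranged supervisor ratings from one subject's base-line ratings."""
    assert len(important) == len(unimportant) == len(PATTERN)
    return ([shift(r, d) for r, d in zip(important, PATTERN)],
            [shift(r, d) for r, d in zip(unimportant, PATTERN)])

# Usage with made-up base-line data for one subject.
rng = random.Random(0)
base_imp = [rng.randint(SCALE_MIN, SCALE_MAX) for _ in range(10)]
base_unimp = [rng.randint(SCALE_MIN, SCALE_MAX) for _ in range(10)]
sup_imp, sup_unimp = supervisor_ratings(base_imp, base_unimp)

# Size of the discrepancy on each important item, smallest first.
gaps = sorted(abs(s - b) for s, b in zip(sup_imp, base_imp))
print(gaps)
```

Because the same pattern is applied to every subject, the distribution of discrepancies is identical across subjects even though the supervisor's ratings themselves differ from one subject to the next.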

The other description form was identical in appearance
to the self-description rating form. The
ideal and self-scales were already filled in, presum- 
ably by the supervisor, and the subject was left 
with the task of evaluating the supervisor on the 
same dimensions before the form was returned to
him. Again by prearrangement, Sol’s ideal and self-ratings
were slightly closer to the positive antonym
on the affability items than the respect items, and 
Prod’s ratings were slightly closer to the positive 
end on respect than affability items. The mean and
standard deviation of supervisor self-ratings (that
is, his overall favorability in rating himself) were the
same for both Sol and Prod.

A final questionnaire was also administered to 
inquire into the subject’s “private” impressions of 
the supervisor, his view of the experimental setting, 
and his awareness of opportunities for social influence.
The subjects were then told that the experiment
was completed, informed of the true purpose
of the experiment and of the deceptions involved, 
and requested not to discuss the experiment with 
classmates. Each received $2 for his cooperation.


RESULTS 
Effectiveness of Inductions


Open-closed judgment variation. Before ex- 
amining the results relevant to the main 
experimental predictions, it is first necessary 
to establish the extent to which the two 
independent variables were successfully ma- 
nipulated. The open-closed judgment variable 
was, of course, intended to have complex 
cognitive and motivational effects. To pursue 
all the evidence on these would carry us 
directly to an analysis of the pseudointer- 
action data. At a minimum, however, it is 
essential that the subjects properly understood 
and were able to recall the nature of the task
situation for which they were preparing
throughout the experiment. In the postexperi- 
mental questionnaire all subjects were asked 
to indicate whether the supervisor was to 
grade their answers (administer points) in 
terms of a set of previously developed solu- 
tions or whether he was free to decide on each 
trial whether a given answer merited credits 
or debits. A total of four subjects, in three 
conditions, were eliminated because they did 
not choose the alternative appropriate to their 
experimental condition, thus indicating gross 
inattention or confusion. The results for these 
subjects were in line with their incorrect 
self-assignment and not with the actual in- 
structions to which they had been exposed. 
The overall effect of the differential instruc- 
tions, then, was clearly as intended for more 
than 90% of the subjects. 

Going a step beyond this minimal valida- 
tion of the open-closed variation, we might ask 
whether the open condition in fact aroused 
more explicit concern with the impression 
created than did the closed condition. There 
is little indication from a set of four question- 
naire items that subjects were aware of such a 
concern and willing to report it to the experi- 
menter. Subjects in the two open conditions 
did express slightly more interest in being 
liked and admired by the supervisor, but not 
significantly more. It is not clear whether such
an interest is always difficult to admit, whether
judgmental openness never aroused self-conscious
desires to impress the supervisor, or whether such
desires were aroused initially but had declined by
the time of the postexperimental questionnaire. We
shall return to this question in considering, later,
individual differences in response to the
experimental inductions.

Supervisor values. While it might be expected
that the subjects’ impressions of the supervisor
would be influenced by their own behavior during
the experiment, gross differences in the pattern of
postexperimental ratings should still have
distinguished between personal attributes implied
by an emphasis on solidarity versus those implied
by an emphasis on productivity. Table 1 presents
the mean ratings of the supervisor as a function of
his expressed values, along with the probability
levels of significant mean differences. From the
results in this table it is clear that the two taped
interviews succeeded in conveying divergent
information about the supervisor and that subjects
typically used this information to construct quite
different impressions of him. As expected, relative
to the productivity-oriented supervisor (Prod) the
solidarity-oriented supervisor (Sol) was seen as
more permissive, more tolerant, and more
tenderhearted. The expected difference in ratings
of warmth, however, was not significant. It is
perhaps not surprising that subjects saw Prod as
stronger and more competent than they saw Sol.
Sol, on the other hand, was better liked than Prod
though not reliably so. The perceived differences
between the two supervisors were, thus, largely
differences on the attributes specifically
emphasized in the interviews and do not seem to be
strongly mediated by an evaluative halo effect.


TABLE 1

MEAN RATINGS OF THE SUPERVISOR FOLLOWING THE SOLIDARITY
VERSUS THE PRODUCTIVITY INTERVIEW

                                          Solidarity                Productivity
Trait (a)                             Open  Closed  Total      Open  Closed  Total         p diff.
Cold (versus warm)                    6.55   6.27   6.42       7.43   5.90   [illegible]   ns
Weak (versus strong)                  5.73   6.36   6.05       3.43   4.80   [illegible]   [illegible]
Incompetent (versus competent)        3.82   3.55   3.69       2.79   2.80   2.80          .05
Permissive (versus demanding)         5.54   5.54   5.50       2.79   3.80   3.30          .001
Intolerant (versus tolerant)          5.09   3.82   4.46       7.14   5.60   6.37          [illegible]
Tenderhearted (versus tough-minded)   7.09   6.91   7.00       5.14   5.00   5.07          .001
Unfair (versus fair)                  3.73   3.82   3.78       4.07   3.20   3.64          ns
Not likable (versus likable)          4.27   3.91   4.09       5.57   4.20   4.89          ns

a Higher scores signify greater attribution of first listed trait in each pair.
b The difference between open and closed conditions is also significant (p < .05).


TABLE 2

CONFORMITY ON DISCREPANT ITEMS (TOTAL, IMPORTANT ONLY) AS A
FUNCTION OF SUPERVISOR VALUE AND JUDGMENT CONTEXT

Means (a)
                All discrepant items          Important items
              Solidarity  Productivity     Solidarity  Productivity
Open            20.78        28.40            9.67        15.20
Closed          27.60        25.60           14.70        13.10

Analysis of variance summary
                                All discrepant items      Important items
Source                           df     MS      F          df     MS      F
Open-closed (A)                   1    4.04                 1    2.15
Solidarity-productivity (B)       1    7.80                 1    3.86
A X B                             1   23.04    2.47         1   12.71    4.34*
Error                            35    9.34                35    2.93

a The lower the score, the greater the agreement with the supervisor.
* p < .05.


Opinion Conformity as an Ingratiation Tactic


A major hypothesis of the present study 
concerned the interactive impact of both inde- 
pendent variables on the degree to which sub- 
jects conform to the supervisor’s opinions. 
Specifically, it was predicted that subjects in 
the open-judgment conditions would conform 
more to Sol and less to Prod than would sub- 
jects in the closed conditions. As detailed 
above, conditions for estimating the degree of 
conformity were established by having the 
supervisor appear to transmit bogus opinion 
ratings, some of which were systematically 
discrepant from the earlier expressed opinions 
of the subject. The subject was instructed to 
respond to each of the supervisor’s opinions 
with his own, degree of conformity being 
measured as the proximity of subject and 
supervisor on the discrepant items. 
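
As a minimal sketch of how such a proximity measure could be scored, assume that ratings are numeric scale positions and that proximity is taken as the summed absolute gap between the subject's and the supervisor's ratings. The text does not say whether absolute gaps were used, so that choice, the function name, and the sample ratings below are all hypothetical and are not taken from the study's data.

```python
def discrepancy_score(subject_ratings, supervisor_ratings):
    """Sum of absolute subject-supervisor gaps over the discrepant items.

    A lower total means closer agreement with the supervisor.
    """
    if len(subject_ratings) != len(supervisor_ratings):
        raise ValueError("rating lists must be the same length")
    return sum(abs(s - v) for s, v in zip(subject_ratings, supervisor_ratings))


# Hypothetical ratings on three discrepant items (illustrative values only).
subject = [7, 4, 9]
supervisor = [5, 4, 12]
print(discrepancy_score(subject, supervisor))  # 2 + 0 + 3 = 5
```

On a score of this kind, lower totals indicate greater conformity, which is the direction in which the conformity means are reported.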

As Table 2 shows, the resulting means fell 
into the pattern predicted by the conformity
hypothesis but the analysis portion of Table
2 indicates that the crucial interaction did not 
approach significance. It may be recalled, 
however, that the message exchange involved 
both “important” and “unimportant” issues 
(as determined by the normative judgments 
of the population from which the present 
subject sample was drawn). While conformity 
on unimportant items showed a slight (and 
nowhere near significant) trend in line with 
the hypothesis, conformity on important items 
resulted in a significant interaction (Table 
2). Consistent with the greater statistical sig- 
nificance obtained when important items only 
were considered, subjects in the open-solidar- 
ity condition do conform more on important 
versus unimportant items relative to all other 
subjects (p< .10). Thus, subjects in this 
crucial cell seem to attach a special meaning 
to the more important opinion items—whereas 
all subjects show a moderate and similar de- 
gree of conformity on unimportant items, 
open-solidarity subjects show a distinctive 
tendency to conform on the important items. 
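
The F ratios reported in Table 2 can be checked from its mean squares, since each F is the effect mean square divided by the error mean square. The sketch below does that arithmetic and also computes a 2 x 2 interaction contrast from the cell means. The function names are illustrative, and the contrast is offered only as a descriptive summary, not as the authors' own computation.

```python
def f_ratio(ms_effect, ms_error):
    """F statistic as the ratio of an effect mean square to the error mean square."""
    return ms_effect / ms_error


def interaction_contrast(open_sol, open_prod, closed_sol, closed_prod):
    """Open-minus-closed difference for Sol minus the same difference for Prod."""
    return (open_sol - closed_sol) - (open_prod - closed_prod)


# A x B interaction, mean squares as printed in Table 2.
print(round(f_ratio(23.04, 9.34), 2))  # all discrepant items -> 2.47
print(round(f_ratio(12.71, 2.93), 2))  # important items only -> 4.34

# Cell means as printed in Table 2 (lower score = closer agreement).
print(round(interaction_contrast(20.78, 28.40, 27.60, 25.60), 2))  # -9.62
print(round(interaction_contrast(9.67, 15.20, 14.70, 13.10), 2))   # -7.13
```

The negative contrasts reflect the predicted pattern: relative to the closed conditions, open-judgment subjects stood closer to Sol and farther from Prod.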

One difficulty with measuring conformity 
by summing the ultimate discrepancies be- 
tween the supervisor’s and the subject’s opin- 
ion ratings is that much of the resulting vari- 
ance may reflect opinion changes occurring 
prior to the experiment, that is, between taking
the before measure in the classroom and being
confronted with the supervisor’s opinions. It was
possible for drastic changes on individual items to
obscure the overall pattern of conformity or
nonconformity. It was decided, therefore, to limit
the contribution of individual opinion items and to
assign each subject a score based on the frequency
of his conformity as inferred by the direction of
pre-post change. Disregarding the magnitude of this
change, a score of 1 was assigned for each rating
falling between the prerating and 2 points beyond
the supervisor’s rating; all other items were simply
assigned a 0. The decision to include movement
slightly beyond the supervisor’s rating seemed
psychologically reasonable and was an a priori
decision. The choice of 2 points beyond, rather than
1 or 3, was arbitrary. This method reduces
substantially the within-cell variance, the predicted
interaction is clearly significant (Table 3; p < .05),
and interpretation does not need to be restricted to
the important items.


TABLE 3

OPINION CONFORMITY:
MEASURE OF MOVEMENT TOWARD SUPERVISOR

Movement index means (a)
                    Judgment context
                    Open      Closed
Solidarity           8.9        6.9
Productivity         6.7        7.6

Analysis of variance summary
Source                           df     MS      F
Open-closed (A)                   1     .31
Solidarity-productivity (B)       1     .57
A X B                             1    2.09    5.08*
Error                            35     .41

a The higher the score, the greater the incidence of
movement toward the supervisor. For each subject, each
of the [illegible] discrepant items was assigned a score
of +1 for movement toward the supervisor and a 0 otherwise.
* p < .05.
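
The movement index described above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' scoring procedure: it assumes that an unchanged rating does not count as movement, that the 2-point allowance beyond the supervisor's rating is inclusive, and that every scored item is discrepant (pre-rating differs from the supervisor's rating). The function names and the sample triples are hypothetical.

```python
def moved_toward(pre, supervisor, post, overshoot=2):
    """Return 1 if the final rating moved from the pre-rating toward the
    supervisor, up to `overshoot` points beyond his rating; otherwise 0."""
    if pre == supervisor:
        raise ValueError("item is not discrepant: pre-rating equals supervisor's rating")
    direction = 1 if supervisor > pre else -1
    # Signed distance moved from the pre-rating, positive when toward the supervisor.
    moved = (post - pre) * direction
    # Farthest distance that still counts: the full gap plus the allowed overshoot.
    limit = abs(supervisor - pre) + overshoot
    return 1 if 0 < moved <= limit else 0


def movement_index(items):
    """Count the items on which the subject moved toward the supervisor."""
    return sum(moved_toward(pre, sup, post) for pre, sup, post in items)


# Hypothetical (pre-rating, supervisor rating, final rating) triples.
items = [
    (3, 8, 6),   # moved part of the way toward the supervisor -> 1
    (3, 8, 10),  # 2 points beyond the supervisor              -> 1
    (3, 8, 11),  # 3 points beyond, outside the allowance      -> 0
    (9, 4, 9),   # no change                                   -> 0
    (9, 4, 10),  # moved away from the supervisor              -> 0
]
print(movement_index(items))  # 2
```

Because each item contributes at most one point, a single drastic change cannot dominate a subject's score, which is the reason given for preferring this index to the summed discrepancies.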


Positivity of Self-Presentation 


Prior to the exchange of opinion messages, 
subjects were instructed to describe them- 
selves to the supervisor via a 20-item rating 
scale composed of evaluative personal attri- 
butes in antonym form. The content of these 
attributes had been preselected so that half of 
them connoted qualities relevant to judgments 
of respect or admiration for personal strength
and competence (respect items) while the re-
maining half centered around qualities of
friendliness and approachability (affability
items). The apparent congruence between
these two categories of item content and the
qualities emphasized by the two supervisors
was, of course, deliberately contrived. It was
expected that the open-judgment context
would give rise to ingratiating self-presenta- 
tions in the particular areas stressed as im- 
portant by the subject’s supervisor. 

It will be recalled that subjects both rated 
themselves and indicated their ideal position 
on each antonym. They were also instructed 
to check attribute pairs they considered es- 
pecially important in determining the attrac- 
tiveness of others. As mentioned in the intro- 
ductory section, a clear expectation was that 
those subjects in the open conditions would
show that they too valued whatever personal
attributes were stressed by the supervisor. The
most direct means to this end was, presumably,
for the subject to attach more importance to
these attributes.

Turning to the results, solidarity subjects
did emphasize the importance of affability
items whereas productivity subjects attached
more importance to respect items (p diff. = .01).
While this differential emphasis was more marked
in the open treatments, the interaction between
judgment condition and supervisor values was not
significant. This suggests that the subjects’
conceptions of important attributes were shaped
by the supervisor’s comments during the interview,
but there was no clear evidence that the subjects
used agreement with the supervisor’s standards to
win his favor.

The remaining issue concerns evidence that
the subjects in the open conditions tried to
convince the supervisor that they in fact
possessed the admired attributes. It was predicted
that subjects would describe themselves more
positively on attributes admired by the supervisor
than on attributes ignored or belittled by him,
and that this tendency would be especially strong
when the supervisor was potentially open to
influence. This prediction was proposed with some
hesitance in view of the fact that a person does
not necessarily convince others that he is, say,
“noble” by constantly extolling his own nobility.
But a study by Jones, Gergen, and Davis (1962)
showed that female subjects became more
self-enhancing when instructed to create a good
impression. In a subsequent study by Jones et al.
(1963), low-status ROTC students became more
self-enhancing when instructed to win the favor of
higher status students, except on those items
judged to be especially important. Because of the
variation of perceived item importance as a
function of experimental treatment in the present
study, the effects of importance on self-ratings
could not be meaningfully studied.


TABLE 4

FAVORABILITY OF SELF-PRESENTATION (IDEAL AND SELF-RATINGS
ON RESPECT AND AFFABILITY ATTRIBUTES)

                                         Experimental condition
                                      Open                      Closed               Significant
                              Solidarity  Productivity  Solidarity  Productivity     comparisons
Affability
  Self                          35.64        38.50        37.18        33.60         -
  Ideal                         21.91        25.00        25.63        25.00         -
Respect
  Self                          39.91        34.86        47.00        35.40         S vs. P**
  Ideal                         22.09        15.86        25.00        16.20         S vs. P***
Total
  Self                          75.55        73.36        84.18        69.00         -
  Ideal                         44.00        40.86        50.63        41.20         -
Respect-affability               4.27        -3.64         9.82         1.80         S vs. P**
  discrepancy (a)                                                                    O vs. C*
Self-ideal discrepancy
  Affability                    13.73        13.50        11.55         8.60         -
  Respect                       17.82        19.00        22.00        19.20         -
  R - A (b)                      4.09         5.50        10.27        10.60         O vs. C*

a The lower the score the greater the tendency to describe self positively
on respect (relative to affability) items.
b The lower the score the greater the tendency to describe self (relative
to ideal) positively on respect (relative to affability) items.
* p < .05.
** p < .01.
*** p < .001.

Table 4 presents the mean values of both 
ideal and self-ratings on respect and affability
attributes; the lower the value, the closer the 
ideal or self-rating to the positive or favorable 
end of the scale. Several features of this rather 
complex table deserve comment. First of all, 
there is no reliable tendency for subjects to 
describe themselves favorably on all items or
on affability items as a function of either
judgmental context or supervisor values. When 
the analysis is restricted to respect attributes, 
however, there is a strong effect of the Sol- 
Prod variable on both ideal and self-ratings. 
When confronted with a supervisor who has 
previously emphasized performance and com- 
petence, subjects locate their ideal selves 
closer to the scale extremes represented by 
such trait names as efficient, original, well- 
organized, and levelheaded. In addition, they 
present themselves as actually having more of 
these characteristics when communicating to 
Prod. 

While these systematic variations in respect 
to item ratings are of interest in their own 
right, reflecting as they do an implicit tailor- 
ing of the presented self to accommodate the 
characteristics of another, there is no evidence 
in the raw data of Table 4 that this accom- 
modation serves as an ingratiation tactic. That 
is, the tendency to model themselves after the 
supervisor seems no greater in the open than 
in the closed conditions. 

An unexpected finding was that those sub- 
jects confronting Sol described themselves 
more positively on respect items in the open 
than in the closed condition (p < .06). To 
the extent that this is a reliable difference, it
might be argued that when there is no direct 
effect of the supervisor’s admiration of respect 
attributes (such as occurred in the Prod con- 
ditions), the context of supervisory judgment 
does affect respect item self-ratings. In the 
Prod conditions, in other words, the impact of 
the supervisor’s values may have been power- 
ful enough to override the weaker effects of 
openness to influence. 

A notable (if unavoidable) weakness in the 
present design is its “after-only” nature. The
final ratings of self and ideal are a compound 
of current subjective state, self-conscious 
strategy, scale interpretation, and other 
sources of idiosyncratic response bias. This is, 
probably, the reason why there is such exces- 
sive variability in the self-presentation scores. 
In an effort to get a clearer picture of the 
experimental effects less contaminated by these 
sources of response bias, the following pro- 
cedure was followed as a way of using each 
subject as his own control. Since there were 
no systematic differences in either ideal or
self-ratings on affability items, and since
affability self-ratings were moderately correlated
with respect self-ratings (r’s ranging from .431 to
.784), each subject was assigned a score
representing favorable self-rating on respect items
minus favorable self-rating on affability items.
The resulting R - A difference score serves as an
index of his tendency to describe himself
positively on respect relative to affability items.
This is, then, a relative emphasis score and also
controls for each subject’s tendency to restrict
himself to a particular region of the scale.
Placing these R - A scores in a factorial design
yields two significant main effects: the Sol-Prod
variation is highly significant and judgmental
context also has a reliable effect (see Table 4).
Thus not only do subjects show a general tendency
to place a relative stress on their respectworthy
qualities when confronted with a task-oriented,
performance-centered supervisor, they also stress
respect qualities when in a position to influence
the supervisor to act in their behalf.
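
The R - A difference score is a simple subtraction, and the arithmetic can be illustrated with the cell means printed in Table 4. This is only a sketch: the article assigns the score to each subject, whereas the example below applies it to cell means (which gives the cell mean of the individual scores provided the same subjects contribute to both ratings). The dictionary layout and function name are hypothetical.

```python
# Mean self-ratings from Table 4 (lower = more favorable self-description).
self_means = {
    ("open", "solidarity"): {"respect": 39.91, "affability": 35.64},
    ("open", "productivity"): {"respect": 34.86, "affability": 38.50},
    ("closed", "solidarity"): {"respect": 47.00, "affability": 37.18},
    ("closed", "productivity"): {"respect": 35.40, "affability": 33.60},
}


def r_minus_a(respect, affability):
    """Respect self-rating minus affability self-rating, rounded to 2 places."""
    return round(respect - affability, 2)


for condition, means in self_means.items():
    print(condition, r_minus_a(means["respect"], means["affability"]))
# ('open', 'solidarity') 4.27
# ('open', 'productivity') -3.64
# ('closed', 'solidarity') 9.82
# ('closed', 'productivity') 1.8
```

These values reproduce the respect-affability discrepancy row of Table 4: scores are lower with Prod than with Sol, and lower in the open than in the closed conditions.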
A final point needs to be considered before
leaving the self-presentation results. It has
been noted that both ideal and self-ratings
on respect items were clearly affected by the
supervisor’s characteristics. This suggests that
the Sol-Prod variable influenced in a general
way how the scale should be used, but did not
affect the relative placement of ideal and
self-ratings. It is as if the supervisor, through
his interview responses, defined for the subject
what is “good” (it is good to be extremely
competent, strong, cool-headed, etc.) but he
did not affect how the subject saw himself
relative to maximal goodness. The open-closed
variable, on the other hand, did affect this
placement of self relative to ideal so that
subjects presented themselves as more competent,
etc., when more strongly motivated (in the
open conditions) to impress the supervisor
with their worthiness.

This reasoning is supported by the last row
of Table 4, where a final self-ideal discrepancy
index is shown. Since the effects of shifting
both ideal and self-ratings canceled each other
out, only the open-closed variable remained
significant.
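
The self-ideal discrepancy index in the last row of Table 4 can be read as the self-minus-ideal gap on respect items less the same gap on affability items. The sketch below illustrates that arithmetic for the two open-condition cells using the printed cell means; the function names are hypothetical, and the article's own index was presumably computed per subject rather than from cell means.

```python
def self_ideal_gap(self_rating, ideal_rating):
    """Distance of the self-rating from the ideal rating (lower = closer to ideal)."""
    return self_rating - ideal_rating


def respect_minus_affability_gap(respect_self, respect_ideal, affab_self, affab_ideal):
    """Self-ideal gap on respect items minus the gap on affability items."""
    gap = self_ideal_gap(respect_self, respect_ideal) - self_ideal_gap(affab_self, affab_ideal)
    return round(gap, 2)


# Cell means from Table 4, open conditions only.
open_solidarity = respect_minus_affability_gap(39.91, 22.09, 35.64, 21.91)
open_productivity = respect_minus_affability_gap(34.86, 15.86, 38.50, 25.00)
print(open_solidarity, open_productivity)  # 4.09 5.5
```

Because the supervisor's values shift the self and ideal ratings together, subtracting the ideal removes that shift, which is why only the open-closed comparison survives on this index.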
It should be emphasized that the obtained
pattern of self-presentation results was not
predicted in detail. In particular, there was at
the outset no theoretical reason to expect
systematic variations in respect items but not
affability items. It is also clear that the
open-closed variable had weak effects on most
indices of self-presentation, reaching substantial
proportions only with a derived index of
self-enhancement on respect minus affability items.
Nevertheless, the significant findings reported
seem provocative, and their implications will
be discussed below.


Other Enhancement 


As in the previous study by Jones et al. 
(1963), the final portion of the interchange 
between subjects and supervisor involved rat- 
ings of the supervisor on the same sheet on 
which he had presumably rated himself. These 
ratings were presumably to be transmitted to 
the targets of evaluation. In the earlier study, 
low-status subjects showed a greater tendency 
than high-status subjects to evaluate their 
partners positively, and this was especially 
true when instructions emphasized the impor- 
tance of compatibility. No such differential 
tendency was observed in the present study. 
That is, contrary to experimental expectations 
subjects in the open conditions were not more 
favorable in their transmitted ratings of the 
supervisor than those in the closed conditions. 
Nor were there any more subtle effects de- 
pending on the content of the items, etc. Rat- 
ing variability was extremely high, and it may 
be that the experimental inductions had lost 
their potency by the time that the “other rat- 
ings” took place. 


DISCUSSION


Subjects in the present study were either
presented with or denied the opportunity to
improve their outcomes by ingratiating themselves
with the supervisor. Provision of this
opportunity was implicit in the conditions
which allegedly permitted the supervisor to
judge each solution attempt as it was offered
by the subject. Denial of this opportunity was
implicit when the supervisor was allegedly
committed to a set of predetermined answers
in rewarding or punishing the subject’s solution
attempts. In the latter conditions, we would
expect the subjects to be relatively unconcerned
with the personal impression they may be making
on the supervisor, whereas this type of concern
should be heightened in the conditions of open
or uncommitted supervisory judgment. The way in
which this concern might manifest itself was
more or less an open question, though it seemed
clear that the tactics of managing a favorable
impression would vary with the values and other
personal attributes characterizing the supervisor.
For this reason, quite different information was
conveyed to subjects in the solidarity versus
the production treatment, designed to emphasize
radically different supervisor attitudes and
orientations.

The most specific and theoretically obvious 
prediction involved the differential use of 
opinion conformity as an ingratiation tactic. 
It was predicted that conformity would be 
greatest in the open-solidarity condition and 
least in the open-productivity condition, the 
two closed conditions falling in between. The 
results generally support this prediction, 
though the degree of support depends on the 
particular measure employed. If one totals 
the item discrepancies between subject and 
supervisor ratings, then the interaction is sig- 
nificant only with the more “important” items. 
If one merely classifies each item as movement
toward the supervisor versus residual, the in- 
teraction is significant with both unimportant 
and important items. Insofar as the impor- 
tance distinction is worth taking seriously, it 
does seem reasonable to suggest that issues
generally viewed as important are more useful 
than trivial ones for emphasizing agreeable- 
ness as well as for emphasizing independence. 

The theoretical interpretation of the pre- 
dicted interaction assumes that subjects in 
both open conditions have a special concern 
with being liked or admired, but that they 
must proceed to implement this concern in 
different ways. Subjects in the open-solidarity 
condition are invited by circumstances and 
the nature of the target to employ conformity 
for strategic purposes. Subjects in the open- 
productivity condition, on the other hand, are 
confronted with a person who seems to devalue 
opinion agreement as such and circumstances 
seem to point away from conformity as a tac- 
tic for impressing the supervisor. Some in- 
teresting, if tentative, correlational data seem 
to support this interpretation. Those subjects
who expressed an interest in being attractive
to the supervisor (as measured by a series of 
items on the postexperimental questionnaire) 
showed a special tendency to respond differ- 
ently as a function of the supervisor’s ex- 
pressed values. In the solidarity conditions, 
the correlation between conformity and at- 
traction motivation was .434; in the produc- 
tivity conditions the correlation between the 
same two variables was —.179. In spite of the 
small samples involved, the difference between 
these correlations is almost significant (p < 
.10) suggesting that wanting to be liked leads 
to conformity only when the target person 
himself seems to value accommodation and 
togetherness. With a few qualifications, then, 
the results fit the argument developed in the
introductory section: the openness of super-
visory judgment provides the incentive to
concentrate on creating an attractive impres-
sion; the supervisor’s own emphasis on effec-
tive interpersonal relations points out the ap-
propriate path of impression management.
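
One conventional way to compare two correlations from independent groups, such as the .434 and -.179 reported above, is Fisher's r-to-z transformation. The sketch below is offered as an illustration only: the article does not say which test was used, and the group sizes passed in the example (20 per group) are hypothetical placeholders, not figures taken from the study.

```python
import math


def fisher_z_difference(r1, n1, r2, n2):
    """Two-sided test of the difference between two independent correlations.

    Returns the z statistic and its two-sided normal-theory p value.
    """
    if n1 <= 3 or n2 <= 3:
        raise ValueError("each group needs more than 3 observations")
    z1, z2 = math.atanh(r1), math.atanh(r2)        # Fisher r-to-z transform
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # standard error of z1 - z2
    z = (z1 - z2) / se
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_two_sided


# Correlations from the text; the group sizes are assumed for illustration.
z, p = fisher_z_difference(0.434, 20, -0.179, 20)
print(f"z = {z:.2f}, two-sided p = {p:.3f}")
```

With small groups the standard error is large, so even a sizable gap between two correlations yields only marginal significance, in keeping with the caution expressed about the small samples involved.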

The tenuous but provocative self-presenta- 
tion results also merit some discussion. It is 
apparent that self-descriptions on traits con- 
noting interpersonal warmth and congeniality 
were unaffected by the experimental variables. 
Ideal ratings on these traits were similarly 
unaffected. This is quite surprising in view 
of the solidarity supervisor’s explicit emphasis 
on the value and importance of such traits in 
work groups. In comparison, the stated values 
of the supervisor had a clear effect on ratings 
of traits connoting competence and personal 
power. Both self-descriptions and ideal rat- 
ings were more favorable in the productivity 
conditions. The fact that both sets of ratings
were more positive suggests that respect attri- 
butes are evaluated relative to a shifting 
standard, a standard quite sensitive to the 
value attached to competence and perform- 
ance by a supervisor. Ideal and self-evalua- 
tions on affability attributes may be more 
firmly anchored in each individual’s past ex- 
periences with others. Many subjects may 
have entered the experimental situation with 
the conviction that too much friendliness and 
warmth is a bad thing and this conviction was 
little influenced by the promptings of the 
solidarity supervisor. 

Of greater theoretical interest was the tend-
ency for subjects in the open conditions to
present themselves as closer to their ideal on 
respect than on affection items—relative to 
the subjects in the closed conditions. For 
whatever reason, subjects in the open con- 
ditions were more anxious to impress the 
supervisor with their competence than their 
affability. Under the circumstances, if the 
supervisor could be led to have respect for the 
subject’s judgment, he might begin to assign 
more points for the latter’s solution attempts. 
Such an increase in respect could, of course, 
have no such effect in the closed conditions. 
Impressing the open judge with one’s compe- 
tence seems to be a more direct way to influ- 
ence his judgment than presenting oneself as 
affable, and (by inference at least) dependent. 
It would be interesting to explore the possible 
conditions under which self-advertised de- 
pendence (placing oneself at another’s mercy) 
would become a strategy to influence perform- 
ance evaluation. 

The expectation that subjects in the open 
conditions would “flatter” the supervisor— 
especially on those attributes he has stressed 
as valuable—was not confirmed by the data. 
It may be true that rating the supervisor in a
complimentary way is viewed as a risky or 
inopportune strategy of influence. In our con- 
sidered judgment, however, the open-closed 
induction had probably lost much of its 
strength before the final rating scales were 
administered. Not only had a fair amount of
time elapsed since the instructions had
emphasized this variable, but there were probably
growing suspicions that the “real” game would
never be played. Apropos of this suggestion
that the induction had lost its strength by the
time subjects were asked to rate their
supervisors, there is evidence in the
postexperimental questionnaire that at least in
the crucial open-solidarity condition, those who
remained interested in making themselves
attractive did evaluate the supervisor more
positively. In the open-solidarity condition
only, a questionnaire measure of attractiveness
motivation showed a firm positive correlation
with favorability of ratings (r = .636); in the
other conditions the correlation between the
same two indexes ranged from -.131 to +.082.
Perhaps if the open-to-influence induction had
been stronger, the former correlation would have
been lower and the tendency
for subjects in the open-solidarity condition to
flatter would have been more uniform.


OBSERVATION OF FAILURE IN ANOTHER PERSON
AS A DETERMINANT OF AMPLITUDE AND
SPEED OF A SIMPLE MOTOR RESPONSE 1


AKIRA KOBASIGAWA 2
University of Iowa


60 1st-grade boys observed an adult performer (P) during 3 types of trials on
a repeated task: success (SUC), failure at 1/5 completion of the task (FF), and
failure at 9/10 completion of the task (NF). 1/2 of the Ss received social
reinforcement from P prior to these observations, while the remaining Ss were
ignored by P during a preliminary period. S’s task was to turn off a buzzer
sounded at the end of each trial by pressing a plunger. Magnitude of
observationally aroused emotion was inferred from S’s amplitude and reaction
times. Ss responded with significantly greater amplitudes and greater speeds
following observation of P’s NF than FF or SUC. The results suggest that
situations which are differentially frustrating to P elicit correspondingly
different magnitude of response in an observing S. Few effects of social
reinforcement on magnitude of response were noted.


The present study was concerned with the 
arousal of an emotional response in young 
children as a consequence of observing an 
adult performer undergo mild stress. Early 
descriptions and analyses of the observational 
arousal of emotional responses were presented 
by Humphrey (1922) and Allport (1924) and 
more recent ones by Kagan (1958) and de- 
Charms and Rosenbaum (1960). There have 
been relatively few empirical demonstrations 
of this phenomenon, although Haner and 
Whitney (1960) and Berger (1962) have re-
cently reported that a conditioned GSR oc-
curs in adult subjects who watch another adult
supposedly being shocked. 

The study reported here differs from previous
experiments in this area with respect to the
manipulation of the stressful situation being
observed by the subject. First, the stressful
situation consisted of blocking the performer’s
ongoing activity rather than presenting an
electric shock. It was assumed that blocking an
ongoing response is ordinarily a stressful event
(for example, Brown & Farber, 1951) and would be
so interpreted by an observing subject.

1 This paper is based on a portion of a doctoral
dissertation submitted to the Graduate College of the
University of Iowa. The writer is deeply indebted to
Willard Hartup, who directed the dissertation and
contributed to the paper in its present form. Gratitude
is also extended to G. N. Cantor, I. E. Farber,
M. E. Rosenbaum, and C. C. Spiker. Thanks are also
due to the principals and teachers of West Liberty
and North Liberty Public Schools and St. Mary’s
School in Iowa City for their help in obtaining
subjects, and to Merle Miller, who aided in the
construction of the apparatus.

2 Now at the University of the Ryukyus, Naha,
Okinawa.

Second, an attempt was made to vary the 
intensity of the stress. In the Haner and Whit- 
ney (1960) and Berger (1962) studies, in- 
tensity of shock was not specified to the ob- 
serving subject. In the present study, subjects 
observed the performer during three types of 
trials on a repeated task: “far failure” (per- 
formance blocked at one-fifth of the distance 
to goal attainment), “near failure” (perform-
ance interrupted at nine-tenths of completion
of the task), and “success” (goal attainment).
These conditions were selected because Haner
and Brown (1955) and Holton (1961) have
shown that frustration is greater when ongoing
activity is blocked near the goal than when the 
response sequence is interrupted relatively far 
from the goal or when the goal is reached. 
One purpose of the present study, therefore, 
was to determine whether situations pre- 
viously shown to be differentially frustrating 
to persons performing a task would elicit emo- 
tional responses of different magnitudes in ob- 
serving subjects. 

Another purpose of this study was to deter-
mine whether children show stronger emo-
tional reactions when observing the frustra-
tion of a performer who has previously pro-
vided social reinforcement as opposed to ob-
serving the frustration of a performer who has
not provided social reinforcement. It has been 
suggested that the observational evocation of 
an emotional response is dependent on the 
interpersonal relationship existing between the 
performer and the observer (Berger, 1962; de- 
Charms & Rosenbaum, 1960; Kagan, 1958). 
For example, deCharms and Rosenbaum state 
that the exposure of a “valued person” to a 
stressful stimulus may arouse greater fear in 
the observer than the exposure of a “non- 
valued person” to such a stimulus. Despite the 
fact that the importance of the performer- 
observer relationship has been stressed in the 
literature, this variable has not been subjected 
to experimental study in relation to the observational
arousal of an emotional response.
It was assumed in this study that if the per- 
former dispensed frequent social reinforce- 
ment (for example, attention and verbal ap- 
proval) to the subject, the performer would 
represent a valued person as contrasted with 
a condition in which the performer ignored 
the subject. Consequently, half of the subjects 
in this study received frequent social rein- 
forcement from the performer prior to the 
experimental task, while the remaining sub- 
jects received no such reinforcement. 
METHOD 

Subjects 

Subjects were 60 first-grade boys from three ele- 
mentary schools in the vicinity of Iowa City, Iowa. 
They ranged in age from 6 years, 7 months to 7 
years, 7 months. Thirty subjects were assigned at 
random to Group S, and received frequent social 
reinforcement during a period prior to the criterion 
task; the remaining 30 subjects were assigned to
Group C and did not receive social reinforcement
during this period. A male college student served as
the performer.


Apparatus 

The apparatus for the criterion task was a modi- 
fication of one used by Haner and Brown (1955). 
Part of the top surface of the boxlike apparatus 
(19 X 13 X 7 inches) included two columns of 10
holes into which marbles were placed. A series of 10
wires strung on a wooden frame (6 X 19 X 2 inches)
inside the box supported marbles placed in the holes. 
When this frame was pulled forward the marbles 
dropped onto a sound-insulating fiber-glass pad on 
the inside floor of the box. The movement of the 
frame activated a microswitch, causing a buzzer to 
sound. The remainder of the top surface contained a 
marble bin and the handle of a spring-loaded plunger.
Depression of this plunger a distance of 4 inch
(requiring approximately 2 pounds of pressure)
activated a second microswitch which stopped the
buzzer.

Additional instruments, a Hunter timer and a
Standard Electric timer, were contained in a second
box (10 X 10 X 9 inches) and were placed at the
rear of the main apparatus. The Hunter timer con- 
trolled the time interval (.5 second) between the 
frame movement and onset of the buzzer. The 
Standard Electric timer, which recorded the subject's 
reaction time, was activated by the onset of the 
buzzer. The buzzer was located in a third box, 
5X63 inches, also placed at the rear of the
apparatus.


Procedure


Preliminary games. Each subject was brought individually
to the experimental room. Subjects in Group
S were introduced to the performer, who was then 
attentive to the subjects during two preliminary 
games. The experimenter began the games immedi- 
ately with subjects in Group C, ignoring the per- 
former who continued to read a magazine while the 
games were played. 

The subject was told that he would play “his 
games” first and that he would have an opportunity 
to win a prize. The first task was to insert 100 
small wooden pegs into 100 holes as quickly as 
possible. For subjects in Group S, the following comments
were delivered by the performer on a fixed
schedule. At the beginning of the game, the performer
said, “You look like a very good game
player. I hope you'll win a prize.” After every fifth
peg was inserted the performer said, “Fine,” “Good,”
or “That’s it.” After every 20 pegs the performer
said, “You are a very good game player,
[subject’s name],” or “You really know how to play
this game,” or “It’s fun to watch a good game player
like you.”

Immediately following the first game, the experi- 
menter showed two identical-appearing hinged blocks 
(3 X 3.75 inches) to the subject and told him
that his next task was to guess which of the two 
blocks contained a marble. On “correct” trials, the 
experimenter placed a marble in both blocks. Marbles 
were not placed in either block on “incorrect” trials. 
At the beginning of this game, the performer said to 
subjects in Group S, “I’m sure you will be good at 
this game, too. I hope you will get enough marbles.” 
After each correct trial, the performer said one of the 
following: “Let’s open it. Good. You guessed right.” 
“There, you got another marble.” “You know how to 
play a guessing game, don’t you?” After each in- 
correct trial the performer said, “I'm sorry but I'm 
sure you'll find the marble next time.” At the end 
of 10 trials the performer said to the subject, “Look,
you won seven marbles. You are a very good game
player.” Instructions for the two games were the 
same for all subjects; all subjects were told by the 
experimenter that they played their games well. 

Criterion task. The experimenter next showed both
the performer and the subject the apparatus for the




TABLE 1

MEANS AND STANDARD DEVIATIONS OF RESPONSE
AMPLITUDES AND SPEEDS ON SUCCESS, FAR-FAILURE,
AND NEAR-FAILURE TRIALS FOR
GROUP S AND GROUP C

                                 Type of trial
                       Success | Far failure | Near failure
Response amplitudes
  Group S
    M                     8.40 |        8.51 |        10.52
    SD                    4.63 |        5.39 |         6.00
  Group C
    M                     8.35 |        8.37 |         9.64
    SD                    3.83 |        4.58 |         5.75
Response speeds
  Group S
    M                   129.92 |      140.65 |       206.60
    SD                   54.45 |       59.79 |       154.79
  Group C
    M                   142.40 |      130.87 |       181.09
    SD                   65.72 |       52.16 |        80.19


criterion task. The experimenter told the subject that 
the performer would play his game, that the performer
must place 20 marbles in holes in the board
to obtain a card, and that the performer needed 
several cards to win a prize. The subject was told 
that sometimes the performer might be unlucky and 
the machine would make the marbles drop into the 
box before he could finish. It was explained that 
a buzzer would sound at the end of each trial which 
would tell the performer to be ready for the next 
game. The subject was told that his task was to
turn off the buzzer by depressing the plunger. He
was then asked to put his right hand on the apparatus
approximately 4 inches from the plunger.

Four practice trials were given. The next 12 trials 
consisted of: 4 trials during which blocking occurred
after the performer placed 4 marbles; 4 trials during
which blocking occurred after 18 marbles were
placed; and 4 trials during which the performer
reached the goal. Order of occurrence of these trials
was randomized for each subject with the following
restrictions: within each block of three trials each
type of trial occurred once and each type of trial
occurred on the first trial for one-third of the subjects.
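
The restricted randomization described above can be sketched as follows. This is only an illustration under stated assumptions, not the procedure actually used to prepare the schedules: the names `TRIAL_TYPES`, `trial_order`, and `orders_for_subjects` are hypothetical, and the use of a seeded pseudorandom generator is an assumption.

```python
import random

# Labels for the three trial types; the identifiers are illustrative only.
TRIAL_TYPES = ("success", "far_failure", "near_failure")


def trial_order(first_type, rng):
    """Return 12 trials: four blocks of three, each type once per block.

    The first trial of the first block is forced to be `first_type`.
    """
    order = []
    for block in range(4):
        block_types = list(TRIAL_TYPES)
        rng.shuffle(block_types)
        if block == 0:
            # Move the assigned type to the front of the first block.
            block_types.remove(first_type)
            block_types.insert(0, first_type)
        order.extend(block_types)
    return order


def orders_for_subjects(n_subjects, seed=0):
    """Give each trial type the first position for one-third of the subjects."""
    if n_subjects % len(TRIAL_TYPES) != 0:
        raise ValueError("n_subjects must be divisible by 3")
    rng = random.Random(seed)
    first_types = list(TRIAL_TYPES) * (n_subjects // len(TRIAL_TYPES))
    rng.shuffle(first_types)
    return [trial_order(first, rng) for first in first_types]


orders = orders_for_subjects(60)
```

With 60 subjects, each type of trial leads the sequence for 20 subjects, and every block of three trials contains one trial of each type.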

The performer counted as he placed the first five 
marbles; beginning with the fifteenth marble, he re- 
sumed counting. Following a successful trial, the 
performer said, “I was lucky this time.” After fail- 
ure, he said, “I was unlucky this time.” He did not 
react differentially to near- and far-failure trials. 


3 It should be recognized that this procedure con- 
founded proximity to the goal and time elapsing be- 
tween the beginning and end of each trial. This con- 
founding does not affect the interpretation of the 
results. Although response magnitudes differed ac- 
cording to type of trial, there was no direct relation 
between response magnitudes and time (see Results).

The experimenter made appropriate comments after
each trial such as, “Good. You get a card,” or “He 
was unlucky.” It should be noted that the whole 
procedure for the criterion task was designed to 
emphasize the importance of the performer's success 
or failure; the subject presumably understood that 
his own prize had already been assured by his per- 
formance in the first two games. 


Response Measures 


Magnitude of observationally aroused emotion was 
inferred from the subject’s response amplitudes and 
reaction times (time between onset of the buzzer and 
its termination by depression of the plunger). Ampli- 
tude of response was measured in terms of units 
which were directly correlated with the distance the 
plunger was depressed. Reaction times were con- 
verted into speed of response measures by recipro- 
cating each reaction time and multiplying by 100. 
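
The conversion from reaction time to speed can be written out as a short sketch. The function name `response_speed` is hypothetical, and the unit of the reaction times (seconds are assumed here) is not stated in the text.

```python
def response_speed(reaction_time):
    """Convert a reaction time to a speed score: 100 times the reciprocal.

    The time unit is an assumption (seconds); the text does not state it.
    """
    if reaction_time <= 0:
        raise ValueError("reaction time must be positive")
    return 100.0 / reaction_time


# A reaction time of 0.5 corresponds to a speed score of 200.
speed = response_speed(0.5)
```

Because the transformation is a reciprocal, shorter reaction times yield larger speed scores, so faster responding appears as a higher value.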


RESULTS 
Complete Task Performance 


The subject’s four scores on each of the 
three types of trials were averaged. The 
means and standard deviations of the response 
amplitudes on success, far-failure, and near- 
failure trials for Groups S and C are presented 
in the upper half of Table 1. A Type I analy- 
sis of variance (Lindquist, 1953) was applied 
to these data, in which Social Reinforcement 
condition (Group S versus Group C) was the 
between-subjects factor and Type of Trial 
(success, far failure, and near failure) was the 
within-subjects factor. The results of this 
analysis reveal that the main effect of Type 
of Trial was statistically significant (F = 
10.06, df = 2/116, p < .001). The main ef- 
fect of Social Reinforcement and the Social 
Reinforcement X Type of Trial interaction 
effect were not statistically reliable. The analysis
of simple effects by means of t tests showed
that subjects responded with significantly
greater amplitudes following observation of
near failure than following far failure or success
(p < .001). The difference between the
means for success trials and far-failure trials 
was not significant. 
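
The degrees of freedom reported for this analysis follow from the design and can be checked with a small sketch. It assumes the usual partition for a design with one between-subjects factor and one within-subjects factor; the function name `mixed_design_df` and the dictionary keys are illustrative.

```python
def mixed_design_df(n_subjects, n_groups, n_levels):
    """Degrees of freedom for one between-subjects and one within-subjects factor.

    Assumes the standard partition for such a mixed design.
    """
    between_groups = n_groups - 1
    between_error = n_subjects - n_groups
    within_factor = n_levels - 1
    return {
        "between_groups": between_groups,
        "between_error": between_error,
        "within_factor": within_factor,
        "interaction": between_groups * within_factor,
        "within_error": between_error * within_factor,
    }


# 60 subjects, 2 reinforcement groups, 3 types of trial.
df = mixed_design_df(60, 2, 3)
```

For 60 subjects, two groups, and three types of trial, the Type of Trial effect has 2 degrees of freedom and its error term has 116, which agrees with the reported df = 2/116.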

The lower half of Table 1 presents the mean 
response speeds and standard deviations on 
each type of trial for Groups S and C. The 
analysis of variance performed on these data 
showed essentially the same results as those
reported for amplitude scores. The main effect
of Type of Trial was significant (F = 13.13,
df = 2/116, p < .001). Subjects responded
with faster speeds on near failure than on far
failure or success (p < .001) and mean speeds 
for success and far-failure trials did not differ 
reliably. No effects of social reinforcement 
were observed in this analysis. 


First Trial Performance 


An exception to the negative findings with 
respect to the social reinforcement conditions 
is contained in an analysis of changes in re- 
sponse amplitude from the four practice trials 
to the first trial of the criterion task. Lind- 
quist Type III analyses (Lindquist, 1953) 
were performed on these scores, in which the 
main effects were Social Reinforcement, Type
of Trial, and Practice versus First Trial
scores. The analysis of amplitude data indicated
that the Practice versus First Trial main
effect was significant (F = 5.10, df = 1/54,
p < .05) as was the Social Reinforcement
X Practice-First Trial interaction (F = 4.32,
df = 1/54, p < .05). Further analysis showed
that socially reinforced subjects responded
with significantly greater amplitude on the
first trial than on the practice trials (p <
.01), while a reliable difference between
Practice and First Trial scores was not found
for nonreinforced subjects. This increment
was greater for the reinforced subjects regardless
of type of first trial (success, far failure,
and near failure), although subjects who observed
success or near failure showed somewhat
greater practice-first trial increments in
response amplitude than subjects who observed
far failure. A parallel analysis of first
trial scores conducted on the speed data did
not show similar social reinforcement effects.


DISCUSSION 


The results strongly suggest that situations 
which have been reported to be differentially 
frustrating for the performer elicit corres- 
pondingly different magnitudes of response in 
the observer. Support for this interpretation 
may be obtained by comparing the results of 
the present study with those by Haner and 
Brown (1955). In their study, the experimenter
interrupted the subject’s marble-placing
performance at various distances from the
goal. The Haner-Brown results suggest that
failure near the goal produces stronger emotionality
(frustration) in the performer than

TABLE 2

MEAN RESPONSE AMPLITUDES FOR “VERBALIZERS” AND
“NONVERBALIZERS” ON SUCCESS, FAR-FAILURE,
AND NEAR-FAILURE TRIALS

Trial            Verbalizers    Nonverbalizers
Success                 9.31              8.13
Far failure             9.60              8.17
Near failure           14.02              8.93

either failure far from the goal or success. 
Since the analysis of both the amplitude and 
speed data in the present study tended to 
replicate the findings of the Haner-Brown 
study, it may be concluded that observation- 
ally aroused emotionality varies as a function 
of the effectiveness of the stimulus situation 
in arousing an emotional state in the performer.

Informal data gathered during the present 
study suggest at least one factor which may 
have been involved in determining the mag- 
nitude of the observationally aroused emo- 
tional responses. Records were kept through- 
out the data collection of instances when the 
subject voluntarily counted as the performer 
placed the marbles and when the subject made
supportive verbal statements (for example,
“Move fast,” “You are lucky this time,” or
“Just two more.”). Thirteen subjects, seven
from Group S and six from Group C, verbalized
in this manner. The mean response amplitudes
of these subjects on each type of trial
are reported in Table 2, along with means for
the nonverbalizing subjects. Analysis of these
data showed that the main effect of verbalization
was significant (F = 4.18, df = 1/58, p
< .05) as was the Verbalization X Type of
Trial interaction (F = 10.76, df = 2/116,
p < .001). The significant interaction effect
indicates that differences in response amplitude
associated with type of trial varied between
verbalizing and nonverbalizing subjects.
Verbalizers responded with significantly
greater amplitude on near-failure trials than
on far-failure or success trials, but such differences
were not noted for the nonverbalizers.
Further analysis showed that difference in
mean response amplitudes between verbalizers
and nonverbalizers was statistically reliable
on near-failure trials (p < .05), but not on
the other two types of trials. 

These results suggest several hypotheses. 
First, the differences in response amplitudes 
between nonverbalizing and verbalizing sub- 
jects may indicate that verbal rehearsal medi- 
ates the observational arousal of emotional 
response in some manner. Second, subjects 
who verbalized during the criterion task may 
have possessed stronger, more widely general- 
ized tendencies to “match” or “become in- 
volved” in the performance of others than 
subjects who did not verbalize. In other words, 
personality differences among subjects may 
have been related to both verbalization and 
emotionality in this experiment. The present 
data, however, do not offer verification of 
either hypothesis. 

The second purpose of the present study 
was to investigate the effect of social rein- 
forcement in relation to observational arousal 
of emotional response. With one exception, 
the results show that there were no effects of 
social reinforcement condition on the criterion 
measures. The exception was observed in the
analysis of changes in the amplitude of re- 
sponse from practice trials to first trial; rein- 
forced subjects showed a greater increment in 
response amplitude from the practice trials to 
the first trial than nonreinforced subjects. 

The failure to obtain any but tenuous effects
of social reinforcement in the present
study may indicate that such conditions are 
irrelevant to observational arousal of emo- 
tionality. It should be recognized, however, 
that the social interaction between the per- 
former and the observer was relatively short. 
The phase of the experiment in which the 
adult performer provided social reinforcement 
lasted only about 10 minutes. On the other
hand, in a study by Bandura and Huston
(1961), in which social reinforcement was associated
with amount of imitative behavior,
subjects spent two 15-minute sessions with a 
“nurturant” model prior to the criterion task. 
Also in that study the model actually helped 
the child in his play as well as verbally re- 
warded him. The short-lived effects of social 
reinforcement noted in the practice versus 
first trial analysis in the present study suggest
that a more extended period of social reinforcement
would perhaps have been desirable.


Kentucky

Assimilation and contrast effects in social judgment have been interpreted in
terms of latitudes of acceptance and rejection for discrepant beliefs. In this
view, assimilation occurs when the judge, because of his own attitude, accepts
the content of the judgment stimuli. A field experiment with 93 women Ss
testing this view in judgment of a persuasive communication about fluoridation
finds contrary evidence, suggesting an interpretation in terms of dissonance
incurred following exposure to discrepant beliefs. Judges’ evaluations of the
communication are consistent with this interpretation. This study also examines
external anchoring and judge’s familiarity with the focal issue. A 2nd study
is presented that examines, with supporting results, 1 facet of a dissonance
interpretation of social judgment.


Sherif and Cantril (1945, 1946, 1947) provide
a wide-ranging review of the early literature
relevant to social judgment, emphasizing
the role of internal and external frames of
reference. In situations of social judgment
probably the most important internal anchor
is the judge’s own attitudinal position on the
scale of judgment (Berkowitz, 1960; Sherif &
Hovland, 1961). Under conditions of ego involvement
the influence of the judge’s own
attitudinal position as an anchor should be
particularly strong. Early research on this
problem concerned placement of items in the
construction of attitude scales. Several studies
were interpreted as indicating that the judge’s
attitude did not influence judgment (Ferguson,
1935; Hinckley, 1932; Pintner & Forlano,
1937); another more recent study also
suggests that under standard instructions used
in construction of a Thurstone scale a judge’s
attitude does not influence his judgments
(Segall, 1959). Other studies, some using objects
of judgment more complex than single
statements, have found the opposite effect
(Cartwright, 1941; Hovland, Harvey, &
Sherif, 1957; Hovland & Sherif, 1952; Kelley,
Hovland, Schwartz, & Abelson, 1955; Manis,
1960; Sherif & Hovland, 1953; Weiss, 1961).


1 These studies were conducted while the investigator
was a member of the Behavioral Science Division,
School of Public Health, University of California,
Berkeley, which was supported by a grant
from the Russell Sage Foundation.


STUDY 1


Accounting for the above-mentioned findings,
Hovland et al. (1957) and Sherif and
Hovland (1961) conceptualize a person’s
beliefs about an object as comprising a judgmental
scale that involves latitudes of acceptance
and rejection. The latitude of acceptance
refers to that range of opinion statements
that a person will accept; the range of
beliefs that the individual rejects comprises
his latitude of rejection. A third range of beliefs
represents a region of neutrality.

The judge’s familiarity with the topic and 
his ego involvement with the issue are ex- 
pected to influence the size of latitudes of 
acceptance and rejection. Persons who are 
more familiar with the issue, and those who 
are more ego involved, are expected to have 
narrower latitudes of acceptance and wider 
latitudes of rejection. 

Hovland et al. hypothesize that beliefs ac- 
cepted by the judge are assimilated to the 
judge’s own attitudinal stand, this stand serv- 
ing as an internal anchor. Beliefs that are 
perceived to represent stands outside the per- 
son’s latitude of acceptance, and especially 
those falling within the latitude of rejection, 
are contrasted with the judge’s own attitude 


2 This study is based upon part of a doctoral dissertation
submitted to the Department of Psychology,
University of California, Berkeley. I acknowledge
with gratitude the stimulating and helpful
guidance of the thesis committee: M. Brewster
Smith (chairman), Andie L. Knutson, and Edward
E. Sampson.

and thus displaced away from that anchor.
Hovland et al. suggest the following general- 
ization, which is tested in the present study: 
Relative contrast effects in judgment should 
be associated with greater discrepancies be- 
tween a judge’s own opinion (as an internal 
anchor) and the position advocated in a com- 
munication.* 

An additional hypothesis to be tested is the
following: With a moderate discrepancy be- 
tween the position of a communication and 
the position of the judge, judges who are 
more familiar with the topic should show con- 
trast effects relative to judges who are less 
familiar with the topic. With small or large 
distances between the position of the com- 
munication and the position of the recipients, 
there should be little difference in judgments 
between judges familiar and less familiar with 
the issue. These predictions stem from the 
assumption that relative familiarity is associated
with narrower latitudes of acceptance.

With respect to external anchors, both as- 
similation and contrast effects have been 
found in studies of psychophysics and social 
judgment (see Sherif & Hovland, 1961). The 
balance of these studies find contrast effects, 
with assimilation apparently limited to an- 
chors that are near to the stimuli being judged. 
Weiss (1959), however, found considerable 
assimilation for nearby and distant stimuli
in a social judgment task with external anchors.

Each of the three main theoretical attempts 
to account for external anchoring phenomena 
predicts contrast (Helson, 1959; Sherif & 
Hovland, 1961; Volkmann, 1951). With ex- 
treme anchors, the judge’s adaptation level 
should shift in the direction of the anchors, 
resulting in assignment of the stimuli to cate- 
gories farther from the anchor than would 
otherwise be the case. Volkmann on his part 
describes the scale as being extended in the 
direction of the anchor with the result that 
the categories “slide past” the stimuli being 
judged. Finally, Sherif and Hovland base


3 Manis’ (1961a) theoretical account of assimilation
and contrast phenomena in terms of a need to
reduce the influence potential of the impinging message
appeared after the present study was completed.
His theoretical views are incorporated in the Discussion.


RONALD C. DILLEHAY


their generalization on empirical evidence 
largely from psychophysical studies. They pre- 
dict assimilation effects when the anchor is 
adjacent to or within the range of stimuli 
being judged; in all other cases contrast is 
predicted, and the greater the discrepancy 
between the stimulus and the anchor, the 
greater the predicted effect. 

With respect to external anchoring, the 
present study tests the general hypothesis 
that a communication is judged to be more 
or less favorable toward the object discussed 
in the communication depending upon whether 
it is preceded by a strongly negative anchor 
or a strongly positive anchor. With a strong 
negative anchor, the communication should be 
seen as more positive than when it is judged 
without the anchor; following a strong positive 
anchor the communication should be seen as 
more negative than it would be alone or after 
a negative anchor. 

Evaluation of a persuasive communication 
should covary with judgment, as determined 
by latitudes of acceptance and rejection. A 
persuasive message that falls within the re- 
cipient’s latitude of acceptance should tend to 
be evaluated in positive terms. If evaluation 
depends upon latitudes of acceptance, the
value of a communication for a person should
be predictable from knowledge of his judgment
of the communication and his own attitude
in the matter. Familiarity with the topic
should influence evaluation through its effect 
upon latitudes of acceptance and, therefore, 
upon judgment. 

Hovland et al. (1957; see also Sherif & 
Hovland, 1961) found that a high percentage 
of subjects who agree with the content of a 
communication evaluate it in favorable terms, 
whereas a low percentage of opposed subjects 
take a favorable view of the communication. 
Weiss (1958) found that a significantly larger 
percentage of subjects in the group that 
judged an anonymous communicator’s posi- 
tion as less extreme judged him as relatively 
trustworthy, compared with the group of subjects
who judged the communicator more extreme.
In Weiss’ study the more extreme communicator
differed to a greater extent from
the beliefs of the subjects. 

Regarding evaluation of a communication, 
the hypothesis tested is that smaller discrepancies
between the judged position of the communication
and the attitude of the judge are
associated with more positive evaluations. This 
prediction may also be derived from an ex- 
tension of balance theory (Heider, 1958; 
Newcomb, 1959) to belief congruence and 
evaluation. 


Method 


The study was conducted as a survey in which 
two forms of an interview schedule incorporated 
different conditions. Only through a field experiment 
could the hypothesis be tested in the context of an 
intrinsically involving issue (see Sherif & Hovland,
1961) with a population of nonstudents, avoiding 
the possible disadvantages of a contrived laboratory 
atmosphere. Subjects, residents in the cities of 
Berkeley and adjacent Albany, were told the study 
concerned their opinions on topics of recent interest 
in the Bay Area. 

The topic selected for study was the addition of 
fluorides to a community water supply. This issue 
had received considerable local attention, a proposal 
for fluoridation having been recently defeated in a 
referendum in the locale of the study, and consid- 
erable nationwide attention (Dowd, 1962; Paul, 
Green, Simmel, Kirscht, Knutson, Raulet, Sanders, 
Gamson, Irons, & Kegeles, 1961). Arguments about 
fluoridation frequently involve considerations of mass 
medication, socialized medicine, and religion (Davis, 
1959). The few subjects who had not heard of the 
issue were excluded from the sample. 

The message on fluoridation was moderately 
favorable toward the topic (as determined by pretesting)
and about 400 words in length. The appeals
used were moderate, rational, and two-sided, in that 
opposing points of view were mentioned and refuted 
in the message. Within the scope of the study it was 
not possible, though it would have been desirable, to 
use a parallel argument opposing fluoridation with 
additional groups. 

One group of subjects heard three highly positive 
statements about fluoridation immediately preceding 
the message. These statements were carefully labeled
“highly positive” and were given as examples of 
what people had said about the issue. This procedure 
was intended to provide extreme anchors on the 
favorable end of the scale of attitude toward fluoridation.
The negative anchoring group received three
highly negative statements about fluoridation before 
hearing the communication. These statements were 
labeled “highly negative” and were also given as 
examples of what people had said about fluoridation. 
In selecting the anchors, highly charged and ad 
hominem statements were avoided. 

Materials and measurement techniques. Preexperi- 
mental attitude was assessed by a semantic differen- 
tial technique using 10 scales selected because of 
their high loadings on the evaluative dimension in a 
number of studies reported by Osgood, Suci, and
Tannenbaum (1957). In addition to the evaluative
scales, 7 other scales were used as filler items. The
order of the scales on the instrument was randomly
determined. The decision to use the semantic differential
as a measure of attitude grew out of a desire to
minimize interference with the communication and
anchoring effects.

The communication was judged as to its favorable- 
ness toward the topic on a graphic scale ranging from 
1 (highly opposed to fluoridation) to 15 (highly in 
favor of fluoridation). Only the end points were 
defined. 

Evaluation of the communication was obtained 
using Likert-type items with five alternatives based 
on the following dimensions: liking, reasonableness, 
propaganda-fact, goodness, biasedness. Intercorrela- 
tions among these items show that the first four are 
substantially interrelated; scores on these items were 
summed to provide an overall index of evaluation. 
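
The construction of the evaluation index can be sketched as below. The item labels follow the dimensions listed above, but the identifiers, the 1 to 5 scoring of each item, and the direction of scoring are assumptions; the text states only that each item had five alternatives and that the first four were summed.

```python
# The first four evaluation dimensions, which the text reports as substantially
# interrelated. Scoring each item from 1 to 5 is an assumption.
EVALUATION_ITEMS = ("liking", "reasonableness", "propaganda_fact", "goodness")


def evaluation_index(responses):
    """Sum the four interrelated items; the biasedness item is left out."""
    return sum(responses[item] for item in EVALUATION_ITEMS)


# Hypothetical responses from one subject.
example = {
    "liking": 4,
    "reasonableness": 5,
    "propaganda_fact": 3,
    "goodness": 4,
    "biasedness": 1,
}
index = evaluation_index(example)
```

Under the assumed 1 to 5 scoring the index ranges from 4 to 20, and the fifth item does not contribute to it.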

Familiarity with the topic was assessed by a direct 
question. To provide a somewhat similar frame of 
reference for each subject for replying to this question,
the four response alternatives were read to him.

Subjects and procedures. Ninety-three women resi- 
dents of the cities of Berkeley and Albany comprise 
the sample. They ranged in age from 20 to over 65; 
median age in the very flat distribution was 40-44 
years. Modal income for the group was $6000-6999,
consistent with the relatively stable, middle-class
neighborhoods in which these women were living. 
Education of the subjects also ranged widely from 
some grammar school to graduate or professional 
training. Subjects were obtained from areas known 
to contain individuals with a full range of attitude 
toward fluoridation. 

Measures of familiarity and attitude were obtained 
in that order. Subjects were then randomly given 
either positive or negative anchoring followed by the
communication. They were then asked to make
their judgment and evaluate the communication. The 
interviews were conducted by the investigator and 
five interviewers trained and supervised by him. 

The data comprise a 3 X 2 X 2 factorial design
(Attitude X Anchoring X Familiarity) with unequal 
numbers per cell (cell frequency varies from 8 to 
26) due to the correlation between extremity of 
attitude and familiarity. The range of attitude scores 
possible on the evaluative scales used in this study 
was 10-70, high scores representing a more favorable 
attitude. Subjects were grouped into three attitude 
classes as follows: 20-40, opposed to fluoridation; 
41-54, slightly favorable; 55-70, highly favorable. 
Responses to the item measuring familiarity were 
dichotomized at the best median split. Unless other- 
wise indicated, the analyses of variance reported are 
based on the procedure for unequal cell frequencies 
recommended by Walker and Lev (1953, p. 381). All
significance levels are based on two-tailed tests. 
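
The grouping of subjects into the cells of the design can be illustrated with a brief sketch. The cutoffs are those given above; the function and label names are hypothetical, and the handling of scores from 10 to 19 (possible on the scales, but not assigned to any class in the text) is an assumption.

```python
from itertools import product

# Attitude classes as given in the text (inclusive score ranges).
ATTITUDE_CLASSES = (
    (20, 40, "opposed"),
    (41, 54, "slightly favorable"),
    (55, 70, "highly favorable"),
)
ANCHORING = ("positive", "negative")
FAMILIARITY = ("more familiar", "less familiar")  # labels are illustrative


def attitude_class(score):
    """Map a summed evaluative score to one of the three attitude classes."""
    for low, high, label in ATTITUDE_CLASSES:
        if low <= score <= high:
            return label
    # Scores of 10 to 19 are possible, but the text assigns them no class.
    raise ValueError("no attitude class is given for score %r" % (score,))


# The twelve cells of the Attitude X Anchoring X Familiarity design.
cells = list(product([label for _, _, label in ATTITUDE_CLASSES], ANCHORING, FAMILIARITY))
```

The product of three attitude classes, two anchoring conditions, and two familiarity levels gives the 12 cells across which the 93 subjects were unevenly distributed.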


Results 


Judgment of the communication. The 
judgments of the communication as raw data 
were negatively skewed inasmuch as most
subjects judged it to be favorable toward
fluoridation. In order to meet the assumptions
of analysis of variance, these data were trans-
formed by the ranking method presented by
Hull (1928) prior to performing the analyses.

Fig. 1. Judgments of the communication by atti-
tude and familiarity. (The higher the judgment
score, the more favorable toward fluoridation the
communication was perceived to be. N varies from 8
to 26.)
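
A rank-based normalization of the kind referred to above can be sketched as follows. The details are assumptions, not taken from the article: the percentile position (rank - 0.5) / N, the averaging of tied ranks, and the rescaling to a mean of 50 and standard deviation of 10 are chosen for illustration, and the raw scores are hypothetical.

```python
from statistics import NormalDist

def rank_normalize(scores, mean=50.0, sd=10.0):
    """Replace each score by a normal deviate derived from its rank.

    Tied scores share the average of the ranks they occupy. The percentile
    position (rank - 0.5) / n and the 50/10 scaling are assumed conventions.
    """
    n = len(scores)
    order = sorted(range(n), key=lambda i: scores[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j to the end of the current group of tied scores.
        while j + 1 < n and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank for the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    std_normal = NormalDist()
    return [mean + sd * std_normal.inv_cdf((r - 0.5) / n) for r in ranks]

# Hypothetical, negatively skewed judgments on a 15-point scale.
raw = [15, 14, 15, 9, 13, 15, 12, 14]
normalized = rank_normalize(raw)
```

Because only the rank order of the raw scores is used, the transformed values are approximately normal whatever the shape of the original distribution.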

Original attitude as an internal anchor.
With respect to original attitude and its effect 
upon judgment, the greater the discrepancy 
between the judge’s attitudinal stand on the 
issue and the stand taken in the communica- 
tion, the greater, by hypothesis, should be the 
contrast effect on judgment. Because the 
communication used in the study is moderately 
favorable toward fluoridation, this hypothesis 
requires that subjects opposed to fluoridation 
should judge the communication as more fa- 
vorable than subjects who are themselves 
slightly favorable toward the issue. If the 
subjects highly favorable toward fluoridation 
show contrast effects in judgment, the data 
should indicate a monotonic, inverse relation- 
ship between original attitude and judgment. 
A direct relationship between original attitude 
and judgment of the communication would 
indicate overall assimilation effects, the oppo- 
site of the prediction. 

With subjects classified by familiarity, orig-
inal attitude, and experimental condition, the
data show a positive relationship between 
favorableness of attitude and judgment of 
the communication: the less favorable the 
attitude of the judge toward fluoridation, the 
less in favor of fluoridation the communica- 
tion is perceived to be. While these differ- 
ences fall short of statistical significance, 
their magnitude suggested a further analysis 
of variance using a classification by original 
attitude and familiarity (Snedecor, 1956). By 
this analysis the effect of original attitude on 
judgment of the communication is significant
beyond the .05 level (see Figure 1). A com- 
parison of the use of different categories of the 
response scale by subjects of different atti- 
tudes indicates more favorable attitudes are 
associated with the use of scale positions 
12.5-15, the more favorable the attitude the 
more frequent being the use of these cate-
gories (χ² = 9.35, df = 2, p < .02).
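
The significance level of the chi-square statistic just reported can be checked directly, because with 2 degrees of freedom the upper-tail probability has the closed form exp(-x/2). The function name below is illustrative.

```python
import math

def chi2_sf_df2(x: float) -> float:
    """Upper-tail probability of a chi-square variate with 2 degrees of freedom."""
    return math.exp(-x / 2.0)

p = chi2_sf_df2(9.35)  # roughly .009, consistent with the reported p < .02
```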

External anchors. The hypothesis that the 
communication would be judged less favorable 
when preceded by strongly positive anchoring 
statements than when preceded by highly 
negative anchoring statements is not supported 
by the data at a statistically reliable level. 
The slight difference between the positive 
anchor group (M = 49.07) and the negative 
anchor group (M = 49.4) is in the predicted 
direction. There is no suggestion of a differen- 
tial effect of the external anchors for different 
levels of either familiarity or original atti-
tude, no significant interaction with external
anchoring appearing in the analyses. 

Familiarity with the issue. In the present 
study a majority of judges slightly favorable 
toward fluoridation and most of those highly
favorable toward fluoridation may be regarded
as moderately discrepant from the stand taken 
in the communication; the opposed judges are 
clearly much more discrepant than those who 
favor fluoridation. The hypothesis, then, re- 
quires that subjects who are more familiar 
with the issue and slightly favorable in atti- 
tude judge the communication more favorable 
toward fluoridation than subjects of the same 
initial attitude who are less familiar; among 
those highly favorable toward fluoridation at 
the outset, judges more familiar should judge 
the communication less favorable toward 
fluoridation than their less familiar counter-
parts; subjects who oppose fluoridation and
are familiar with the issue should not differ
markedly in judgment from subjects who op-
pose fluoridation and are less familiar with it.⁴

The data show that judges more familiar 
with the issue perceive the communication to 
be more favorable toward fluoridation than do 
those less familiar (Figure 1, p < .05). Since 
none of the interactions in these data ap- 
proach significance, familiarity appears to 
have a similar effect over different levels of 
original attitude and over positive and nega- 
tive anchoring. Figure 1 shows, however, and 
t tests confirm, that the difference is due 
mainly to the highly favorable (p < .01) and 
slightly favorable (p < .10) attitude groups. 

Evaluation of the communication. Most sub-
jects evaluated the communication in positive
terms, producing a negative skew in the dis- 
tribution of raw scores. The analyses reported 
below are based upon the cube of the raw 
data. 
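
Why cubing helps with a negatively skewed distribution can be illustrated with a small sketch. The scores below are hypothetical, and the moment-based skewness coefficient is one common choice, not necessarily the index the author inspected.

```python
def skewness(xs):
    """Moment coefficient of skewness: third central moment over variance ** 1.5."""
    n = len(xs)
    m = sum(xs) / n
    m2 = sum((x - m) ** 2 for x in xs) / n
    m3 = sum((x - m) ** 3 for x in xs) / n
    return m3 / m2 ** 1.5

# Hypothetical evaluation scores piled up near the top of the scale.
raw = [20, 45, 55, 60, 62, 65, 66, 68, 69, 70]
cubed = [x ** 3 for x in raw]

raw_skew = skewness(raw)
cubed_skew = skewness(cubed)
```

Cubing stretches the upper end of the scale more than the lower end, so the long lower tail is pulled in relative to the rest of the distribution and the skewness moves toward zero.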

Original attitude as an internal anchor. The 
hypothesis dealing with evaluation states that 
the moderately positive persuasive communi- 
cation should be evaluated more positively by 
judges who are initially favorable toward 
fluoridation than those who initially hold un-
favorable attitudes. The data of the present
study strongly support this hypothesis in that


4 These predictions require that the “true” location
of the message falls between the slightly favorable
attitude group and the highly favorable attitude
group. By pretest, the communication, without pre-
ceding anchors, was judged to be intermediate be-
tween the neutral point and the extremely favorable
end of a graphic scale. This agrees with the consensus
judgment of the communication by the total subjects 
of 12.5 on the 15-point scale. The cutting point for 
attitude groups on the semantic differential is a 
score of 54.5, midway between a psychologically 
neutral score of 40 and the highest favorable score 
of 70, lending credence to the assertion that most 
subjects slightly favorable toward fluoridation and 
most subjects highly favorable toward fluoridation 
are moderately discrepant from the message and to 
the differential predictions, based on differences in 
latitudes of acceptance for subjects familiar and less 
familiar with the topic in these attitude groups. This 
indirection in specifying the discrepancy between the 


munication.

there is a main effect of original attitude upon
evaluation (Figure 2, p < .01) and a more
positive evaluation by subjects initially favor- 
ing fluoridation compared to those opposed 
(t test, p < .01). 

The clearest test of the prediction regarding 
evaluation and distance between the beliefs of 
the judge and those advocated in the com- 
munication is a comparison between the 
judges with slightly favorable attitudes and 
those with negative attitudes; both of these 
groups are less favorable toward fluoridation 
than the attitude expressed in the communica-
tion, while the group opposed to fluoridation
is a greater distance from the communication
than the group of subjects slightly in favor of
the issue. This comparison, by t test, shows
that slightly favorable subjects evaluated the
communication more positively than did the
negative judges (p < .01).

External anchoring. Since the external an- 
choring manipulation was found to be insig- 
nificant in altering judgment, no effect of ex- 
ternal anchoring upon evaluation of the com- 
munication is expected. None in fact was 
found. Smaller discrepancies between the posi-
tion of the judge and the perceived position
of the communicator tend (short of signifi-
cance) to be associated with more favorable 
evaluations, an effect in the predicted direc- 
tion. 

Familiarity with the issue. On the basis of 
the judgment data and the hypothesis regard- 
ing evaluation, the data on evaluation should 
show an interaction between original attitude 
and familiarity—more positive evaluation 
should be associated with greater familiarity 
in the judges highly favorable toward fluori- 
dation while more positive evaluation should 
be associated with less familiarity in the 
judges slightly in favor of fluoridation. There 
should be little or no difference between groups 
of subjects with negative attitudes. 

There is a significant main effect of famili-
arity upon evaluation of the communication
(Figure 2); more familiar subjects evaluate 
the communication in more positive terms 
than do subjects who are less familiar with 
fluoridation (p < .05). Subgroup comparisons 
by t test show a significant difference between
judges differing in familiarity who are highly 
in favor of fluoridation (p < .01). The dif- 
ference in the same direction between famili- 
arity subgroups that were slightly in favor of 
fluoridation (p < .05) is contrary to expecta-
tion. Because the less familiar, slightly favor- 
able subjects judged the communication to be 
closer to their own opinion, they theoretically 
should have evaluated it more positively than 
their attitudinal counterparts who were more 
familiar with the issue. Among the subjects 
opposed to fluoridation, the slight difference 
between subgroups more and less familiar 
with fluoridation is, as expected, not signifi- 
cant, but consistent with obtained differences 
in judgment. 


Discussion 


External anchoring. External anchors did 
not yield reliable differences in judgment and 
evaluation of the communication. There are a 
number of possible reasons for the lack of 
significant findings with these anchoring state- 
ments. 

1. With anchors that are not extreme rela-
tive to the communication, contrast might not
be expected. Studying judgments of weights, 
for example, Sherif, Taub, and Hovland 
(1958) found that anchors identical with the
end stimulus in the series being judged, or
adjacent to the stimulus series, tended to pro- 
duce assimilation effects. It was only after 
the anchor was made more remote that it 
produced contrast. Implications from a study 
by Upshaw (1962) of social judgment are 
relevant here. From his analysis we may ex- 
pect that if the anchors fell within the ref- 
erence scale of the subjects in the study they 
would not be expected to alter existing sub- 
jective categories and consequently modify 
judgments of the communication. The local 
and national controversy over fluoridation in 
the months preceding the study makes this a
plausible suggestion. 

2. That the use of anchoring in the social 
judgment situation presents some additional 
complexities is shown in a study by Weiss 
(1959) dealing with single statements of be- 
lief. Using either an end anchor or a neutral 
anchor, marked assimilation effects were 
found, and the effects tended to generalize 
throughout much of the scale. 

3. In our present state of knowledge about 
the use of anchoring with communications on 
controversial social issues it is possible that the 
major effect of anchors is to remind the lis- 
tener about arguments already known to him, 
and this role is performed equally by negative 
or positive statements (cf. Janis, 1957). 

Original attitude and familiarity. The data
on attitude as an anchor in judgment are di-
rectly opposite to the prediction of contrast
that stems from psychophysical studies and
the findings of Hovland et al. (1957). In the 
study by Hovland et al., however, only the 
extreme subjects (at Positions 1 and 2 on a
9-point scale) displaced the communication 
located at 6 on the scale away from their 
own position. Subjects who were at Positions 3 
and 4 on the scale, who would be comparable 
to the judges in the present study who op- 
posed fluoridation, showed assimilation and 
not contrast effects (Figure 2, p. 248). These 
data were apparently minimized or overlooked 
in arriving at the contrast hypothesis. Other 
studies (Manis, 1960, 1961a) report assimi- 
lation by neutral judges relative to both mod- 
erate and extreme judges; subsequent studies 
by the same author show related findings 
(Manis & Blake, 1963). Considering the data 
from these various studies, the following modi-
fied hypothesis seems advisable: In situations
of social judgment, contrast effects occur when 
the judge holds an extreme view opposing the 
stand taken in the communication. Other 
things being equal, relative assimilation occurs 
in other combinations of position of the judge 
and the message. 

This hypothesis does not wholly agree with 
some data based on judgments of single state- 
ments of belief nor with psychophysical evi- 
dence. Judgments of statements referring to 
Negroes by Negro judges have shown consid- 
erable displacement of items throughout much 
of the scale, not merely for opposing state- 
ments. Conditions of extreme commitment and 
high personal involvement, as in the case of 
Negroes judging statements about Negroes, 
may reduce assimilative tendencies. Negro 
judges are likely more intensely involved in 
judging assertions about Negroes than are 
even highly committed judges in the case of 
prohibition repeal or fluoridation. 

The psychophysical evidence on which much 
of the present thinking about social judgment 
is based may be misleading in its relative 
contextual simplicity. Strong motives enter 
the social judgment situation via the topic, 
the communicator, etc. Also, for many people 
disagreement is unpleasant, especially when 
that disagreement is with a moderate or sensi- 
ble-sounding person or communication, a fac- 
tor that may produce assimilation tendencies 
in many social judgment situations. Disagree-
ment is not an issue in psychophysical studies
of judgment. 

The prestige of the communicator is a 
further example of a social variable that modi-
fies judgment (Manis, 1961b), and has been
shown to produce assimilation tendencies for
messages widely discrepant from the judge’s
when the communicator was prestigeful. Pres- 
tige of the communicator thus extends assimi- 
lative tendencies to more discrepant points on 
the scale, suggesting increased specification of 
the foregoing assimilation-contrast hypothe- 
sis. 

The judgment data in the present study 
from different attitude and familiarity sub- 
groups also do not conform to the predictions 
stemming from differential latitudes of ac-
ceptance and rejection. While we hypothe-
sized that opposed judges would not show
differential judgment due to familiarity, since
the message favoring fluoridation should fall 
well within the latitude of rejection for fa- 
miliar and less familiar judges of this attitude, 
these judges as a group showed relative assim-
ilation. It seems implausible that these op-
posed subjects accept the views in the mes- 
sage, a necessary condition for assimilation 
under the theory of latitudes of acceptance. 
In addition, the greater assimilation by the 
familiar, highly favorable subjects compared 
to the less familiar, highly favorable subjects 
was not predicted since greater familiarity 
should have produced contrast tendencies in 
the latter group due to narrower latitudes of 
acceptance.

These results, however, are accountable in 
terms of Manis’ (1961a) suggestion that as- 
similation is due to attempts to minimize the 
dissonance incurred by exposure to discrepant 
beliefs. Subjects opposed to fluoridation may 
reduce the dissonance experienced through ex- 
posure to the communication by interpreting 
the message as closer to their own beliefs; fa- 
miliar, highly favorable subjects should expe- 
rience more dissonance following a moderate 
statement than should less familiar, highly 
favorable subjects, and therefore show greater 
assimilation. Alternative forms of response to 
dissonance such as rejection of the content of
the message, discrediting the source, etc., are
unlikely in the present instance due to the
moderate view expressed in the communica- 
tion. 

While the theory of latitudes of acceptance 
and rejection can predict the judgmental re- 
sults for the slightly favorable subjects, dis- 
sonance theory can as well given certain as- 
sumptions. The assumptions necessary are 
that mildly committed subjects regard views 
on the issue in terms of a simple dichotomy— 
either favorable or unfavorable toward fluori- 
dation—and further, that exposure to a more 
extreme but like-minded view will arouse less 
dissonance than a moderate or mild state- 
ment, since the more extreme view will be 
seen as stronger justification for one’s own 
beliefs. Coupling these assumptions with the 
expectation that the magnitude of dissonance 
following discrepant inputs is a function of 
involvement, given attitude constant, we would 
predict that familiar, slightly favorable sub-
jects should experience more dissonance fol-
lowing a moderate, two-sided statement of 
their views than should less familiar, slightly 
favorable subjects because of differences in 
involvement reflected by familiarity. 

A simple alternative accounting for the 
differences in judgment associated with famili- 
arity is that for any given attitude, subjects 
more familiar with the topic yield more ex- 
treme judgments, a generalization parallel to 
Manis’ (1960) findings for committed (non- 
neutral) subjects. Given the above-mentioned 
assumptions, however, dissonance reduction 
may be the process underlying these judg- 
mental outcomes. 

Because the slightly favorable subjects fa-
miliar with fluoridation evaluate the com-
munication more positively than less familiar 
subjects in this group, the evaluation data do 
not conform to the prediction based on the 
theory of latitudes of acceptance and rejec- 
tion that relative contrast in judgment is as- 
sociated with less favorable evaluation. One 
possible interpretation of these data is in 
terms of the amount of dissonance overcome 
through perceptual displacement of the com- 
munication. The present data can be ac- 
counted for if one assumes that the greater 
the dissonance overcome by displacing the 
communication in a favorable direction, re-
gardless of the absolute amount of dissonance
experienced, the more positive the evaluation.
In this view subjects in the groups highly 
favorable toward fluoridation and those more 
familiar with fluoridation in both attitude 
groups should evaluate the communication in 
more positive terms. 

An alternative interpretation is based not 
on dissonance reduction but upon the relation 
between judgment of the communication and 
a hypothetical set of strong arguments in sup- 
port of a person’s existing attitude. In these 
terms, a communication that affords support 
for existing beliefs should be evaluated posi- 
tively, regardless of the amount of dissonance 
reduced through perceptual displacement of 
the message. Conversely, the greater the per- 
ceived discrepancy, the less positive the evalu- 
ation. The process being suggested is, in short, 
one of displacement of the communication 
as a function of dissonance, and, following 
perceptual displacement, evaluation based on
the congruence between attributed position
of the communication and a hypothetical 
set of strongest supporting arguments for 
one’s own views. The data fit this interpreta- 
tion if an earlier made assumption prevails, 
viz., that stronger arguments (that is, more 
extreme yet reasonable) imply greater sup- 
port for an attitude of similar valence re- 
gardless of how moderate the attitude is. In 
support of this suggestion one can argue that 
a strong communication in support of an indi- 
vidual’s existing beliefs may induce little or 
no dissonance yet produce highly positive 
evaluations, as, for example, in the case of 
highly favorable judges and an extreme, 
favorable communication. It may also be pos-
sible to show that similar amounts of dis-
sonance reduction yield predictably different 
evaluations when judgments indicate different 
degrees of support for the judge’s attitude. 


STUDY 2


In view of the outcomes of the above study 
and the reinterpretation of judgmental proc- 
esses in terms of dissonance reduction, a 
second study was conducted to examine some 
conditions under which dissonance should be 
relatively minor and therefore lead to little 
displacement in judgment and reduced liking 
for a discrepant communication. This study 
is based on the premise that individuals 
taking an extreme stand and with support for 
their view should be subject to less dissonance
than individuals similarly committed but rela-
tively lacking in support.
In the case of fluoridation, members of a
professional group committed to support of
and responsible for knowledge about fluorida-
tion should experience little potential disrup-
tion of their beliefs when exposed to a moder-
ate appeal compared to persons who are
comparable in attitude but without similar
cognitive, social, and institutional support.
Accordingly, a group of public health school
nurses was selected and compared with a
subgroup of women from the previous study 
similar to the nurses in attitude and familiar- 
ity. The nurses studied (N = 52) include 
most of the nurses employed by a large metro-
politan school district. They were adminis-
tered the data-gathering instruments and
responded to the communication during a
regular meeting of their group. They were
asked to give their responses under the guise 
of familiarizing themselves with some re- 
search that had been done in the community. 
The materials and order of presentation for 
these subjects were identical to those of the 
previous study; the procedure differed in 
that the residents of the community were 
interviewed individually in their homes. 


Results 


Nurses were matched in attitude toward 
fluoridation with a comparable group of non- 
nurses from the previous study. This was 
accomplished by first selecting all nonnurses 
in the previous study who obtained scores
within the nurses’ range of attitudes (47-70 
on the 10 evaluative semantic differential 
scales). After grouping attitude scores into 
eight intervals of equal width, a total of five 
nonnurses were eliminated from those inter- 
vals most overrepresented as determined by 
comparability of proportion of cases in the 
nurse and nonnurse distributions. The two 
distributions were then compared by chi- 
square and found to be insignificantly differ-
ent (χ² = 8.10, df = 7, p > .30). The nurses’
mean attitude score was 60.6, with standard
deviation of 5.11; the final matched non-
nurses’ mean was 60.0, with standard devia-
tion equal to 6.38.

Judgment of the communication yielded 
skewed distributions which were normalized
by the ranking method given by Hull (1928).
The means and standard deviations are given 
in Table 1 for the nonnurse group (subjects
of the previous study) as a whole, the sub- 
group of nonnurses matched with the nurses 
in attitude, and the nurses. The nurses do 
not differ significantly from the total group 
of nonnurses; they do differ significantly 
(t = 3.23, df = 78, p < .01) from the sub-
sample of nonnurses who are comparable to
the nurses in attitude valence and extremeness.


TABLE 1
JUDGMENTS OF THE COMMUNICATION

Group                 M      SD     N
Total nonnurses      49.7    9.2    93
Matched nonnurses    54.0    7.9    28
Nurses               47.7    8.0    52

Note. High scores indicate communication judged more
favorable toward fluoridation.


TABLE 2
EVALUATION OF THE COMMUNICATION

Group                 M      SD     N
Matched nonnurses    58.2   14.78   28
Nurses               49.3   17.7    52

Note. High scores indicate more positive evaluation.
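
The comparisons between nurses and matched nonnurses can be approximated from the summary statistics in Tables 1 and 2. The sketch below assumes an ordinary pooled-variance t test, which is consistent with the reported 78 degrees of freedom but is not stated in the text; the function name is illustrative.

```python
import math

def pooled_t(m1, sd1, n1, m2, sd2, n2):
    """Two-sample t statistic and degrees of freedom from means, SDs, and group sizes."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / se, df

# Matched nonnurses versus nurses, values as printed in Tables 1 and 2.
t_judgment, df_judgment = pooled_t(54.0, 7.9, 28, 47.7, 8.0, 52)
t_evaluation, df_evaluation = pooled_t(58.2, 14.78, 28, 49.3, 17.7, 52)
```

Recomputed this way, the statistics come out close to, but not identical with, the reported values of 3.23 and 2.24. Small discrepancies of this kind would be expected if the original analysis used unrounded means and standard deviations.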

The evaluation data, skewed in the direc-
tion of a positive evaluation, were first trans-
formed by the cube of the raw data and then
reduced in location on the scale by dividing
each score by 100. The evaluation means and
standard deviations are presented in Table 2. 
The nurses differ significantly from that sub- 
sample of nonnurses that hold extreme favor- 
able attitudes and are familiar with fluorida-
tion (t = 2.24, df = 78, p < .05). These dif-
ferences in evaluation means parallel the
judgment data.


Discussion 


The mean judgment for the total non- 
nurse group is composed of judgments made 
by subjects that range in attitude from op- 
posed to fluoridation to highly favorable 
toward it. In the absence of an objective 
standard, such an average may be taken as 
a reflection of the consensus placement of the 
communication. (This consensus may err 
slightly in a neutral direction since no sub- 
jects highly unfavorable toward fluoridation 
were included in the sample of nonnurses, and 
these highly opposed subjects would be 
expected to contrast the communication with 
their own attitudes.) This value as an ap- 
proximate consensus of the placement of the 
communication essentially agrees with pretest 
data obtained with the communication (see 
Footnote 4). The nurses display considerable 
agreement with this consensus. The higher 
mean judgment given by the involved non- 
nurses on the other hand, represents a marked 
displacement in judgment toward the attitudes 
represented in this group. 

What of education as a factor influencing 
the obtained result? We may be sure that 
the nursing group has attained a higher level 
of education than the nonnurses studied,
since the latter were selected from the com-
munity. Education data on the latter group
show that 82% graduated from high school; 
35% of the group attended college. While the 
numbers in each group are small, we can 
examine the tendency for a negative relation- 
ship between judgment and education in this 
group. Such examination shows no marked 
trend for lower mean or median judgment to 
be associated with educational level (means, 
from low to high education: 58.7, 58.4, 56.0; 
medians, from low to high education: 62, 
57.2, 59). Educational level and judgment are
not related in this group of nonnurses, even 
though wide variation in education occurs in 
the sample.

The fact that the data were obtained from
the nurses in a regular meeting of their group 
while the nonnurses were assessed individually 
in their homes might yield the kind of differ- 
ences obtained to the extent that the nurses’ 
own attitudes were dependent upon the sali- 
ence of group norms. It seems likely, however, 
that the nurses’ stand on this issue is more 
a result of internalization than identifica- 
tion processes (Kelman, 1961), yielding the 
expectation that similar results would have 
been obtained if they were interviewed 
individually in another situation. 

These results support the interpretation of 
judgmental processes in terms of cognitive 
dissonance; given various support for one’s 
point of view, less dissonance should be 
expected following exposure to discrepant be- 
liefs, leading to less perceptual displacement 
and reduced tendencies toward inflated 
evaluations. We would hypothesize that other 
circumstances of the communication situa- 
tion that produce differences in potential 
dissonance for different groups would yield 
predictable outcomes in social judgment.


SOCIABILITY, FEARFULNESS, AND GENETIC VARIATION
IN THE MOUSE¹


GARDNER LINDZEY,² HARVEY D. WINSTON,³ AND LARRY E. ROBERTS


University of Minnesota 


A total of 187 mice from 5 different inbred strains was observed in an open-
field situation. In general, the results indicate significant differences in sociability
(time spent adjacent to stimulus mice) between different genotypes thus 
providing evidence for the importance of genetic variation as a determinant 
of sociability. It also appears that those strains that are most sociable are
least fearful.

The present study is concerned with sociability 
or affiliative behavior in Mus musculus or the 
common house mouse. It involves investigation 
of the performance of five highly inbred strains 
of mice using an objective measure of sociability. 
Perhaps the most novel feature of the study 
is its provision of a basis for examining the 
role of genetic variation as a determinant of 
sociability. There are also some interesting links
to Schachter’s (1959) research on anxiety and 
affiliation as a result of the fact that we studied 
animals known to be highly fearful (anxious) 
as well as animals known to be low in fearfulness. 


METHOD 
Subjects 


The subjects of this study were 187 mice dis- 
tributed across five different genotypes (strains). 
The five strains (BALB/c, C3H/Bi, A, C57/BL/1, 
DBA/8) have all been bred by brother-sister mating 
well beyond the 30 generations that theoretically are
sufficient to approximate homozygosity. It is reason- 
able to assume that within each strain there is 
almost complete genetic identity, except for the 
X and Y chromosomes, and that between strains 
there is considerable genetic diversity. All animals 
were raised under standard conditions (Lindzey, 
Lykken, & Winston, 1960) and were tested when 
70 days or more of age. Age distributions were 
comparable in the various strains and the two sexes.


Procedure 


The apparatus consisted of a square, open field 
(36 × 36 inches), the floor of which was covered


1 This research was supported by a grant from 
the Ford Foundation and by Research Grant M-3536 
from the National Institute of Mental Health,
United States Public Health Service. The paper was 
prepared while the first author was in residence at 
the Center for Advanced Study in the Behavioral 
Sciences. 

2 Now at the University of Texas.

3 Now at the University of Pennsylvania.


lightly with shavings and which was illuminated
by a 25-watt light bulb suspended 1.5 feet above 
the floor of the field. The field was surrounded by
a wall 10.25 inches high. In each of the four corners
of the field was a wire cloth cage, 4 inches high, with
hinged plastic top and metal bottom (5.5 × 5.5
inches), and in two of the diagonally opposed cages
were two stimulus mice of the same sex, strain, and 
approximately the same age. The position of the 
stimulus and control cages was systematically 
rotated. The subject was placed in the center of 
the open field and allowed to run freely for 5 
minutes. Each subject was given a 5-minute trial 
on each of 5 consecutive days.

A line was drawn on the floor of the cage 2 inches
around each cage and whenever a mouse placed 2
feet inside such a line he was considered adjacent
to the cage. An observer using stopwatches recorded
for each subject the amount of time that he spent
adjacent to stimulus cages and to control cages. He
also recorded the amount of time that was spent in
freezing. Freezing was recorded only when the sub-
ject was motionless for more than 3 seconds and
it was not recorded when adjacent to any of the 
cages. This apparatus is a modification of a measure 
of sociability employed by Antonitis and Ki 
(1955).


RESULTS
Strain Differences in Sociability 


The descriptive findings (Table 1) and analysis 
of variance results indicate that there are highly 
significant differences (p < .001) between strains
in sociability as measured by the amount of 
time spent adjacent to the cages containing 
stimulus mice. When the strains are clustered 
or grouped in terms of this measure, using the 
Newman and Keuls method (Winer, 1962) for 
multiple group comparisons, it appears that the 
most sociable is the C57/BL/1 which is sig- 
nificantly different from all other strains except 
the BALB/c, which in turn is significantly more — 
sociable than the remaining three strains. The 
DBAS strain is significantly more sociable than 
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TABLE 1 
COMPARISON OF FIVE InsreD MOUSE STRAINS 
Seconds of time adjacent Seconds of time adjacent Defecation 
to stimulus cages tacontrol cages (Number of boli) 
Strain N 

M SD M SD M SD 
C57/BL/1 39 129.26 22.30 103,19 21.99 1.13 1.08 
BALB/c 25 122.00 31.86 109.23 26.36 3.60 1.19 
DBA/8 42 106.52 30.86 110.54 25.49 2.36 1.90 
C3H/Bi 36 98.68 29.67 89.16 23.01 2.61 1.79 
A 34 86.39 36.98 404.89 37.50 4.33 1.02 

the A, while there is no significant difference
between the A and C3H strains.

Several additional findings suggest that the
above results are not an artifact produced by
strain differences in motility or exploratory tend-
ency. First, when the strains are compared in
the amount of time spent adjacent to the control
(empty) cages there are once again significant
strain differences (p < .02) but the order is now
quite different. Commencing with the strain spend-
ing the largest amount of time adjacent to the
control cages the order is: DBA/8, BALB/c, A,
C57/BL/1, C3H/Bi. Second, when time adjacent
to the stimulus cage is correlated with time
adjacent to the control cage we obtain a
highly significant negative correlation (r = −.31,
p < .001). Third, when the scores for the indi-
vidual animals are corrected by subtracting for
each trial the time spent adjacent to the control
cages from the time spent adjacent to the stimu-
lus cages we once again obtain highly significant
strain differences (p < .001) and this time ob-
serve an ordering of strains that is substantially
the same as that obtained with time adjacent to
the stimulus cages alone. The order of strains
from high to low is: C57/BL/1, BALB/c,
C3H/Bi, DBA/8, A.

Our results are thus unambiguous in suggesting
that some of the variance in affiliative or social
tendencies in mice is accounted for by geno-
type. Moreover, it appears that this association
cannot be accounted for by means of strain dif-
ferences in motility. It is interesting that a
character so complex and apparently remote
from biological processes as sociability is still
linked to some degree with genetic variation.
These findings are congruent with a growing
body of research (for example, Lindzey, Winston,
& Manosevitz, 1961; Scott, 1958a) demon-
strating that in lower animals complex social
phenomena such as social dominance and aggres-
sion are partly controlled by genetic mecha-
nisms. The role of affiliative or social tendencies
in influencing social structure or organization is
little understood and has been subjected to no
controlled investigation. On an a priori basis, how-
ever, there seems every reason to anticipate sig-
nificant associations between these two variables.
Moreover, it seems likely that in particular set-
tings the strength of social tendencies would sig-
nificantly influence the fitness or likelihood of sur-
vival of the animal (Scott, 1958b), and thus it
should not be surprising that genetic variation is
associated with varying degrees of sociability.

It should be noted that our measure of socia-
bility reflects the tendency of the mouse to seek
a position adjacent to other mice but this does 
not imply that all mice seek other mice in pref- 
erence to nonsocial stimuli. Indeed, for two of 
the strains (DBA/8, A) the average time spent 
adjacent to the cages containing mice was 
slightly less than the time spent adjacent to the 
empty cages. This finding is consistent with the 
results of Antonitis and Kish (1955) who found
little difference in adjacent time between an
empty cage and a cage containing an active 
mouse, while the stimulus that elicited most 
adjacent time was an anesthetized mouse. Thus, 
although mice show individual and strain differ- 
ences in sociability, they do not consistently 
display social responses in preference to non- 
social responses. 
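
The correction for time spent near the empty cages, described above, can be illustrated with a minimal sketch. It works from the strain means in Table 1 rather than from the per-trial scores of individual animals (which are not reported), so it only approximates the authors' procedure. The names `TABLE_1_MEANS`, `corrected_sociability`, and `rank_high_to_low` are hypothetical, and the A-strain control mean is taken as 104.89 seconds, which is an assumption about a poorly legible table entry.

```python
# Strain means in seconds: (adjacent to stimulus cages, adjacent to control cages).
# Assumption: values are read from Table 1; the A-strain control mean is taken as 104.89.
TABLE_1_MEANS = {
    "C57/BL/1": (129.26, 103.19),
    "BALB/c": (122.00, 109.23),
    "DBA/8": (106.52, 110.54),
    "C3H/Bi": (98.68, 89.16),
    "A": (86.39, 104.89),
}


def corrected_sociability(means):
    """Subtract control-cage time from stimulus-cage time for each strain."""
    return {strain: stim - ctrl for strain, (stim, ctrl) in means.items()}


def rank_high_to_low(scores):
    """Return strain names ordered from the highest to the lowest score."""
    return sorted(scores, key=scores.get, reverse=True)


corrected = corrected_sociability(TABLE_1_MEANS)
order = rank_high_to_low(corrected)
print(order)
```

Under these assumptions the resulting order is C57/BL/1, BALB/c, C3H/Bi, DBA/8, A, which agrees with the corrected ordering reported in the text; the two strains with negative corrected scores (DBA/8 and A) are the ones that spent slightly less time near the stimulus cages than near the empty ones.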


Sociability and Fearfulness


Perhaps the most widely used of all measures 
of animal temperament is the open-field test 
(defecation/urination) which is ordinarily be-
lieved to assess fearfulness, emotionality, or
anxiety (Broadhurst, 1960; Hall, 1941). The 
strains under discussion have been observed ex- 
tensively in the open-field situation in a variety 
of investigations, some of which have been pub- 
lished (Lindzey, 1951; Lindzey et al., 1960; 
Lindzey et al., 1963); moreover, they display 
remarkably consistent and highly significant 
strain differences in fearfulness. In this case the 
order of strains from most fearful to least is 
A, BALB/c, C3H/Bi, DBA/8, C57/BL/1. 

A comparison of this rank with the order of
strains according to sociability indicates that the 
measures are, if anything, inversely related. In 
particular, the C57/BL/1 strain, which has al- 
ways appeared much lower in open-field fearful- 
ness than our other strains, appeared in the 
present study to be the most sociable, while 
the A strain, which was lowest in sociability, has 
always been among the most fearful of the 
strains. 

Another measure of fearfulness or anxiety was 
available for the individual animals used in the 
present study. This was an index of the amount 
of time that each animal spent freezing
(motionless) during the sociability trials. When 
strains are compared on this variable we once 
again observe significant differences (p < .001) 
and find an order from high to low (A, C3H/Bi, 
BALB/c, C57/BL/1, DBA/8) that bears little 
resemblance to the strain differences in sociabil- 
ity. Thus, the strain that is least sociable is 
also the strain that is most fearful as measured 
by freezing.

It should be emphasized that association be-
tween particular traits or characters in inbred
populations can be produced readily by means 
of the random assortment of independent gene 
factors. Thus, whether the suggestive relation 
between fearfulness and sociability we have ob-
served is produced by linkage, epistasis, pleiotro-
pism, or chance is a matter that can be settled
only by further experimentation involving 
hybridization studies and back-crosses. 

DISCUSSION 


The growing interest that psychologists have
displayed in recent years in the variable of socia- 
bility, affiliation, or gregariousness is largely at- 
tributable to the efforts of Schachter and 
Atkinson. A series of studies by Schachter 
(1959) and others (for example, Gerard, 1963; 
Gerard & Rabbie, 1961; Sarnoff & Zimbardo, 
1961) has demonstrated that affiliative tendencies 
increase in situations that heighten anxiety or 
emotional disturbance and that this finding is 
particularly pronounced in the case of first-born 
subjects. Pursuing quite a different tack, Atkin-
son and his collaborators (Atkinson, 1957) have
devised a measure of individual differences in 
affiliation, using a modification of the TAT, and 
have shown that their affiliation score is corre- 
lated with behavior in a variety of situations, 
including visual perception of social stimuli, task 
performance, and social choice. Many of the 
important differences between these two sets of 
studies are attributable to the fact that the 
Michigan group generally has followed Murray 
(1938) in treating affiliation as a motive or
dispositional concept, while Schachter and those
who have extended his work generally have
appeared to view affiliation as an instrumental
response that serves to facilitate social compari-
son and thus, under certain circumstances, to 
reduce anxiety or other negative affects. 

The present study deals with this same domain 
of behavior although it involved a drastic pro-
cedural regression in the choice of subjects. Our
findings are congruent with Atkinson’s in sug- 
gesting the existence of meaningful individual 
differences in sociability that are related to other 
significant variables. The absence of any positive 
association between sociability and fearfulness 
(indeed the possibility of a negative association) 
appears to be inconsistent with the findings of 
Schachter and others. However, it is difficult to 
link our results to much of the current discussion 
concerning anxiety and affiliation not only be- 
cause we are talking about mice rather than men 
but more importantly because so much of the 
human research has dealt with the effect of one 
situational variable (anxiety arousal, hunger) 
upon a single affiliative response. It is quite 
possible that research at the human level dealing 
with enduring dispositions or traits might pro- 
duce findings congruent with our results rather 
than suggesting a positive association between 
anxiety and affiliation. While it is generally true 
that we need to know more about the experi- 
mental conditions that influence affiliative tend- 
encies, it is equally important to explore stable 
individual differences in affiliation, their corre- 
lates, and their antecedents. 
The purpose of this study was to delineate aspects of aggression which relate
to academic effectiveness in college as a function of differences in the social role
ascribed to males and females. Academic effectiveness was defined as the degree
to which performance exceeds or falls short of what would be predicted on
the basis of aptitude alone. A sample of 45 males and 48 females was ad-
ministered Siegel’s Manifest Hostility Scale, from which subscale scores were
derived assessing the strength of the tendencies toward acts of aggression,
feelings of aggression, and absence of guilt over aggression. The following
hypotheses received support: (a) Males would exceed females in direct ex-
pression of aggression while females would exceed males in guilt over aggressive
expression. (b) Among males, those both high in direct aggressive expression
and low in guilt over aggression would have the highest academic effectiveness.
(c) Among females, those both low in direct aggressive expression and high in
guilt over aggressive expression would have the highest academic effectiveness.

Recent evidence indicates that academic
achievement, conceptually defined as the degree
to which academic performance exceeds or falls
short of that expected on the basis of aptitude
alone, may be analyzed successfully in terms of
the different social roles aspired to by males and
females (Wyer & Terrell, 1963). If this is true,
the extent to which personality characteristics in-
fluence academic achievement (or academic ef-
fectiveness, as it will be referred to in this paper)
may partially depend upon the extent to which
these characteristics are manifested in role-con-
sistent behavior patterns. Hostility is among those
characteristics which can be analyzed effectively
from this standpoint.

1 This study was supported by grants from the
United States Office of Education and the Council
on Research and Creative Work of the University of
Colorado.

2 At the Institute of Child Behavior and Develop-
ment,

Previous studies (for example, Burgess, 1956; 
Gebhart & Hoyt, 1958; Shaw & Grubb, 1958) 
have yielded inconsistent data concerning the 
relationship between hostility and aggression and 
measures of academic effectiveness. These ap- 
parent inconsistencies can be attributed to one 
or more of the following factors: methodological 
and conceptual differences in the definition of 
academic effectiveness, differences in the educa- 
tional environment in which the studies were 
performed (for example, grade school, high 
school, college), differences in the measures of 
hostility or aggression, and failure to discrim- 
inate among various components of aggressive 
behavior and to analyze the differential effects of 
each. The present study will attempt to clarify 
the effects of at least the last factor.




Aggression is not always directly manifested in 
feelings of hostility or aggressive actions. Ag- 
gression-linked anxiety or guilt is one by-product 
of the socialization of aggression in our society 
and the anticipation of a guilt reaction serves to 
inhibit direct expression of aggression. It may 
also motivate the channeling of aggressive im- 
pulses into forms of behavior in which the ag- 
gressive component is not obvious. Competitive- 
ness in the academic sphere, which is likely to 
be an important factor in determining an indi- 
vidual’s academic effectiveness, is one kind of 
behavior that can serve as a vehicle for expressing 
aggression in a constructive, socially useful man-
ner. However, there is reason to hypothesize that
academic competition plays a different role for
males than for females in expressing aggressive 
tendencies. This distinction hinges upon differ- 
ences between males and females in the behavior 
patterns which characterize their present and 
future social roles. 

Males who, relative to females, are expected to 
fill a dominant position in society, may be en- 
couraged to adopt a social role in early years to 
which direct aggressive expression is important. 
Fighting tends to be a recognized aspect of male 
social development and success in this area is 
regarded as an indication of dominance or mascu- 
linity. Athletics may be an additional area in 
which direct or indirect aggressive expression has 
a positive social value among males. In the voca-
tional role ultimately aspired to by males, and to
which academic competence is directly relevant, 
competitiveness and aggressiveness also tend to be 
positively valued characteristics. High aggression 
may therefore reflect a high need for dominance 
which may be directly manifested in all areas 
relevant to fulfillment of the male social role. 
To the extent that vocational success is perceived 
by males (particularly college students) as a 
primary aspect of the male role and academic 
competence is seen as relevant to this success, un- 
inhibited aggressive tendencies in these students, 
which might be indicated by the conjunction of 
high direct expression of aggression and low 
guilt over such expression, should relate positively 
to academic effectiveness. On the other hand, low 
active aggression needs, or inhibition of direct 
aggressive expression, may in part indicate failure 
to adopt the dominant male role and may there- 
fore be associated with a lesser degree of academic 
effectiveness. 

The primary role filled by females in our soci- 
ety is that of housewife or mother, and a large 
proportion of college women can be expected to 
be motivated toward that goal (Wyer & Terrell, 
1963). Striving for dominance which may be 
manifested in direct aggressive expression is much
less consistent with this role than with the mascu-
line role. In fact, it has been found that females
will purposely adopt submissive roles in inter-
action with males in order to avoid situations 


nant status (Weiss, 1961). If it is true that direct 
expression of hostility and aggression conflicts 
with feminine (submissive) social role behavior
and impedes attainment of role-relevant (social)
goals, socially conscious females with high ag-


these needs into areas which would not conflict 
with, but might even facilitate, primary goal 
attainment. Academics, which are often seen by 
females as augmenting possibilities of social goal
attainment (Wyer & Terrell, 1963), may be an
effective outlet in the university setting since
aggressive tendencies can be sublimated through
competitiveness in intellectual areas without con-
tradicting female role expectancies.

A high need to displace aggression may be 
indicated by a low tendency to express aggression 
directly in conjunction with high guilt over ag- 
gression. In such a case the lack of overt aggres- 
sion may be a result of this guilt, and the need 
for indirect aggressive expression may therefore 
be maximal. On the other hand, a low degree of 
direct aggressive expression in conjunction with
low guilt may indicate a low need for aggressive
expression of any kind. Therefore, to the extent
that academic striving provides an outlet for
indirect expression of aggression in females, those
who are low in direct aggressive expression but
high in guilt over aggression should show higher
academic effectiveness than females not fitting
this description.

The reasoning outlined above suggests the fol-
lowing hypotheses:

1. Males will exceed females in direct expres-
sion of aggression while females will exceed males
in the need to inhibit aggression (that is, in
guilt over expression of aggression).

2. Among males, those who are both high in
direct aggressive expression and low in guilt over 
aggressive expression will have the highest aca- 
demic effectiveness. 

3. Among females, those who are both low in 
direct aggressive expression and high in guilt over 
aggressive expression will have the highest aca- 
demic effectiveness. 


METHOD 


A group of 45 male and 48 female college sopho- 
mores who had been administered the Siegel (1956) 
Manifest Hostility Scale (MHS) were selected from
a larger group of 225 males and 351 females who
had taken part in a larger study of academic effec-
tiveness.

Academic effectiveness, conceptually defined as the
degree to which academic performance exceeds or 
falls short of that predicted on the basis of aptitude 
alone, was defined operationally as the difference in 
standard scores between performance as measured by 
grade-point average and aptitude as measured by the 
combined Verbal and Mathematics scores on the 
Scholastic Aptitude Test administered by the Edu- 
cational Testing Service. 
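
The operational definition above (standard score on grade-point average minus standard score on combined SAT) can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the sample values are invented, and the population standard deviation is used because the source does not say which form of the standard deviation was applied.

```python
from statistics import mean, pstdev


def standard_scores(values):
    """Convert raw values to standard (z) scores within the sample.

    Assumption: population SD (pstdev); the source does not specify.
    Raises ZeroDivisionError if all values are identical.
    """
    m = mean(values)
    sd = pstdev(values)
    return [(v - m) / sd for v in values]


def academic_effectiveness(gpas, sat_totals):
    """Effectiveness = z(grade-point average) - z(combined Verbal + Mathematics SAT)."""
    z_gpa = standard_scores(gpas)
    z_sat = standard_scores(sat_totals)
    return [g - s for g, s in zip(z_gpa, z_sat)]


# Invented example: three students with different GPA and SAT standings.
effectiveness = academic_effectiveness([2.0, 3.0, 4.0], [1000, 1200, 1100])
print(effectiveness)
```

A positive score marks a student whose grades stand higher in the sample than his or her aptitude score does, a negative score the reverse, and a score near zero performance in line with aptitude.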

Although use of the above definition yielded a 
normal distribution of academic effectiveness scores 
in the total sample in which the scores were com- 
puted, the scores of the subsamples selected for this 
study did not mirror this normality. Specifically, 
male subjects were fairly equally distributed over the 
range of effectiveness, while a disproportionately 
low number of females were in the midrange of 
effectiveness (within one standard deviation from 
the mean). 

From the MHS three subscales were identified by
having judges sort the items into categories reflect-
ing: the tendency to commit acts of aggression (1),
the tendency to experience feelings of aggression (2),
and the extent to which guilt associated with aggres-
sion is absent (3).3 The first two subscales were
assumed to be indices of a readiness to express ag-
gression relatively directly; a moderate correlation
was found between these scales (r12 = .476). The
Absence of Guilt scale was used as an index of a
tendency to inhibit direct aggression or to displace
aggression; this scale was found to be relatively
independent of the other two (r13 = .229, r23 = .278).

3 Copies of the MHS with subscales identified may
be obtained from the second author.

TABLE 2

MEAN ACADEMIC EFFECTIVENESS FOR MALES AT TWO
LEVELS OF ACKNOWLEDGMENT OF ACTS OF
AGGRESSION AND GUILT OVER AGGRESSION

Acknowledgment of        Guilt over aggression
aggressive acts          High            Low

High                     −.54 (8)        .37a (15)
Low                      −.26 (13)       −.26 (9)

Note. N in each cell is given in parentheses.
a t = 2.38, df = 43, p < .025 (one-tailed test) when this
mean is compared with the mean of the other three cells
combined.

To test Hypotheses 2 and 3, subjects were divided
into levels above or below the median on each of the
three aggression scales. To test Hypothesis 2, sub-
jects both high in acts of aggression and low in guilt
over aggression and subjects both high in feelings of
aggression and low in guilt over aggression were
compared on academic effectiveness with the residual.
To test Hypothesis 3, subjects both low in acts of ag-
gression and high in aggression guilt and subjects
both low in feelings of aggression and high in ag-
gression guilt were compared with the residual.

In addition to the formal testing of the hypothe-
ses, the relationships of academic effectiveness to the
MHS taken as a whole and to each of the three
subscales were assessed individually. To do this,
subjects were divided into three levels of academic
effectiveness: highs (those over one SD above the
mean effectiveness score), normals (those within one
SD above or below the mean), and lows (subjects
over one SD below the mean). Males and females
were classified separately. Comparisons were made
between subjects at each level of effectiveness on the
aggression characteristics in question. This method
was considered advisable due to indications in earlier
studies of nonlinear relationships between academic
effectiveness and personality variables.
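
The median-split comparison used to test Hypotheses 2 and 3 can be sketched roughly as below. The source does not state how ties at the median were handled or which form of the t test was used; this sketch assumes that ties count as "low" and that a pooled-variance t test was applied, which is at least consistent with the reported df of N − 2 (43 for the 45 males). All function names and the sample data are hypothetical.

```python
from math import sqrt
from statistics import mean, median, variance


def median_split(scores):
    """True where a score lies strictly above the median (ties count as low: an assumption)."""
    cut = median(scores)
    return [s > cut for s in scores]


def target_vs_residual_t(effectiveness, high_expression, high_guilt,
                         want_expression, want_guilt):
    """Compare one cell of the 2 x 2 split with the other three cells combined.

    Returns (t, df) from a pooled-variance t test (assumed, not confirmed by the source).
    Each group needs at least two subjects.
    """
    target, residual = [], []
    for score, expr, guilt in zip(effectiveness, high_expression, high_guilt):
        in_cell = (expr == want_expression) and (guilt == want_guilt)
        (target if in_cell else residual).append(score)
    n1, n2 = len(target), len(residual)
    pooled = ((n1 - 1) * variance(target) + (n2 - 1) * variance(residual)) / (n1 + n2 - 2)
    t = (mean(target) - mean(residual)) / sqrt(pooled * (1 / n1 + 1 / n2))
    return t, n1 + n2 - 2


# Invented scores for six subjects.
effectiveness = [1.0, 2.0, 0.0, -1.0, 0.5, -0.5]
acts = [5, 6, 1, 2, 7, 0]
guilt = [0, 1, 5, 6, 7, 8]

# Hypothesis 2 pattern: high direct expression combined with low guilt.
t, df = target_vs_residual_t(effectiveness, median_split(acts), median_split(guilt),
                             want_expression=True, want_guilt=False)
print(round(t, 3), df)
```

For the Hypothesis 3 pattern the same function would be called with `want_expression=False` and `want_guilt=True`.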


RESULTS 


Table 1 shows comparisons of males and fe-
males on each of the aggression characteristics
assessed in this study. Hypothesis 1 was supported
by these data. Males acknowledged significantly
more aggressive acts and showed significantly
less guilt over aggression (that is, more absence of
guilt). No sex difference occurred in acknowledg-
ment of feelings of aggression.

TABLE 1

MEAN SCORES ON AGGRESSION VARIABLES OF MALES
AND FEMALES

Variable                             Males    Females        t

Manifest Hostility Scale
  (full scale)                       15.8     12.2           2.79**
Acts of aggression                   4.69     3.77           2.28
Feelings of aggression               4.58     4.40           .42
Absence of guilt over aggression     1.73     (illegible)    (illegible)

* p < .025, one-tailed.
** p < .01, one-tailed.

Mean academic effectiveness scores for each
combination of direct aggressive expression (acts
of aggression or feelings of aggression) and guilt
over aggression are shown, for males, in Tables 2
and 3 and, for females, in Tables 4 and 5. (Scores
are in units of standard deviations from the
mean.) Males high in tendency to commit acts of
aggression and low in aggression guilt were sig-
nificantly higher in academic effectiveness than
other male subjects, as were males high in feel-
ings of aggression but low in guilt; Hypothesis 2
was therefore supported. Hypothesis 3, that fe-
males low in direct aggression and high in guilt
over aggression would have higher academic ef-
fectiveness than other subjects, was not convinc-
ingly supported although the results approached
significance (p < .10) in both comparisons made.

TABLE 3

MEAN ACADEMIC EFFECTIVENESS FOR MALES AT TWO
LEVELS OF FEELINGS OF AGGRESSION AND
GUILT OVER AGGRESSION

                           Guilt over aggression
Feelings of aggression     High              Low
                           (above median)    (below median)

High (above median)        .04 (8)           .35a (13)
Low (below median)         −.63 (13)         −.11 (11)

Note. Number of subjects under each condition is given in
parentheses.
a t = (illegible), df = 43, p < .025 (one-tailed test) when the mean
is compared with the mean of the other three cells combined.

TABLE 4

MEAN ACADEMIC EFFECTIVENESS FOR FEMALES AT TWO
LEVELS OF ACKNOWLEDGMENT OF ACTS OF
AGGRESSION AND GUILT OVER
AGGRESSION

                           Guilt over aggression
Acknowledgment of          High              Low
aggressive acts            (above median)    (below median)

High (above median)        −.44 (5)          .14 (18)
Low (below median)         .23a (14)         −.41 (42)

Note. Number of subjects under each condition is given in
parentheses.
a t = 1.33, df = 46, p < .10 (ns) when the mean is com-
pared with the mean of the other three cells combined.

TABLE 5

MEAN ACADEMIC EFFECTIVENESS FOR FEMALES AT
TWO LEVELS OF FEELINGS OF AGGRESSION
AND GUILT OVER AGGRESSION

                           Guilt over aggression
Feelings of aggression     High              Low
                           (above median)    (below median)

High (above median)        −.41 (7)          −.12 (14)
Low (below median)         .34a (12)         −.04 (16)

Note. Number of subjects under each condition is given in
parentheses.
a t = 1.66, df = 47, p < .10 (ns) when the mean is compared
with the mean of the other three cells combined.

Mean scores on single aggression character-
istics between subjects at three levels of aca-
demic effectiveness are shown for males in Table
6 and for females in Table 7. While males of
high academic effectiveness showed significantly
less guilt over aggression (that is, more absence
of guilt) than lows, females of high effectiveness
acknowledged more guilt than either normals or
lows. Highly effective males acknowledged more
acts of aggression than normals while highly ef-
fective females acknowledged both fewer aggres-
sive acts (p < .10, two-tailed) and fewer feelings
of aggression than the normal group. However,
females of low effectiveness also acknowledged
fewer aggressive acts than normals.
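
The three-level grouping behind Tables 6 and 7 (highs, normals, lows) can be sketched as a simple cutoff rule on effectiveness scores expressed in standard-deviation units. The function name and example scores are hypothetical, and the treatment of scores falling exactly one SD from the mean (counted here as "normal") is an assumption, since the source says only "over one SD" and "within one SD."

```python
def effectiveness_level(score, mean_score=0.0, sd=1.0):
    """Classify an effectiveness score as 'high', 'normal', or 'low'.

    Scores more than one SD above the mean are 'high', more than one SD
    below are 'low', and everything else (including exactly one SD away,
    by assumption) is 'normal'.
    """
    if score > mean_score + sd:
        return "high"
    if score < mean_score - sd:
        return "low"
    return "normal"


# Invented scores in SD units from the total-sample mean.
levels = [effectiveness_level(s) for s in [1.5, 0.2, -1.2, 1.0]]
print(levels)
```

Because the three groups are compared pairwise, a grouping of this kind can reveal a pattern such as the one reported for females, where both highs and lows fall below normals on acknowledged aggressive acts, which a single linear correlation would not show.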


DISCUSSION 


While males and females differed as expected 
in their acknowledgment of aggressive acts and 
their guilt over aggressive expression, they did 
not differ significantly in the acknowledged feel- 
ings of aggression. These findings in conjunction
suggest that although males and females have
similar aggressive needs, males may express this 
aggression directly without guilt while females 
may feel guilt over these feelings and hence may 
inhibit direct expression. The fundamental dif- 
ference in expression of aggression hypothesized 
to exist between males and females due to their 
differing social roles therefore appears to receive 
strong support. 

Among female subjects low in acknowledgment
of aggressive acts, those high in guilt over ag-
gression were, as hypothesized, higher in academic
effectiveness than subjects in any other category
defined by these variables (.23 SD above the
mean). On the other hand, those females low in
both guilt and aggressive acts were markedly low
in effectiveness (.41 SD below the mean). Al-
though the mean of neither of these groups dif-
fered from the residual at the accepted level of
significance (p < .10 in both cases), the differ-
ences are sufficient to suggest that future studies
on a larger sample might substantiate these re-
lationships. Low direct aggressive expression in
conjunction with low guilt may indicate a low
need for both direct and indirect aggressive ex-
pression. If this is true, the second of the two
relations cited above would suggest that academic
effectiveness may suffer when aggressive needs
are very low. Academic goal attainment may re-
quire a certain amount of competitiveness which
is lacking among females with low aggressive
needs.

TABLE 6

MEAN SCORES ON AGGRESSION VARIABLES OF MALES AT THREE LEVELS OF ACADEMIC EFFECTIVENESS

                                    Means                            Statistical comparisons
Variable                            High (1)  Normal (2)  Low (3)    1 vs. 2      1 vs. 3      2 vs. 3

Manifest Hostility Scale
  (full scale)                      18.2      14.2        15.4       (illegible)  (illegible)  (illegible)
Acts of aggression                  5.54      4.00        4.69       (illegible)  (illegible)  (illegible)
Feelings of aggression              4.77      4.56        4.44       .28          .42          .15
Absence of guilt over aggression    2.23      1.94        1.13       .61          2.56*        1.86

* p < .05, two-tailed.

TABLE 7

MEAN SCORES ON AGGRESSION VARIABLES OF FEMALES AT THREE LEVELS OF ACADEMIC EFFECTIVENESS

                                    Means                            Statistical comparisons
Variable                            High (1)  Normal (2)  Low (3)    1 vs. 2      1 vs. 3      2 vs. 3

Manifest Hostility Scale
  (full scale)                      10.8      15.8        11.9       −2.42*       −.76         1.96
Acts of aggression                  3.78      5.22        3.14       −1.73        1.26         (illegible)
Feelings of aggression              3.78      5.67        4.38       −2.40*       −.95         1.62
Absence of guilt over aggression    .44       1.33        1.05       −3.37**      −2.47*       .79

* p < .05, two-tailed.
** p < .01, two-tailed.

Data in Tables 6 and 7 indicate that the vari- 
ous aspects of aggression considered in this study 
differ appreciably in their relationship to aca- 
demic effectiveness. Moreover, scores on the indi- 
vidual subscales of the MHS appear generally to 
relate more strongly to effectiveness than does 
the full-scale measure of aggression. This sug- 
gests that one reason for the apparently incon- 
sistent data cited in studies in this area has been 
the failure to delineate adequately the types of 
aggression being assessed by the various scales 
used.

One of the more convincing pieces of research 
in this area has shown that high-school male 
underachievers (that is, students with low aca-
demic effectiveness) scored higher on each of
three hostility scales than did normals (Shaw &
Grubb, 1958). This finding is in apparent oppo-
sition to the general relationship between direct 
expression of aggression and academic effective- 
ness found in this study among college males. 
These studies in conjunction may clarify further 
the effect of social role on the relationship be- 
tween personality factors and behavior. Overt 
aggression and dominance, characteristic of the 
male social role at all age levels, may be primar- 
ily manifested by seeking dominance in role- 
relevant areas. These areas may not be the same 
in high school and college. In high school, aca-
demic goals may not be perceived as an aspect of
the dominant masculine role and may even be
seen as in opposition to it. High needs for ag-
gression or dominance may in these years be
manifested in areas more positively defining the
male role (for example, athletics, social leader-
ship, etc.). Males entering college, however, are
likely to see vocational success as a major aspect 
of the dominant male role and perceive academic 
competence as a necessary means of attaining this 
role-relevant goal (Wyer & Terrell, 1963). Aca- 
demics may therefore acquire a positive value and 
hence aggression and dominance needs may be 
directly manifested in these areas, facilitating aca- 
demic effectiveness. 

This interpretation may be generalized to in- 
clude the data obtained from females. When 
direct expression of hostility and aggression are 
acceptable aspects of one’s social role, this ex- 
pression is manifested in areas which are rele-
vant to this role. However, when overt expression
of aggression is inconsistent with one’s social
role, this expression is inhibited in role-relevant
areas and displaced into areas which do not con- 
tradict primary role expectations. 
COMMUNICATOR-RECIPIENT SIMILARITY AND DECISION CHANGE 1


TIMOTHY C. BROCK 2


A field experiment in the paint department of a large retail store supported 
the hypothesis: a recipient’s behavior with respect to an object is modifiable by 
the communicator’s appeal to the extent that the recipient perceives that he and 
the communicator have a similar relationship to the object. A salesman, who 
reported his own magnitude of paint consumption as similar or dissimilar to 
the purchaser’s, attempted to induce the purchaser (N = 88) to switch to a 
different price level. The findings were ordered to theories of identification
(Stotland) and social comparison (Festinger). 


The recipient’s perception of his similarity to 
a communicator or model has been hypothesized 
to account for change toward the communicator 
(Back, 1951; Leventhal & Perloe, 1962), at- 
traction to the communicator (Byrne, 1961; 
Byrne & Wong, 1962; Gerard & Greenbaum, 
1962), for adoption of the model’s preferences 
(Stotland, Zander, & Natsoulas, 1961), ability 
level (Burnstein, Stotland, & Zander, 1961; Stot-
land & Dunn, 1962; Stotland & Hillmer, 1962),
and “anxiety” (Stotland & Dunn, 1963). These
studies, which can be fitted within the framework 
of Stotland’s cognitive theory of identification 
or Festinger’s (1954) theory of social compari- 
son processes, do not yet sum to a set of 
principles shown to yield clear predictions of 
behavioral change in real-life situations. This 
shortcoming was noticed when an attempt was 
made, with the management of a department 
store, to learn more about the effects of inter-
personal influence on consumer behavior.
Management assigned special value to informal,
word-of-mouth endorsement (Katz & Lazarsfeld,
1955) of a product and sought means of im- 
proving interpersonal, rather than media, promo- 
tion. It was proposed that a communicator, who 
is perceived to be thoroughly experienced with 
a product, will be more likely to influence the 
recipient than a less experienced communicator. 
Perceived expertise of the communicator was 
considered crucial in face-to-face contexts. To 
illustrate, management believed that a person 
about to buy a Chevrolet would be less influ- 
enced by a neighbor’s approbation of his car 
(a Ford, say) than an equally enthusiastic en- 
dorsement by the owner of a fleet of Fords. The 
first hypothesis was: the more experience a com- 


1A portion of the data was collected while the 
writer was at the University of Pittsburgh. The com- 
ments of Leroy Wolins are gratefully acknowledged. 

2 Report completed while the author was at Iowa 
State University. 


municator is perceived to have had with an 
object, the more likely it is that the recipient’s
behavior with respect to the object will be modi-
fied by the communicator’s influence attempts.

The second hypothesis followed the aforemen- 
tioned research on communicator-recipient simi- 
larity: to the extent that the recipient perceives 
that he and the communicator share an attribute, 
that is, have a similar relationship to an object, 
to that extent is the recipient’s behavior with 
respect to the object likely to be modified by 
the communicator’s influence attempts. The two
forces, perceived communicator expertise and
perceived communicator similarity, were pitted
against one another by conducting a field experi- 
ment in which the communicator, for half the 
recipients, was perceived as similar but inexperi- 
enced while for the other half, he was perceived 
as dissimilar but experienced. 


METHOD 


In the paint department of a large retail store,3
salesmen attempted to induce paint purchasers to
change to a different price level. There were two
independent variables: the similarity of the salesman
and customer with respect to the salesman’s prior
magnitude of paint consumption; the direction of
the advocated change in price level, upward or
downward. The dependent variable was whether or
not the paint purchaser changed his decision concern- 
ing price level after an influence attempt by the 
salesman. 


3 The experiment was conducted during a 5-month 
period in the paint department of a retail store in 
an Eastern city. The store is a unit in an interna- 
tional chain with annual sales volume greater than 
one billion dollars. A sine qua non for conducting 
the experiment was preservation of anonymity for 
the store and its permanent staff in any public 
report. The writer is indebted to the manager for 
his unusual accessibility and enlightened contribu-
tion to the execution of the research. Thanks are
due J. Blackwood and D. Koenigsberg for their
skill and objectivity in collecting the data.


Procedure


The role of experimenter-communicator was taken 
by two part-time salesmen. The experimenter selected 
customers or subject-recipients for inclusion in the 
experiment, attempted to modify the subject’s price 
decision, and recorded whether or not the influence 
attempt was successful. Selected subjects were ran- 
domly assigned to a similar or dissimilar condition 
and to a higher or lower condition, resulting in a 
2 × 2 design. Influence attempts were restricted to
paint purchases, particularly the kinds and prices 
given in Table 1. In the higher condition, the 
experimenter advocated buying paint at prices in 
Column 3 rather than Column 2 in Table 1. In the 
lower condition, the experimenter advocated pur- 
chases at prices in Column 1 if the subject had 
chosen a price in Column 2, or in Column 2 if the 
subject selected a price in Column 3. In the similar 
condition, the experimenter emphasized that the 
magnitude of his own consumption was the same
as the amount being purchased by the subject; in
the dissimilar condition the experimenter reported his
own magnitude of consumption to be 20 times that
of the subject’s prospective purchase.4
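
The assignment of selected purchasers to the four cells of the 2 × 2 design can be pictured with a short sketch. The report does not say how the randomization was carried out; the balanced shuffle below, the function name assign_conditions, and the seed are assumptions made only for illustration.

```python
import random

SIMILARITY = ("similar", "dissimilar")
DIRECTION = ("higher", "lower")


def assign_conditions(n_subjects, seed=0):
    """Return one (similarity, direction) pair per subject.

    Assumed scheme: the four cells are repeated to cover all subjects
    and the resulting schedule is shuffled, so that the cells stay
    balanced when n_subjects is a multiple of four.
    """
    cells = [(s, d) for s in SIMILARITY for d in DIRECTION]
    schedule = [cells[i % len(cells)] for i in range(n_subjects)]
    random.Random(seed).shuffle(schedule)
    return schedule


# 88 purchasers, as in the completed experiment.
schedule = assign_conditions(88)
```

With 88 purchasers this schedule places 22 in each cell, matching the cell size reported for the completed experiment, although the original procedure may have reached that balance in a different way.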

The treatments were administered after the subject 
indicated he wished to purchase X units at a given 
price and had proceeded with the experimenter to 
the cash register to “ring up” the sale. At this point, 
the experimenter delivered himself of the following 
well-practiced appeal. Alternative terms for the dis-
similar and higher conditions appear in brackets:


Listen, I just thought, I wonder if I can give 
you some advice. I’m going to college right now 
and working here part-time as a salesman to meet 
my expenses. Two weeks ago I bought X [20 X] 
gallons of a to help my dad on some work like 
your b that we were doing. It costs a little less 
[more] and [but] it turned out beautifully. I also 
got a little of the c you want to buy, and, hon- 
estly, it didn’t work out as well at all. There just 
was no comparison. Those X [20 X] gallons at 
a have proved out terrifically for us in every way.
[Pause] On the basis of my experience with X
[20 X] gallons at a, I can really recommend your 
changing to the a if you want to. I certainly 
would if I were you. 


The italicized a above refers to the type of paint 
at the advocated price; b refers to details of the 
experimenter’s “job,” described so as to seem akin 
to the subject’s and, in the dissimilar condition, to 
require the use of much paint; and c refers to the 
subject’s initial selection of paint and price level. 
In an actual appeal to a subject who decided to 
buy floor paint at $6.98 per gallon, the variations 
were: a, “the 4.98 paint” or simply “4.98”; b, 


4 The multiple of 20 was chosen because the
experimenters’ experience in the paint department
led them to believe such a magnitude would be per-
ceived by the typical paint purchaser as plausible
but at the same time very different from his own
acquisition of a gallon or two.


TABLE 1
PAINTS AND PRICE PER GALLON IN DOLLARS


Type of paint 1 2 3
Latex 2.97 5.59 6.90
House (outside) 5.89 6.98 
Floor 4.98 6.69 
Semigloss 3.98 6.59 7.98 


“porches and basement too [in some duplexes]”; c,
“6.98.” The material in brackets would have been
omitted in the similar condition. 

In addition to noting whether or not the influ- 
ence attempt was successful, the experimenter 
recorded the original price, the advocated price, 
whether the sale was cash or charge, estimated 
the subject’s age, and noted any contaminating
factors. 

Communicator Standardization and Ratings of 
Appeals 

The experimenters appeared before the writer’s
students (class Ns = 25 and 18) who rated their
appeals using Likert scales with anchors labeled 
“none at all” and “to a very high degree.” The 
experimenters were rated for warmth, expertness, 
believableness, speaking effectiveness, and the likeli- 
hood that “this salesman would make me change 
my mind.” In addition, the raters were asked to 
reproduce the appeals in writing. The raters were 
told that the experimenters were sales trainees and 
the ratings would not affect the trainee’s position 
and salary but would be important in deciding
what kind of sales job he was given. The experi-
menters spoke in a balanced order before the two
classes. The rating forms asked the raters to imagine
they had just bought “some paint” at a specified
price level: in this fashion half of each class was
given a higher, and half, a lower treatment. The 
dissimilar experimenter reported his prior consump- 
tion as 20 gallons; the similar experimenter, as 1 
gallon. Detailed description of the communicator’s 
“job” and devaluation of the recipient’s price were 
omitted. There were only three statistically reliable 
(p < .05, two-tailed) outcomes: one of the experi-
menters was rated higher than the other in
“warmth”; the dissimilar communicator was rated 
higher in expertness, defined for the rater as “knows 
what he is talking about”; and female raters gave 
higher ratings. 

Both experimenters knew the hypotheses but their 
preferences, prior to the experiment, did not agree. 
The experimenters returned data cards to the writer 
biweekly and they were not informed about the 
results during the course of the data collection. Since 
they were employed in the paint department any- 
way, the experimenters felt they might enliven a 
humdrum selling task by sometimes making it a 
meaningful scientific activity. The experimenters re- 
ceived no additional remuneration or inducements of 
any kind. One of the experimenters had taken ad-
vanced psychology courses; the other, a chemistry
major, had taken only an introductory psychology
course. The experimenters did not collect data at the
same time. 




Selection of Subjects 


A small proportion of the purchasers entering the 
paint department during the period of the experi- 
ment was selected as subjects. No attempt was made 
to collect data when: neither experimenter was on 
duty; the paint department or store was very 
crowded; there was a sale in the store, in the paint 
department, or in adjacent departments; remodeling 
was being done in the paint or adjacent departments; 
the sales help in the paint department was below full 
strength. A purchaser was excluded as a subject if: the 
purchaser was female or was accompanied by other
persons; the purchaser was completing a project and
returning for more supplies; the purchaser was not
exclusively interested in the products listed in 
Table 1; the desired color was not available at the 
price level to be advocated by the experimenter; 
special color mixing was necessary; the purchaser 
did not appear to make a choice or decision concern- 
ing price but seemed to the experimenter to have 
had a definite price in mind before entering the 
paint department. The purchaser was excluded if 
he forced the experimenter to give information that 
was favorable or unfavorable to a certain price level 
prior to the influence attempt or in addition to the 
influence attempt. For example, if a purchaser wanted 
to know why the $6.90 Latex cost more than the 
$5.59 Latex the experimenter would first say “prob- 
ably different chemicals in them.” In many instances 
this ambiguous reply was insufficient and the experi-
menter had to endorse or evaluate the paints. A
purchaser was excluded if his price decision did not
permit the experimenter to apply the higher (or
lower) treatment which had been randomly deter-
mined beforehand. When the experimenter was inter-
rupted and not allowed to complete his influence
attempt, the purchaser’s data were omitted. Finally, 
the experimenters excluded purchasers who seemed 
disturbed (as reported by the experimenters) by 
words on the paint can label, such as “enamel.” In 
sum, the subject obviously made a price decision, 
the surround was normal and tranquil, only the 
products in Table 1 were involved, and the experi-
menter was able to apply the treatments in stand-
ardized fashion. Data collection was terminated when
the experimenters completed their period of employ-
ment at the store. At this time 88 subjects had
been run, 22 in each of the four conditions, higher
or lower, similar or dissimilar.


TABLE 2
NUMBER OF PAINT PURCHASERS WHO CHANGED DECISION IN RELATION TO DIRECTION OF
INFLUENCE ATTEMPT AND SIMILARITY OF COMMUNICATOR’S CONSUMPTION

Direction of influence attempt    Similar      Dissimilar   Total
To a lower price level            16 (73%)     10 (45%)     26 (59%)
To a higher price level           12 (55%)      7 (32%)     19 (43%)
Total                             28 (64%)a    17 (39%)

Note. Cell N = 22 purchasers.
a Chi-square, p < .05.


RESULTS AND DISCUSSION 


Before evaluating the hypotheses it was neces- 
sary to ascertain whether bias was introduced 
by excluding purchasers who did not make a 
decision about price, forced the experimenter to 
enlarge on the content of his influence attempt, 
elicited supplemental evaluation from the experi- 
menter, or interrupted and prevented the experi- 
menter from completing his appeal. If purchasers 
in these categories were disproportionately repre-
sented in the four cells of the design, the test
of the hypotheses could be impugned. Appropri- 
ate analysis of the data revealed no relationship 
between the independent classifications (higher 
versus lower, similar versus dissimilar) and the 
frequency of subjects who made no real price 
decision, who forced enlargement or endorse- 
ment, or who interrupted the experimenter. All 
exact test and chi-square p values were greater 
than .25.5 

The same pattern of outcomes was obtained 
for both experimenters, for cash and charge pur- 
chases, and for purchasers estimated to be over 
and under 40 years of age. The combined results, 
shown in Table 2, favored the second hypothesis. 
The dissimilar communicator, although presum- 
ably perceived as more knowledgeable, was less 
effective than the communicator whose paint con- 
sumption was the same as the purchaser's 
(p < .05). There was an expected tendency for 
downward influence attempts to encounter less 
resistance than upward advocacy.
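
The comparison of the similar and dissimilar communicators can be checked from the marginal totals of Table 2 (28 of 44 purchasers changed in the similar condition, 17 of 44 in the dissimilar condition). The report does not state whether a continuity correction was applied, so the sketch below computes the 2 × 2 chi-square both ways; the function name chi_square_2x2 is an assumption for illustration.

```python
import math


def chi_square_2x2(a, b, c, d, yates=False):
    """Chi-square (1 df) for the 2 x 2 table [[a, b], [c, d]].

    Returns the statistic and its upper-tail p value. With yates=True
    the usual continuity correction (subtracting N/2) is applied.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2, 0.0)
    chi2 = n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For 1 df, P(X > x) equals erfc(sqrt(x / 2)).
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p


# Rows: similar, dissimilar. Columns: changed, did not change.
chi2, p = chi_square_2x2(28, 16, 17, 27)
chi2_c, p_c = chi_square_2x2(28, 16, 17, 27, yates=True)
```

Under either computation the statistic (roughly 5.5 uncorrected, 4.5 corrected) falls beyond the .05 level, which agrees with the significance level given in the table footnote.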

Note that half of the 88 subjects changed in 
response to the influence attempt. No base line 
was available for evaluating this overall fre- 
quency of change; it could be attributed to an 
inherent instability of price decisions or to the 
elaborate screening of subjects. In any case, 
recent research has emphasized postdecision phe- 
nomena other than decision revocation (Brehm & 
Cohen, 1962).

The findings add clarity to the literature in 
that previous attempts to demonstrate before- 


5 The exclusion of subjects might still have biased 
the outcome if, for some reason, the excluded types 
of subjects were more modifiable by a dissimilar 
than by a similar communicator. No rationale for 
this possibility could be constructed, however, and, 
in any event, the population, to which inferences 
were allowable, was defined as paint purchasers given 
the present treatment, nothing more and nothing less.




after change, as a function of identification with
a model, yielded ambiguous results (Burnstein
et al., 1961; Leventhal & Perloe, 1962). The
present test of the perceived similarity hypothe-
sis was considered stringent because the expertise
effect, to the extent that it was operative, worked 
against confirmation of the hypothesis. Also op- 
posing confirmation was the presumed reluctance 
of purchasers in the higher condition to spend 
more money than they had initially decided upon. 
The present results increased understanding of 
why a face-to-face encounter may be more ef-
fective in modifying a recipient than the im-
portunities of mass-media communicators (Katz
& Lazarsfeld, 1955); the recipient changes toward
the position of a communicator to the extent he
perceives that he shares with the communicator
an attribute pertinent to the dimension along
which change is advocated. An interesting prob-
lem for subsequent research is whether real-life
behavior can be modified as readily when the 
similarity attribute is irrelevant to the change 
dimension. 

Subsequent research must also deal with a 
possible alternative explanation for the present 
findings. Perhaps the decisive factor in producing 
the observed differences was not the similarity 
of the “X gallon” communicator but the dis-
similarity of the “20 X gallon” communicator.
The subject may have rejected the dissimilar
communicator because his experience seemed ir-
relevant; or the subject may have resented the
experimenter’s “showing off” how much paint he
used or disbelieved the experimenter’s figure and
regarded it simply as a sales ploy. If these proc-
esses were operative, it would affect the interpre-
tation that similarity is a sufficiently powerful
factor to overcome the effects of expertness. Al-
though pretesting of the communicators showed
no differences in their believableness, and the
dissimilar communicator was rated higher in
expertness, future replications should include a
control group in which the appeal to change is
unaccompanied by inductions of either similarity 
or expertness. 

The cognitive theory of identification (Stotland
et al., 1961) fits the present results, but it does
not specify the likely motivational factors as
well as the more dynamical theory of social
comparison processes (Festinger, 1954). In terms
of Festinger’s theory, the purchaser wanted to
evaluate the correctness of his price decision and,
in the absence of nonsocial means of evaluation,
would be sensitive to the judgment of other
persons. The salesman, of course, provided the
wanted standard but when he was perceived as
noncomparable on the relevant issue of paint
consumption, the purchaser was neither attracted
to him nor motivated to reduce discrepancy
between the chosen and advocated price levels.

Kelman’s (1962) presentation of hypotheses 
dealing with action and attitude change provided 
a third theoretical view. Kelman reasoned that 
gradients of approach or avoidance based on 
“identification” are steeper than those based on 
“internalization.” Hence, when the purchaser’s 
approach toward another price level was based 
on identification with the communicator, this 
sufficed to overcome avoidance based on inter- 
nalized values such as “spend no more than 
necessary” and/or “take no advice from inex- 
perienced persons.” The relationship between 
communicator-recipient similarity and modifica-
tion of the recipient requires further research in
which an attempt is made to sort out contrasting 
predictions from theoretical formulations such 
as the three here examined. 

Finally, the present study showed that con- 
temporary theories in social psychology are not 
necessarily inadequate to “study the powerful
forces which affect people in the real social world 
[Katz & Stotland, 1959, p. 467].” With some 
noteworthy exceptions (Hovland, 1961; Schach- 
ter, Willerman, Festinger, & Hyman, 1961), the 
problem may lie rather in the understandable 
reluctance of experimenters to forego the com- 
fort, convenience, and methodological refinement 
of the laboratory.


JESSE E. GORDON AND EDWARD SMITH
University of Michigan 


The major hypothesis tested was that aggression in children can be reduced 
by the arousal of the affiliation motive. Children of 2 age groups were exposed 
either to a story designed to arouse affiliation drive or to a neutral story. 
Aggression measured in doll play before and after the story reading was taken
as a reflection of conflict effects. The results indicate that the story arousal
inhibited aggression in some children, replicating earlier findings. In addition,
a comparison of parental attitudes on child training and aggression showed
that (a) the stricter the girl’s mother the more aggressive the daughter, if the
mother used physical punishment; and (b) the stricter the boy’s mother the
less aggressive the son, especially if physical punishment was not used.


In a previous study (Gordon & Cohn, 1963) it
was found that doll-play aggression in 4-5 year
old children in a publicly supported nursery
school could be reduced by reading the children a
story designed to arouse affiliation motivation.
The present study was designed to extend the
earlier findings to different populations of chil-
dren: 3-4 year olds in a more middle-class nurs-
ery school, and 6-7 year olds in a public ele-
mentary school.

The present investigation also examines the
relationships among parental training practices,
the aggression of their children in the doll play,
and the children’s responsiveness to the affiliation
story. Our interest in parental attitudes of per-
missiveness for aggression, general strictness, and
the use of physical punishment is based on pre-
vious work by Sears, Maccoby, and Levin (1957),
whose findings concerning these variables were
inconsistent in some respects, and were based on
maternal reports of their children rather than
direct observation.

We wish to thank Irina Weiss-Marbach of the
University of Hamburg for her help in the analyses
of the data presented here.


METHOD 
Subjects 


Subjects were 48 children, equally divided among
males and females, young (3-4 years, in private
nursery schools in Ann Arbor, Michigan) and old
(6-7 years, in public elementary schools serving the
same areas as the private nursery schools from
which the young sample was obtained). Half of each
group was assigned to the experimental (need affili-
ation arousal) group and half to the neutral group.


Procedures 


Subjects were tested in their homes by one experi-
menter while another interviewed the mother and
administered the questionnaires to her. The pro-
cedures for testing the children were as described by
Gordon and Cohn (1963). These involved two struc-
tured doll interviews separated by the reading of the
experimental or control story. In the original study,
the story reading and second doll interview came 
about 1 week after the first interview. In the present 
study they were in the same session, with an interpo- 
lated task replacing the week interval. The interpo- 
lated task consisted of presenting to the child several 
dolls other than those used in the interviews, and 
asking the child to select one from the group in 
response to questions about his perceived self (Which 
one is like you?) and his admired goal (Which one 
would you like to be like?). This task was part of
another study on identification.

In the doll interviews, subjects were given a doll 
for them to use in responding to the experimenter’s
doll, which frustrated the subject’s doll through an
imagined situation described by the experimenter
(two children at a playground, with the experiment-
er’s doll insulting and challenging the subject’s doll).
The subject’s responses were recorded at the 12 
points at which the subject was asked to respond. 
Each of the 12 responses was later rated as aggres-
sive or not. The situations used in the first and sec-
ond doll-play interviews (DP1 and DP2) were simi-
lar. The dolls used were rubber hand puppets repre-
senting a whale and an alligator, manufactured by
Creative Playthings, Incorporated. 

The experimental story consisted of a short narra- 
tion by the experimenter of a story in which a young 
dog searches for but cannot find any friends with 
whom to play. The control story was exactly the 
same as the experimental story in all wording except 
that the word “ball” was substituted for “friend,” 
thus converting the story from one concerned with
affiliation needs to a story about a dog looking for a
ball with which to play. At the conclusion of the
story, the subject was asked whether the dog would
rather play with a pictured dog shown him by the
experimenter, or with a pictured ball also presented.
This choice allowed an independent check on the
subject’s understanding of the story theme.




Questionnaires 


While the child was being tested in one room, an-
other experimenter interviewed the mother and ad-
ministered questionnaires in another room. One
questionnaire was designed to measure general strict-
ness. It consisted of 33 children’s activities which
might be considered obnoxious by the parent (for
example, eating between meals, climbing on furni-
ture, staying in the same room with company, etc.),
with the instruction to check those activities which
they did not permit their children to engage in. The
second questionnaire was designed to measure per-
missiveness for aggression in particular. It consisted
of 19 aggressive acts by children (spitting in anger,
throwing toys, deliberately breaking objects, slapping
other children), with instructions to the parent to
indicate the age at which she would expect her child
to not engage in such action any longer. This sched-
ule was modeled after Winterbottom’s measure for
independence training attitudes (McClelland, Atkin-
son, Clark, & Lowell, 1953). In addition, the mother
was asked to select from five alternative types of
disciplinary actions, ranging from spanking to ignor-
ing, which she would take if the child engaged in the
aggressive action when he was a year older than the
age at which the mother felt it should have been
extinguished. A physical punishment score was ob-
tained by counting the number of times that the
mother selected “spanking” or “rough handling,
shaking.”


TABLE 1
CHANGE IN AGGRESSION SCORES

          Experimental   Control
Young         −.10         2.43
Old           2.25          .81
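
The physical punishment score described in the Questionnaires section is a simple count over the 19 aggressive acts. A minimal sketch follows, assuming each response is stored as the label of the chosen disciplinary action; apart from “spanking,” “rough handling, shaking,” and “ignoring,” the labels in the example (such as “scolding”) are hypothetical, since the report does not list all five alternatives.

```python
# The two alternatives the report counts as physical punishment.
PHYSICAL = {"spanking", "rough handling, shaking"}


def physical_punishment_score(selected_actions):
    """Count the acts for which the mother chose a physical form of discipline."""
    return sum(1 for action in selected_actions if action in PHYSICAL)


# Hypothetical mother: one chosen action for each of the 19 aggressive acts.
responses = (
    ["spanking"] * 5
    + ["rough handling, shaking"] * 2
    + ["ignoring"] * 10
    + ["scolding"] * 2  # hypothetical label, not taken from the report
)
score = physical_punishment_score(responses)
```

For this hypothetical mother the score is 7, which is close to the group means reported for the physical punishment variable.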


RESULTS 


Collier’s (1958) analysis of variance for mixed 
designs was applied to the aggression scores of 
DP1. There were no significant differences as-
sociated with the experimental condition, indi-
cating that before the story-reading need affilia-
tion arousal, the experimental and control groups
were not different in aggression. There were also 
no significant differences associated with age, but 
boys expressed significantly more aggression than 
girls in the first doll play (F = 4.21, p < .05). 

Because of sex differences in initial level of 
aggression, the experimental effects were evalu-
ated through change scores from the first to the
second doll play. In this analysis, there were no
significant differences associated with sex, age, or
experimental condition, but the interaction be-
tween age and experimental condition was sig-
nificant (F = 8.6, p < .01). Analysis of this inter-
action revealed significant differences between the
experimental and control conditions for the
younger subjects (t = 2.55, p < .05) but no dif-
ference for the older (t = 1.60, p > .05). The
means of these groups are presented in Table 1. 
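
The change scores used in this analysis can be illustrated as follows. The sketch assumes that a child’s aggression score in each doll play is the number of the 12 responses rated aggressive, and that change is taken as second minus first doll play, so that positive values mean an increase; the report does not spell out either point, and the function names and example ratings are hypothetical.

```python
def aggression_score(ratings):
    """Number of the 12 doll-play responses rated aggressive (assumed scoring)."""
    if len(ratings) != 12:
        raise ValueError("expected one rating per response point (12)")
    return sum(1 for rated_aggressive in ratings if rated_aggressive)


def change_score(dp1_ratings, dp2_ratings):
    """Change from the first to the second doll play; positive means more aggression."""
    return aggression_score(dp2_ratings) - aggression_score(dp1_ratings)


# Hypothetical child: 3 aggressive responses in the first doll play, 5 in the second.
dp1 = [True] * 3 + [False] * 9
dp2 = [True] * 5 + [False] * 7
delta = change_score(dp1, dp2)
```

Group means of such per-child differences are the kind of quantity summarized in Table 1.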

These results indicate that the experimental 
story was able to counteract the usual tendency 
for children to increase in aggression as a func- 
tion of doll-play trials, but that this was only 
true for the younger nursery-school subjects. Al- 
though it is possible and reasonable that young 
subjects may be more responsive to affiliation 
arousals, because of their greater dependency, it 
is also possible that the story used was simply 
inappropriate for the older children, who are 
beyond the age at which simple stories about
personified animals are convincing. It does not
seem likely that possible social-class differences
between children who attend nursery school and
those in public elementary school would account
for these results. If anything, the elementary
school children would come from a somewhat
lower socioeconomic level than the nursery chil-
dren, but certainly no lower than the children of
working mothers in the earlier study who behaved
very much like the younger children in this
study.


Parent Attitude Scales and Aggression


The measures of parental permissiveness and
use of physical punishment were modestly but
significantly intercorrelated (see Table 2).

The modest correlation between permissiveness
for aggression and strictness, almost identical to
the −.30 obtained by Sears et al. (1957) for the
relationship between permissiveness for aggres-
sion and restrictiveness in other ways, indicates
that mothers do make discriminations between
aggressive and other kinds of undesirable be-
haviors in their children.


TABLE 2
INTERCORRELATIONS OF SCALES

                                                  Boys     Girls    Combined
Permissible aggression and general strictness                        −.33**
Permissible aggression and physical punishment    −.26     −.38*     −.33**
Physical punishment and general strictness         .02                .28*

*p < .05.
**p < .01.


The correlations reported in Table 2 are about
the same for the two sexes, except for those be-
tween general strictness and physical punishment,
where it is found that the more strict a girl’s
mother, the more likely she is to use physical
punishment in controlling aggression while there
is no evidence for such a tendency among mothers
of boys. This is true even though the mother’s
standards for permissible aggression are about the
same for the two sexes, as revealed by the means
presented in Table 3, in which no significant dif-
ferences are found between the permissiveness
for aggression, strictness, and physical punish-
ment scores of mothers of boys and mothers of
girls. Thus, while girls are not necessarily pun-
ished physically more than boys, their punish-
ment is more closely tied to levels of maternal
strictness.


TABLE 3
MEAN SCORES ON THE PARENT VARIABLES

                          Boys     Girls
Permissible aggression    70.57    72.03
General strictness        10.14    12.67
Physical punishment        7.25     7.27


The correlations between the parent scales and
aggression as measured in DP1 are presented in
Table 4. The only significant correlation is with
maternal strictness. The surprise is that the cor-
relation is reversed in the two sexes. Sears et al.
(1957) report “a slight tendency for mothers who
had high restrictions to report that their children
were low in aggression,” but these data were not
broken down by sex of child. The sex differences
in the punishment-strictness correlations may
account for the difference between the sexes in
the strictness-aggression correlations. If strict
mothers of girls use physical punishment more
than do mothers of boys, then the sex differences
in the strictness-aggression relationship might be
a function of physical punishment. To test this
possibility, the children were divided into high
and low physically punished groups, and the
correlations between strictness, permissiveness
for aggression, and aggression within the two
levels of punishment examined. This breakdown
of correlations is presented in Table 5.


TABLE 4
CORRELATIONS BETWEEN PARENT QUESTIONNAIRES AND CHILDREN’S AGGRESSION

                                   Boys
Physical punishment and DP1         .07
Permissible aggression and DP1      .19
General strictness and DP1         −.50**

*p < .05.
**p < .01.


TABLE 5
CORRELATIONS BETWEEN PARENT ATTITUDES AND AGGRESSION IN LOW AND HIGH
PHYSICALLY PUNISHED GROUPS

Variable                            Girls
Permissible aggression and DP1
  High                             −.66**
  Low                              −.18
General strictness and DP1
  High
  Low                               .07

**p < .01.


As indicated in Table 5, sex differences are
emphasized by the analysis. There is a
tendency for parental permissiveness for aggres-
sion to be associated with low aggression in girls
who are physically punished, but this relationship
is not found in less physically punished girls, nor
is it found at all among boys. On the other hand,
our data show a strong relationship between ma-
ternal strictness and aggression, but it is different
for the two sexes. The more strict a girl’s mother,
the more aggressive the daughter, if the mother
uses physical punishment, but this is not found
for girls not physically punished. On the other
hand, for boys the more strict their mothers, the
less aggressive they are, especially if they are not
physically punished. These relationships lend
some support to the hypothesis that children’s
imitation of parental aggression (physical pun-
ishment) depends to some extent on the sex of
the child; the data are consistent with Sears’
notion that girls imitate maternal aggression but
do not support the hypothesis that boys do
also.


DISCUSSION 
ww qvedemess; Practice Variable or 
Vert 

ond doll-p's a, 
bees do’ findings with those of Sears 
Creat vei yg striking similarities and dif- 
The expe hile Sears et al. found that 
tion by thie geness for aggression is associ- 
dog sear Jion, we find that the relation- 


whom £ 413 the use of physical punishment 
same as ti; ‘chila, and is reversed in girls from 


that the d in their study. They interpreted, 
thus e'g Ẹ to mean that permissiveness „for 
ba pows a child to practice aggressive 


behavior, while punishment serves as an aggres- 
sion-instigating frustration, and a model for the 
child to imitate. Our findings with respect to 
girls are more consistent with an interpretation 
of permissiveness as a measure of the extent to 
which the mother produces aggression-instigat- 
ing frustration, for the less the permissive- 
ness, the greater was the girls’ aggression. These 
differences between our data and Sears’ may be a 
result of the differences in our measures of ag- 
gression. The present study used direct measures 
of the children’s aggression, while Sears used 
maternal reports which may have been biased 
by mothers’ needs to rationalize their own use 
of punishment, or by special sensitivities to ag- 
gression in mothers who punish extensively, or 
by defensive projections of the mothers’ own 
aggression onto the children. 

Although permissiveness for aggression and 
general strictness are correlated, unlike Sears et 
al. we find that general strictness is more highly 
related to children’s aggression than is permissive- 
ness for aggression. In the case of girls, the data 
further support the interpretation of the strict- 
ness-permissiveness dimension as one which func- 
tions as a potential source of frustration-instigat- 
ing aggression, with aggression increasing with 
restrictiveness. 

Punishment and Models 

We find some logical support for the modeling 
function of maternal aggression. If we accept the 
proposition that parental restrictiveness provides 
a source of frustration and punishment provides 
a model for the child to imitate, then we should 
expect that the correlation between strictness and 
aggression should be greater in the highly pun- 
ished girls than in the highly punished boys; this 
was found. Thus our data support the notion that 
the child’s identification with the parents is an 
important variable in his responsiveness to their 
control techniques. 


Parental Control Techniques and Child’s Identi- 
fication 


The negative correlation between strictness and 
DP, aggression in the less physically punished 
boys is more difficult to interpret. It indicates 
that the more strict the mother, the less aggres- 
sive the boy, if the boy is not physically pun- 
ished. One speculative possibility might be sug- 
gested: if the mother supports her restrictiveness 
with love-oriented techniques of discipline, rather 
than physical punishment, it puts the son in a 
difficult position. If he violates her standards, he 
loses her love. Boys, of course, are less likely to 
identify with their mothers than girls, and so do 
not imitate the mother’s use of physical punish- 
ment, as indicated above. According to psycho- 
analytic theory, one function of identification is 
to free the child somewhat from dependence on 
the external source for affectional rewards by 
internalizing the parental rewards (Whiting & 
Child, 1953). If a mother withdraws love from 
her son, who does not identify strongly with 
her, then the son has no resource for replacing 
the loss. According to this theory, loss of ma- 
ternal love to a boy is then a more serious threat 
than it might be to a girl, and is one which the 
boy is therefore less likely to risk. Thus the boys 
in our study respond to maternal strictness by 
inhibiting aggression, thereby lessening the threat 
of loss of love. While this account is admittedly 
speculative and goes beyond the data we have, 
there is some evidential support for it. 

One source of support is the finding (Gordon & 
Smith, 1963²) that girls are more closely identi- 
fied with mothers who use physical punishment, 
while boys whose mothers do not use physical 


² Unpublished study entitled “Developmental 
Processes in the Identification of Boys and Girls.” 
punishment are more closely identified with their 
fathers. That is, boys whose mothers are more 
likely to use love-oriented techniques of control 
do not identify with their mothers. Thus we have 
evidence that the non-physically-punished boys 
are less identified with their mothers than the 
physically punished girls. 

A second source of support comes from our 
findings concerning the reactions to the story- 
arousal procedures. The correlation between ma- 
ternal strictness and tendency to increase in ag- 
gression over the two doll-play sessions is r= 
—.70 among the boys in the control group (p < 
.01). This correlation indicates that the stricter 
the mother, the greater the tendency for the son 
to increase in aggression during the permissive 
doll play. This suggests that boys of strict moth- 
ers are relatively inhibited at the beginning of 
doll play, but that the permissiveness of the ex- 
perimenter allows a loosening up in control so 
that these boys increase in aggression while the 
sons of permissive mothers who are therefore less 
inhibited at the beginning do not respond to the 
permissiveness in the doll play by disinhibiting. 
This relationship was not found in girls. Of even 
more significance is the finding that the —.70 cor- 
relation between strictness and aggression increase 
in the boys is reduced to a nonsignificant —.09 
in the experimental group. We interpret this to 
mean that the reading of a story about loneli- 
ness (that is, lack of love) prevents the restricted 
boys from increasing in aggression any more than 
do the nonrestricted boys. Thus the story about 
loneliness provided an inhibition to the restricted 
boys not present in the control story and not 
effective for unrestricted boys. The ability of a 
story about lack of love to prevent disinhibition 
in the sons of restrictive mothers adds some 
weight to the notion that these boys’ mothers 
enforce their restrictiveness by love-oriented 
techniques to which the boys respond by inhibi- 
tion of aggression. 

Our findings also raise the interesting possibil- 
ity that the same parental practices may have 
different effects on girls and boys. It is usually 
thought that if the sexes differ in aggressiveness, 
it must be because the parents differ in their 
treatment of boys and girls. Sears et al. (1957) 
had found that parental behaviors were different 
for the two sexes, but boys did not differ from 
girls on their measure of aggression. We found 
that the parents treated their sons and daugh- 
ters in essentially similar ways, at least as far as 
scores on permissible aggression, general strict- 
ness, and physical punishment were concerned, 
but that the sexes did differ in aggression. Taken 
together, these results suggest that differences in 
aggression between boys and girls may not be a 
function of differences in parental training prac- 
tices, but they may be a function of differences in 
the children’s responsiveness to those practices. 
Our data thus support Eron’s (1963) conclusion 
that it is impossible to generalize from boys to 
girls in research on socialization, especially as far 
as aggression is concerned. 


Affiliation-Aggression Conflict 


Some comments concerning the affiliation 
arousal are in order. While our data confirm the 
earlier study which found that reading a story 
about loneliness to children could arouse motiva- 
tions which conflict with aggression, they do so 
only with reference to the nursery-school chil- 
dren in the present study. Nevertheless, the 
change in the general strictness-aggression in- 
crease correlation in boys from —.70 in the con- 
trol group to —.09 in the experimental group 
suggests that there were effects attributable to 
the story. These effects, interpreted as evidence 
for the ability of the story to inhibit aggression 
of boys with a high potential for disinhibition 
when in permissive situations, are thus seen to be 
selective with respect to personality character- 
istics of the audience members. To the findings 
of Himmelweit, Oppenheim, and Vince (1958) 
and Eron (1963) indicating that television view- 
ing of aggressive themes is correlated with ag- 
gressive behavior in children, and the finding of 
Gordon and Cohn (1963) that story themes af- 
fect motivational states, we may now add the 
further suggestion that the effects of such themes 
are selective, depending on personality character- 
istics of the audience. One might draw the fur- 
ther conclusion, though this is more speculative, 
that the effects of the story themes depend on 
predispositional potentials toward the motiva- 
tional state with which the theme is concerned. 

A final word about sex differences is in order. 
We did not find aggression in girls to be more 
powerfully affected by the affiliation theme than 
is aggression in boys, nor did the earlier study. 
Further, our analysis suggests that it was the 
boys whose aggression was more responsive to a 
connection between affiliation arousal and love- 
oriented techniques of maternal discipline. This 
is inconsistent with other data, such as Mc- 
Clelland et al. (1953) and Bronfenbrenner’s 
(1961) which suggest that females are more re- 
sponsive to such stimuli. However, the present 
study and its predecessor were concerned with 
girls much younger than those studied by Mc- 
Clelland et al. and Bronfenbrenner, and it is 
possible that responsiveness to the availability of 
love and affiliation may change with age. 
ATTRACTION AS A LINEAR FUNCTION OF PROPORTION OF 
POSITIVE REINFORCEMENTS ¹ 


DONN BYRNE AND DON NELSON 


University of Texas 


In various investigations of the effects of the similarity of a stranger's attitude 
on attraction toward him, the proportion of similar attitudes has not been 
distinguished from the number of similar attitudes. In a 4 × 3 factorial design, 
4 levels of proportion and 3 levels of number were employed. Each of 168 Ss 
was asked to read an attitude scale purportedly filled out by an anonymous 
stranger and to evaluate him on a number of variables including attraction 
toward him. As hypothesized, analysis of variance indicated that attraction 
was significantly (p < .001) affected only by proportion. Utilizing these and 
other data for a total of 790 Ss, the functional relationship between proportion 
of similar attitudes and attraction was found to be a linear one. The con- 
ceptualization of attitude similarity as constituting positive reinforcement was 
strengthened by the finding of a linear relationship in McDonald’s (1962) 
data between proportion of high creativity ratings given to 192 Ss and their 
attraction toward the rater. 


It is a well-established finding that the at- 
traction of a subject toward a stranger is a func- 
tion of the similarity or dissimilarity of the lat- 
ter’s attitudes and values to those of the subject. 
When similarity is manipulated experimentally, 
attraction is found to increase as the similarity 
of the stranger increases (Byrne, 1961a; Jones & 
Daugherty, 1959; Schachter, 1951; Smith, 1957). 
In order to proceed beyond these empirical find- 
ings to build a genuine theory of attraction, it 
would be helpful to obtain a clearer identifica- 


¹ This research was supported in part by the 
United States Air Force under Grant AF-AFOSR 
261-63 from the Air Force Office of Scientific Re- 
search of the Air Research and Development Com- 
mand. 

The authors wish to thank June Goldberg, Rex 
Golightly, Nancy Johnson, Patricia Yale, and Betsy 
Young for their assistance on this project. 


tion of the stimulus which evokes differential at- 
traction responses and a more precise specifica- 
tion of the relationship between this stimulus and 
attraction. 

In a series of papers, Newcomb (1953, 1956, 
1959, 1961) has dealt with the antecedents of 
interpersonal relationships. Along with other 
principles he has proposed that attraction between 
individuals is a function of the extent to which 
reciprocal rewards are present in their inter- 
action. As an extension of this conceptualization, 
it was further suggested (Byrne, 1961a, 1962) 
that attraction toward a person is determined by 
the number of rewards relative to the number of 
punishments received from him. Various types of 
reward and punishment have been utilized ex- 
perimentally, but the major portion of attrac- 
tion research has utilized similarity and dissimi- 
larity of attitudes, opinions, beliefs, and values as 
TABLE 1 


NUMBER OF ITEMS ON WHICH THE STRANGER HELD 
SIMILAR/DISSIMILAR ATTITUDES 


Proportion of          Number of similar attitudes 
similar attitudes        4        8        16 

      1.00              4/0      8/0     16/0 
       .67              4/2      8/4     16/8 
       .50              4/4      8/8     16/16 
       .33              4/8      8/16    16/32 


the stimulus. The rationale is that the learned 
drive to be logical and to interpret incoming in- 
formation correctly is reinforced by consensual 
validation and frustrated by consensual invalida- 
tion (Dollard & Miller, 1950; Festinger, 1950, 
1954; Newcomb, 1953, 1956, 1959, 1961). 
Investigations of the effects of varying pro- 
portions of similar and dissimilar attitudes have, 
unfortunately, involved a confounding of the 
number of positive reinforcements with the pro- 
portion of positive reinforcements. For example, 
Byrne (1962) varied seven items of similarity- 
dissimilarity in the eight possible combinations 
ranging from 7-0 to 0-7. The highly significant 
effects on attraction toward the stranger to whom 
the attitudes were attributed could have been a 
function of either the proportion or the number 
of similar attitudes. The present investigation is 
designed to test the proposition that attraction 
toward a stranger is a positive function of the 
proportion of positive reinforcements received 
from that stranger. 


PROCEDURE 
Utilizing attitude similarity-dissimilarity, the design 
permits the comparison of the effects of number of 
positive reinforcements (16, 8, and 4) with the 
effects of proportion of positive reinforcements (1.00, 
.67, .50, and .33) on attraction in a 4 × 3 factorial 
design as shown in Table 1. Each cell contained 14 
subjects, divided approximately evenly with respect 
to sex. Subjects were asked to read an attitude scale 
supposedly filled out by another student of their 
same sex and then make several judgments about 
him or her including the ratings that constitute the 
measure of attraction. 
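
The factorial layout described above can be sketched in a few lines of Python. This is only an illustrative reconstruction of the cell structure from the levels given in the text; the function and variable names are hypothetical and do not come from the original study.

```python
from fractions import Fraction

# Levels as stated in the text; names below are illustrative assumptions.
PROPORTIONS = [Fraction(1), Fraction(2, 3), Fraction(1, 2), Fraction(1, 3)]
NUMBERS_SIMILAR = [4, 8, 16]
SUBJECTS_PER_CELL = 14


def dissimilar_items(n_similar, proportion):
    """Dissimilar items needed so that n_similar / total equals proportion."""
    total = Fraction(n_similar) / proportion
    return int(total - n_similar)


def build_design():
    """Map each (proportion, n_similar) cell to (n_similar, n_dissimilar)."""
    return {
        (p, n): (n, dissimilar_items(n, p))
        for p in PROPORTIONS
        for n in NUMBERS_SIMILAR
    }


design = build_design()
total_subjects = len(design) * SUBJECTS_PER_CELL
longest_scale = max(s + d for s, d in design.values())

for (p, n), (s, d) in design.items():
    print(f"proportion={float(p):.2f}  similar={s:2d}  dissimilar={d:2d}")
print("subjects:", total_subjects, " longest scale:", longest_scale)
```

Twelve cells of 14 subjects give the 168 subjects reported, and the longest scale (16 similar plus 32 dissimilar items) has 48 items.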

The subjects consisted of 168 students enrolled in 
the introductory psychology course at the University 
of Texas. In their classrooms, each student responded 
to one of several forms of an attitude scale (ranging 
in length from 4 to 48 items) in which they indicated 
their opinions about each topic on a 6-point scale. 
On the basis of a pilot study with another group 
of 138 subjects, each scale was balanced with respect 
to the importance of the topics; in addition, items 
which yielded extremely uniform responses in the 
population were omitted in favor of items yielding 
as close as possible to 50-50 splits. The latter control 
was an attempt to avoid the possible confounding 
of dissimilarity of attitudes with deviancy of atti- 
tudes on the part of the stranger. The attitude items 
covered a variety of topics including fraternities and 
sororities, integration, science fiction, welfare legisla- 
tion, tipping, discipline for children, community 
bomb shelters, and gardening. 

In the experiment itself, subjects were brought into 
a special room in small groups. As has been described 
previously (Byrne, 1962) they were told that the 
experiment concerned the accuracy of interpersonal 
judgments based on limited information. They were 
to receive an attitude scale filled out by an anony- 
mous stranger, read the responses carefully, and 
then make several judgments about him or her. Each 
subject received a spurious scale containing responses 
which constituted the appropriate number of similar 
and dissimilar attitudes depending on the cell to 
which he was assigned. 

The stranger was rated on 7-point scales (Byrne 
& Wong, 1962) with respect to intelligence, knowl- 
edge of current events, morality, adjustment, and 
then on the two attraction scales (probable liking 
for the stranger and probable enjoyment of working 
with him). In previous investigations, responses to 
the latter two scales have been analyzed separately 
as alternate measures of the dependent variable. It 
seems advantageous, however, to combine the ratings 
on the two scales into a single attraction index with 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF ATTRACTION SCORES TOWARD STRANGERS WITH VARYING 
NUMBERS AND VARYING PROPORTIONS OF SIMILAR ATTITUDES 


                              Number of similar attitudes 
Proportion of            4              8              16            Total 
similar attitudes     M     SD       M     SD       M     SD       M     SD 

      1.00          11.14  1.68    12.79  1.01    10.93  1.93      …     … 
       .67          10.79  2.46     9.36  2.64     9.50   …       9.88  2.60 
       .50           9.36  2.52     9.57  2.53     7.93  3.20     8.95  2.86 
       .33            …    3.02     6.64  1.99     6.57  2.02     7.12  2.50 

     Total           9.86  2.74     9.59  3.05     8.73  3.01 
a possible range of 2-14. With the two scales con- 
ceptualized as forming a two-item measuring device, 
data from 10 different samples totaling 1,010 subjects 
yield an average corrected split-half reliability of .85 
for the attraction measure. 
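
As an illustration of the attraction index and of what a "corrected" split-half reliability implies, the sketch below assumes that the correction is the usual Spearman-Brown formula for doubling test length; the text does not name the formula, so that is an assumption, and the function names are hypothetical.

```python
def attraction_index(liking, working):
    """Sum of the two 7-point attraction ratings (possible range 2-14)."""
    for rating in (liking, working):
        if not 1 <= rating <= 7:
            raise ValueError("each rating must lie on the 7-point scale")
    return liking + working


def spearman_brown(r_half):
    """Reliability of the two-item index given the correlation between items.

    Assumption: the standard Spearman-Brown correction for doubled length.
    """
    return 2 * r_half / (1 + r_half)


def implied_half_correlation(r_corrected):
    """Inverse of spearman_brown: inter-item r implied by a corrected value."""
    return r_corrected / (2 - r_corrected)


# Under that assumption, a corrected reliability of .85 corresponds to an
# inter-item correlation of roughly .74.
print(round(implied_half_correlation(0.85), 2))
```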


RESULTS 


The means and standard deviations of the at- 
traction scores are shown in Table 2. Before 
computing the analysis of variance, Hartley's 
maximum F ratio test for heterogeneity of vari- 
ance (Walker & Lev, 1953) was employed. The 
Fmax of 10.04 indicated a significant departure 
from homogeneity. Even though it has been 
found (Young & Veldman, 1963) that hetero- 
geneity of variance has a negligible effect on 
both the alpha level and the power of the F test, 
the data were transformed into the square of 
each number in order to obtain acceptable homo- 
geneity. There was no difference in significance 
levels between the analysis based on the raw 
data and that based on the transformed data; 
the summary shown in Table 3 is based on the 
raw data. Since the only significant F is that for 
the proportion of similar attitudes, the hypothesis 
is confirmed. 
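
Hartley's statistic is simply the ratio of the largest to the smallest cell variance. The sketch below applies that definition to the cell standard deviations that are legible in Table 2 (one cell is unreadable in this copy and is omitted); the dictionary layout and function name are illustrative assumptions.

```python
# Cell standard deviations keyed by (proportion, number of similar attitudes).
# Only values legible in Table 2 are included; the (.67, 16) cell is omitted.
CELL_SDS = {
    (1.00, 4): 1.68, (1.00, 8): 1.01, (1.00, 16): 1.93,
    (0.67, 4): 2.46, (0.67, 8): 2.64,
    (0.50, 4): 2.52, (0.50, 8): 2.53, (0.50, 16): 3.20,
    (0.33, 4): 3.02, (0.33, 8): 1.99, (0.33, 16): 2.02,
}


def hartley_fmax(standard_deviations):
    """Largest cell variance divided by smallest cell variance."""
    variances = [sd ** 2 for sd in standard_deviations]
    return max(variances) / min(variances)


fmax = hartley_fmax(CELL_SDS.values())
print(round(fmax, 2))
```

With these values the ratio is driven by the .50/16 cell (SD 3.20) and the 1.00/8 cell (SD 1.01), which is consistent with the Fmax of 10.04 reported in the text.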


DISCUSSION 


The positive relationship between proportion 
of similar attitudes held by a stranger and at- 
traction toward him appears to be a firmly estab- 
lished one. The nature of that relationship can 
be described somewhat more precisely at this 
point. Data were available to the author from 
TABLE 3 


ANALYSIS OF VARIANCE OF ATTRACTION SCORES TOWARD 
STRANGERS WITH VARYING NUMBERS AND 
VARYING PROPORTIONS OF SIMILAR ATTITUDES 


Source              df       MS        F 
Proportion (A)       3     158.71    23.34* 
Number (B)           2      14.11     1.63 
A × B                6       8.49     1.25 
Within             156       6.80 

* p < .001. 


five published studies (Byrne, 1961a, 1961b, 
1962; Byrne & McGraw, 1964; Byrne & Wong, 
1962),² the present investigation, and two un- 
published studies; in each instance attraction 
was the dependent variable and various propor- 
tions of similar attitudes the independent vari- 
able. The subjects totaled 790, and 11 different 
values of the proportion of similar attitudes held 
by the stranger were represented. A plot of the 
mean attraction scores for these 11 points sug- 
gested linearity, and a straight-line function was 
fitted to the data by the least-squares method. 
The solution yielded the formula Y = 5.44X 
+ 6.62, and the plot is shown in Figure 1. 
While the linear relationship between propor- 
tion of similar attitudes and attraction appears 
to be a lawful phenomenon, more evidence is 

² In two of the studies some of the strangers were 
identified as Negroes, but for the purpose of the 
present paper only the data involving white strangers 
have been utilized. 


Fig. 1. Attraction toward stranger as a linear function of proportion of similar 
attitudes (Y = 5.44X + 6.62). 

Fig. 2. Attraction toward stranger as a linear function of proportion of high 
creativity ratings. 


needed to support the more general proposition 
that attraction is a linear function of the propor- 
tion of positive reinforcements. Attraction has 
been found to be influenced by various types of 
reward and punishment, including high and low 
ratings of creative productions (McDonald, 
1962), facilitation of success at an experimental 
task (Kleiner, 1960), and insulting and frus- 
trating behavior (Worchel, 1958). Only in Mc- 
Donald’s experiment, however, was proportion of 
positive reinforcements represented by a series of 
values. His subjects were asked to create seven 
stories in response to a set of stimulus cards; a 
confederate posing as another subject rated each 
story on a 10-point scale with respect to crea- 
tivity. Each of the 192 subjects was assigned to 
one of eight possible conditions (seven high 
ratings, six high and one low, etc.). Attraction 
toward the peer who made the ratings was found 
to increase as a function of the proportion of 
high ratings. McDonald made his data available 
to the author, and they were plotted as shown 
in Figure 2. Again, a linear function was found 
to fit the data: Y = 2.98X + 8.47. Thus, linear- 
ity of the relationship is evident across two 
different experimental conditions but the values 
of the constants differ. Presumably, the similarity 
across stimulus conditions is attributable to the 
common element of positive and negative re- 
inforcements involved. A tentative law of at- 
traction is proposed as $A_x = mPR_x + k$, or attrac- 
tion toward X is a positive linear function of the 
proportion of positive reinforcements received 
from X. The explanation for the differences 


between the two situations in terms of the slope 
of the line and the Y intercept must be sought 
in subsequent research. 
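
The tentative law can be illustrated with a short Python sketch that evaluates the two fitted lines reported in the text. The slopes and intercepts are those given above; the class and constant names are hypothetical, and the sketch only evaluates the published equations rather than refitting any data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AttractionLine:
    """A line of the form A = m * PR + k (names are illustrative)."""
    slope: float      # m
    intercept: float  # k

    def predict(self, proportion_positive):
        """Predicted attraction for a proportion of positive reinforcements."""
        if not 0.0 <= proportion_positive <= 1.0:
            raise ValueError("proportion must lie between 0 and 1")
        return self.slope * proportion_positive + self.intercept


# Constants as reported for the two stimulus conditions.
ATTITUDE_SIMILARITY = AttractionLine(slope=5.44, intercept=6.62)
CREATIVITY_RATINGS = AttractionLine(slope=2.98, intercept=8.47)

for p in (0.33, 0.50, 0.67, 1.00):
    print(f"PR={p:.2f}  "
          f"attitudes={ATTITUDE_SIMILARITY.predict(p):.2f}  "
          f"creativity={CREATIVITY_RATINGS.predict(p):.2f}")
```

Both lines rise with the proportion of positive reinforcements, but the attitude-similarity line is steeper and starts lower, which is the difference in constants that the text leaves to subsequent research.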
PREDICTION OF INTERPERSONAL PREFERENCES AND GROUP 
MEMBER SATISFACTION FROM ESTIMATED ATTITUDES ¹ 


MARTIN FISHBEIN 


University of Illinois 


This paper tests the predictive utility of a theory of belief and attitude in the 
area of interpersonal preferences within small groups. Each leader’s attitude 
toward each member of his group was computed as a function of the leader’s 
beliefs about the behavior of each member. These estimated leader attitudes 
were related to: (a) the esteem which the leader gave each member of the 
group, (b) his sociometric choices, (c) his perception of the group’s atmosphere, 
and (d) his overall satisfaction in the group. The results support the theory, 
and provide evidence for the predictive effectiveness of attitudes estimated 
indirectly from an individual’s beliefs about an object. 


In some recent papers, the writer (Fishbein, 
1962, 1963) has presented a theory of relation- 
ships between beliefs about an object and the 
attitude toward that object. Essentially, the 
theory leads to the prediction that an individual’s 
attitude toward any object is a function of his 
beliefs about the object, and the evaluative as- 
pect of those beliefs. This holds for the case 
where a belief about an object is defined as the 
probability that the object is related to some 
other object, concept, value, or goal, and the 
evaluative aspect of a belief is defined as the 
evaluation of, that is, attitude toward, the “re- 
lated” object. 

This paper presents a test of the applicability 
of this theory in the area of interpersonal atti- 
tudes. Specifically, the attitudes of group leaders 
towards the three members of their respective 
groups will be estimated from the leaders’ beliefs 
about their members. In addition, the relation- 
ships between these estimated attitudes and the 
leaders’ sociometric choices, their perception of 
the “group’s atmosphere,” and their satisfaction 
with the group are also investigated. 


METHOD 


The data for this analysis were obtained by Mc- 
Grath and Julian (1962) in a study of negotiation 
processes. Since a complete discussion of the method 
and procedures used may be found in McGrath and 
Julian, only those parts of the study relevant to 
the present analysis will be discussed. 

Twenty graduate students served as chairmen of 
TABLE 1 

TOPICS OF NEGOTIATION AND THE RELIGIOUS FOUNDATION WITH THE ROLE OF MINORITY-OF-ONE 

        Negotiation topics                          Foundation with minority role 
Code  Content                               Code  Foundation name                    Position 

X     Use of King James Bible as a reading  B     Baptist Student Union (Southern    Favoring 
      text in public schools                      Baptist) 

Y     Federal aid to parochial and other    C     Newman Foundation (Catholic)       Favoring 
      private schools 

Z     Empowering FCC to regulate TV         D     Channing-Murray Foundation         Opposing 
      programing to reduce obscenity and          (Unitarian-Universalist) 
      violence 


Note. From McGrath and Julian (1962). 


60 four-person negotiation groups. Each chairman 
met with three different negotiation groups, each 
group being composed of three new members and 
the chairman. The negotiators were representatives 
of three campus religious foundations, and the topics 
selected for negotiation provided a situation in 
which two of the negotiators formed a natural alli- 
ance (majority members), placing the third negoti- 
ator in a position of a minority-of-one. Further, the 
topics were selected such that the representative of 
each religious foundation was the minority member 
in one of the negotiations. Table 1 indicates the 
three religious foundations, the topics of negotiation, 
and the minority member for each topic. Although 
each chairman and each negotiator served in three 
negotiation sessions, any given chairman met with 


nine different individuals. Each negotiation discus- 
sion lasted approximately 30 minutes. 

At the end of each of the discussions, all partici- 
pants completed several rating instruments. First, 
they rated each group member, as well as the “group’s 
atmosphere,” on a semantic-differential-type instru- 
ment containing 17 evaluative bipolar scales.² These 
ratings provide a direct measure of the chairman’s 
evaluation of, that is, attitude toward, each mem- 
ber of his group, and a measure of his evaluation of 
the group’s atmosphere. Consistent with the termi- 
nology of McGrath and Julian (1962) and Fiedler 
(1958), the former scores shall be referred to as 
“esteem scores,” and the latter as “group atmos- 
phere scores.” However, throughout the paper, it 
should be recalled that an esteem score is a directly 
obtained measure of one individual’s attitude toward 
another individual. 

Each participant then rated every other partici- 
pant on a Behavioral Description Questionnaire 
(BDQ). This instrument consists of 28 5-point scales 
on which the respondent is asked to indicate how 
well each of a series of statements describes the 
behavior of a particular member. Each chairman’s 
rating on the BDQ provides a measure of his beliefs 
that a particular member performed certain behaviors. 


² For a complete description of all the instruments 
used in the study, see McGrath and Julian (1962). 


For example, the chairman was asked to indicate how 
strongly he believed that Member “B” “expressed his 
opinions tactfully,” “criticized those with whom he 
disagreed,” etc. Finally, each member filled out a 
questionnaire, two items of which are relevant here. 
The first asked each respondent to indicate, on a 
5-point scale, how much he “enjoyed being a mem- 
ber of this group.” The second was a sociometric 
choice question asking each member to indicate the 
people he would most and least want to have with 
him in a similar group. 

Through a consideration of the role relationships 
built into the negotiation situation, McGrath and 
Julian obtained quite accurate predictions of ma- 
jority and minority members’ esteem for, and socio- 
metric choice of, the other group members. For ex- 
ample, they found that participants in the role of 
minority-of-one had significantly more esteem for 
the chairman than either of their “majority partners.” 
Consistent with this, minority members significantly 
overchose the chairman as the person they would 
most like to have with them in a similar group. No 
such predictions could be made for the chairmen. In 
general, chairmen held equal esteem for majority and 
minority members. Similarly, they were equally 
likely to accept or reject majority and minority 
negotiators on the sociometric choice question. Fur- 
ther, on 27 of the 28 BDQ items, the chairmen did 
not differentially rate majority and minority mem- 
bers. The main purpose of the present paper is to 
predict the chairman’s esteem for, and sociometric 
choices of, his respective group members. 


RESULTS AND DISCUSSION 
Prediction of Attitude 


McGrath and Julian (1962) concluded that the 
personal and interpersonal consequences of ne- 
gotiation are “mediated via the interaction process.” 


³ It should be noted that the present study was 
designed after McGrath and Julian (1962) had com- 
pleted their study and found that the chairmen’s es- 
teem given and sociometric choices could not be pre- 
dicted from their role relationships. 

Thus, they imply that the chairman’s es- 
teem for any member was a function of his be- 
liefs about the interpersonal behavior in which 
the member engaged. In the terminology of the 
theory described above, this conclusion can be 
formulated as the following hypothesis: A chair- 
man’s attitude toward any group member is a 
function of his beliefs that the member per- 
formed certain behaviors, and the evaluative as- 
pect of those beliefs, that is, his evaluations of 
the behaviors. Algebraically, the hypothesis is 
expressed as follows: 


$$A_o = \sum_{i=1}^{N} B_i a_i$$ 


where 
$A_o$ = attitude toward member “o” 
$B_i$ = belief “i” about “o,” that is, the probability 
or improbability that “o” performed some behavior 
$x_i$ 
$a_i$ = the evaluative aspect of $B_i$, that is, the evalu- 
ation of $x_i$ 
$N$ = number of beliefs 
While the chairmen’s beliefs about the be- 
haviors of each member could be obtained di- 
rectly from the BDQ, no measure of the evalua- 
tive aspect of the beliefs was available. Further, 
the writer (Fishbein, 1962) has previously ar- 
gued that attitude is primarily a function of 
salient beliefs, and no measure of saliency was 
obtained. In order to obtain these values, five 
“expert” judges⁴ first rated each of the be- 
haviors on the BDQ on an 11-point scale ranging 
from extremely good (+5) to extremely bad 
(—5), and second, rank ordered the 28 behaviors 
in terms of their importance if one were to de- 
cide whom he wanted to have continue in the 
group. The judges went through this procedure 
twice: once considering that the behavior was 
performed by a majority member and once con-
sidering that the behavior was performed by a
minority member. Table 2 presents the seven⁵
behaviors ranked as most important on each sort,
and the mean evaluations of these behaviors. 
Utilizing the algebraic formula above, estimates 
of the chairman’s attitudes toward each of the 
nine members with whom he worked were com- 
puted. For example, a chairman’s attitude 


4 The judges were five staff members of the Group 
Effectiveness Research Laboratory at the University 
of Illinois. While ideally these ratings of $a_i$ and
salience should have been made by each subject, 
McGrath and Julian did not obtain these types of 
data in their study. Thus, judges were used in an 
attempt to approximate the relevant information. 

5 The selection of seven behaviors was based on
Fishbein’s (1962) argument that there are probably
only 6-11 beliefs salient at any point in time. 
toward a given minority member was computed
by considering his responses on the seven selected
BDQ items indicated in Table 2. Each response
was scored from +2 (strong agreement that the
minority member performed the behavior) to
−2 (strong disagreement). Each score was then
multiplied by the evaluation of the behavior 
included in the item. Finally, the seven products 
were summed. This summed score shall be re- 
ferred to as the estimated attitude (EA). 
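
The scoring procedure just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the weights are the minority-member evaluations from Table 2 (reading the entry for belief 7 as 4.4 and assuming the beliefs are numbered 1 to 8 in the order listed), the chairman's responses are hypothetical, and the function name `estimated_attitude` is not from the original study.

```python
# Mean evaluations (a_i) of the seven beliefs judged relevant to minority
# members, as read from Table 2 (belief 7 taken to be 4.4).
MINORITY_WEIGHTS = {1: 4.6, 2: 4.0, 3: 2.8, 5: 1.6, 6: 1.6, 7: 4.4, 8: 3.2}


def estimated_attitude(responses, weights):
    """Return EA = sum of B_i * a_i over the selected beliefs.

    responses: belief number -> BDQ score from +2 (strong agreement that the
               member performed the behavior) to -2 (strong disagreement).
    weights:   belief number -> evaluation a_i of that behavior.
    """
    if set(responses) != set(weights):
        raise ValueError("responses and weights must cover the same beliefs")
    for belief, score in responses.items():
        if not -2 <= score <= 2:
            raise ValueError(f"belief {belief}: score {score} outside -2..+2")
    return sum(responses[b] * weights[b] for b in weights)


# Hypothetical responses of one chairman about one minority member.
example_responses = {1: 2, 2: 1, 3: 0, 5: -1, 6: 1, 7: 2, 8: -2}
example_ea = estimated_attitude(example_responses, MINORITY_WEIGHTS)
print(round(example_ea, 1))
```

With these weights the EA for a minority member can range from −44.4 to +44.4, since the seven evaluations sum to 22.2 and each belief score lies between −2 and +2.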

Correlations between estimated and obtained 
scores were computed with respect to each of the
nine group members with whom the leaders 
worked (for example, the estimated attitudes of 
the 20 leaders toward Member B in Negotiation 
Session “X” were correlated with their corre-
sponding esteem scores). All nine correlations are
significant beyond the .01 level. Indeed, the 
average correlation = .753 (p < .001). This find- 
ing provides strong support for the theory of the 
relationships between beliefs about an object and 
the attitude toward that object. 


TABLE 2

RELEVANT BELIEFS AND THEIR ASSIGNED EVALUATIVE WEIGHTINGS

Beliefs

1. He made important contributions to the group
   discussion and decisions.
2. He is an intelligent person.
3. He worked well with others in the group.
4. He influenced the opinions of others.
5. He was a warm person, easy to talk to.
6. He was the real “idea man” in the group, sug-
   gesting new ways of handling the group’s
   problems.
7. He listened attentively to others.
8. He expressed his opinions tactfully.

Relevant to:

Majority members      Minority members
Belief    a_i         Belief    a_i
1         4.2         1         4.6
2         3.8         2         4.0
3         2.6         3         2.8
4         2.8         5         1.6
5         1.6         6         1.6
7         3.8         7         4.4
8         3.6         8         3.2

Note.—Selection of relevant beliefs and assignment of evalua-
tive weights is based on judgments of five “experts.”


Prediction of Sociometric Choice


Knowledge of a chairman’s attitudes toward
the members of his group should allow one to
predict his sociometric choices. Since the pre-
dictions of most- and least-preferred group mem-
bers are not independent, predictions were made
only of the chairman’s choice of the member he 
would least prefer to have continue in the group. 
Since the decision to predict “least preferred” 
was arbitrary, it should be noted that the pre- 
dictions of “most preferred” yielded equivalent 
results. Both obtained esteem scores and EAs
predict sociometric choices significantly better
than chance. The percentage of correct predic-
tion is 64.3 (p < .001) and 73.7 (p < .001) for
esteem and EA, respectively. Although the dif-
ference in accuracy between the two predictors
does not reach significance (p < .15), it is not
surprising that the EAs appear slightly better. It
should be recalled that the EAs are based on be- 
liefs that were judged to be relevant in making a 
decision about whether a given member should 
be allowed to continue in the group. That is, EA 
is estimated from beliefs about each person as a
group member. Thus, EA actually represents an 
estimate of the chairmen’s attitudes toward each 
individual as a group member. Esteem, how-
ever, is a relatively direct evaluation of a given
person in general. Clearly, EA is more directly 
related to the sociometric choice under consid- 
eration than is esteem. The more relevant the 
attitude is to the behavior to be predicted, the 
better will be the prediction. 


Prediction of Group Atmosphere 


In a recent article summarizing 12 years of 
research, Fiedler (1963) stated that “The per-
sonal relationship between the leader and key
members of his group is probably the most im-
portant single determinant of group processes
which affect team performance.” Indeed, the 
nature of the affective leader-group relations 
serves as one of the three major dimensions in 
Fiedler’s contingency model for the prediction of 
leadership effectiveness. One of the major indices 
that has served as a definition of this dimension 
is the leader’s rating of the group atmosphere. 
Consistent with this definition, Fiedler (1958) and 
his associates (for example, Fiedler, Hutchins, & 
Dodge, 1959; McGrath & Julian, 1962) have 
previously shown that an individual’s rating of 
the group’s atmosphere is highly correlated with 
the sum of his esteem for the members of his 
group. Clearly, the more an individual likes the 
members of his group, the more favorably he 
should rate the group’s atmosphere. Thus, as an-
other test of the validity of the EA scores, each
leader’s EAs toward his group members were 
summed, and these total estimated attitude 
(TEA) scores were correlated with his rating of 
the group’s atmosphere. 

Consistent with Fiedler’s previous findings, 
the leader’s total esteem for his group members
is highly correlated with his ratings of the group’s
atmosphere (r = .834, p < .001). Similarly, and
in support of the validity of the EA scores, the
leader’s TEA toward his group members is also
significantly related to group atmosphere (r =
.613, p < .001). Although total esteem is a sig-
nificantly better predictor of group atmosphere
than TEA (p < .05), it should be recalled that 
the measures of the leader’s esteem for his group 
members, and the measure of “group atmosphere” 
are obtained on the same measuring instrument. 
Thus, the high correlation between total esteem 
and group atmosphere may, in part, be reflecting 
a methods factor in the obtainment of these 
scores. No such artifact can enter into the cor- 
relation between TEA and group atmosphere. 
This is due to the fact that the EA scores are 
derived from responses on an independent in- 
strument containing a different response format 
and different item content.⁶


Prediction of Satisfaction 


Fiedler and his associates (for example, Fied- 
ler et al., 1959; McGrath & Julian, 1962) have 
previously shown that an individual’s satisfaction 
in a group is significantly related to his attrac- 
tion toward the group members and/or his rat- 
ing of the group atmosphere. That is, the more 
esteem the leader has for the group members,
or the more favorably he views the group’s at-
mosphere, the more he should enjoy being a
member of the group. Only six leaders gave dif-
ferential ratings of enjoyment to all three of the
groups they participated in. Six others indicated
a group they most preferred (giving lower, but
equal ratings to the other two groups), while six
more indicated a group they least preferred. The
remaining two leaders enjoyed all three groups
equally well. By randomly dividing the six chair-
men who did differentiate between all three
groups, it was possible to obtain a most-preferred
group for nine chairmen, and a least-preferred
group for nine different chairmen. Group at-
mosphere, total esteem, and TEA scores were
used as a basis for predicting either a most- or
least-preferred group for each of the 18 chair-
men.

6 This is not meant to imply that the correlations
between group atmosphere and TEA are free of arti-
facts, but only that one specific source of common
variance has been eliminated. As one reviewer pointed
out, social desirability artifacts may be influencing all
the correlations reported in this paper.

Consistent with the previous findings of Fied-
ler and his associates, both group atmosphere and
total esteem are significantly related to satisfac-
tion. Both measures predict with 55.5% accuracy
(p < .05). However, the best prediction of satis- 
faction is based on the total estimated attitude. 
This measure leads to 88.8% accuracy (p <
.001) and is significantly better than either the
group atmosphere or total esteem scores (p <
.05). Once again, this difference is not surprising.
One of the major reasons that attitudes have been 
such poor predictors of behavior is that in many 
cases, the attitude being measured is not ap- 
propriate to the behavior under consideration. 
For example, Krech, Crutchfield, and Ballachey 
(1962) have pointed out that “the ‘Negro’ on an 
attitude scale may be quite a different object 
... from the Negro in a face-to-face situation.” 
That is, an individual’s beliefs about Negroes in 
general (that is, the Negro on an attitude scale) 
are likely to be quite different from his beliefs
about a particular Negro. Since an individual’s
attitude toward any object is a function of his 
beliefs about that object, it follows that an indi- 
vidual’s attitude towards “Negroes in general” 
may be quite different from his attitude toward 
a particular Negro. Obviously, if an investigator 
attempted to predict an individual’s behavior 
toward a particular Negro on the basis of his 
attitude toward “Negroes in general,” he would 
obtain low predictability as a result of using an 
inappropriate predictor. Similarly, Bales (1958) 
has often pointed out that the leader of a group 
is usually not the “best liked person” in the
group. Thus, if one attempted to predict an indi-
vidual’s sociometric choices of both the leader of
the group and the best liked person in the group 
from a single predictor (for example, a general 
measure of attitude toward the other group mem- 
bers), it is unlikely that he would obtain signifi- 
cant accuracy with respect to both choices. Bales’ 
findings suggest that an individual may have
several attitudes toward any given individual, for
example, an attitude toward him as a friend, as 
a co-worker, as a leader, etc. That is, when one 
individual views another individual as a leader, he 
might have a different set of salient beliefs about 
that person than when he views the same person 
as a friend. In the present study it was found
that slightly better predictions of sociometric
choices of people to continue in a group could be
made from an estimate of an individual’s attitude 
toward each person as a group member than from 
an obtained measure (that is, esteem) of his 
general attitude toward each person. Similarly,
since it is assumed that a person’s enjoyment in
a given group is a function of his attraction to
that group, it is not surprising to find that sig- 
nificantly better predictions of satisfaction are 
obtained when one uses the sum of the leader’s 
attraction toward each person as a group member 
as the predictor than when one uses total attrac-
tion toward each person in general as the pre-
dictor.
This study concerns Negro Ss’ avoidance tendencies in competitive achievement
tasks. It was predicted that Negro Ss would not show avoidance behavior if
they had higher expectancies of being able to gain reinforcements for their
efforts. 3 groups of Negro reformatory inmates played a competitive game with
white stooges who won continuously. 1 group, designated the high-expectancy
group, was comprised of jazz musicians who had been led to believe that the
game was related to personal skills of musicians. This group persisted sig-
nificantly longer than either of 2 control groups. The latter demonstrated the
more typical failure-avoidance characteristic of Negroes in previous research.


The results of several recent studies by Katz
(Katz & Benjamin, 1960; Katz & Cohen, 1962;
Katz, Goldston, & Benjamin, 1958; Katz &
Greenbaum, 1963) indicated that Negroes tend
to feel inadequate, and orient compliantly toward
whites even when they are given objective evi-
dence of equal mental ability. Battle and Rotter
(1963) and Lefcourt and Ladwig (1963) have
demonstrated that Negro subjects answered
questionnaires and performed in achievement
tasks in a manner that is characteristic of exter-
nally controlled persons. That is, the Negro sub-
jects perceived reinforcements as being unrelated
to their own behaviors and therefore beyond
their personal control. In these studies Negroes
behaved in a predominantly failure-avoidant man-
ner, especially in biracial achievement situations.

In the studies by Katz et al. (1958) and Katz
and Benjamin (1960), an attempt was made to
alter Negroes’ behavior by group prestige tech-
niques in which Negro and white partners were
assured that they were competent in classroom
ability or had scored highly on certain tests in
comparison with fictitious norms. Reversals of
predictions or no effects at all were found on a
wide range of behaviors in their Negro sample,
while whites did show some positive effects from
the prestige manipulation.

The present study attempts to manipulate
Negroes’ behavior through a group prestige tech-
nique similar to that used by Katz and his
colleagues. The notable difference is that this
manipulation utilizes a social role within an
already established positive reference group
(membership is of high positive reinforcement
value to the participants).


1 This research was based on the senior author’s
doctoral dissertation at the graduate school of the
Ohio State University. The research was supported
in part by the United States Air Force, under Con-
tract No. AF 49(638)-317 monitored by the Air
Force Office of Scientific Research of the Air Re-
search and Development Command.

The senior author wishes to thank Alvin Scodel
for his suggestions and help in the design of this
experiment.

From the previously mentioned research it can 
be tentatively concluded that when Negroes 
enter biracial, competitive, achievement-motiva- 
tion eliciting tasks they will have high expectan- 
cies of failure and consequently will behave in a 
failure-avoidant manner. In studies of persistence 
by Feather (1961, 1963) subjects characterized 
as motivated to succeed (M_s) were found to
persist in an insoluble task longer than subjects 
described as failure avoidant (My) if the prob- 
ability of success was .50 or better. Failure- 
avoidant subjects, on the other hand, persisted 
longer when probabilities dropped below .50. In 
the present study all of the subjects are con-
sidered to be M_s to obtain a highly valued in-
centive (cigarettes). Expectancy of success in
securing the reinforcements, however, varies with 
the prior experience of the subject in the role as 
a jazz musician. Consequently, subjects who have 
been defined as having higher expectancies of 
success regarding their roles as jazz musicians 
(above .50) will be more apt to persist in an
insoluble task related to that role than those 
who have lower expectancies of success (below 
.50 level). 


Hypotheses 


Negroes have usually displayed failure-avoidant 
behavior in biracial achievement situations. How- 
ever, when such achievement tasks are construed 
as related to competence in a role in which 
Negro subjects have already had some success 
experience, expectancy of success should increase.
It is predicted, then, that Negro subjects will be 
more persistent in a competitive biracial task 
when they perceive the task as related to success
in a role that has positive reinforcement value for 
them and in which they have had some success 
experience. 


METHOD 
Subjects 


Three groups of Negro inmates were drawn from 
a correctional institution. The first group (J) was 
comprised of 20 inmates who had displayed a con- 
tinuous interest in jazz as demonstrated by at least 
a 6-months’ membership in a jazz club. This club
met once a week, held jam sessions, and prepared 
musical shows for the inmate population. A second
group (Q) was comprised of 20 inmates who had 
joined the jazz club but had quit after attending 
a few sessions. A third group (C) was made up of 
20 inmates who had never joined any music interest 
groups. Analyses of variance revealed no differences
among the three groups in intelligence, age, social
class, or types of crimes resulting in commitment. 
In addition, no differences were found among the 
groups in regard to the internal-external control 
dimension (Rotter, Seeman, & Liverant, 1962) or 
in failure-avoidant behavior on Rotter’s Level of 
Aspiration Board (Rotter, 1942). As reported in a 
previous study (Lefcourt & Ladwig, 1963), Negroes 
were consistently more failure avoidant and had 
greater expectancies of external control than whites. 
Mean beta IQ was 98.5 and the average age was 
21.6 years. 


Task


Subjects from the J and Q groups were informed
that they had been chosen as subjects for an experi-
ment because of their interest in jazz. The study
was described as concerning the interests, opinions, 
and skills of jazz musicians. Each subject was inter-
viewed regarding his interests in music, his favorite
musicians, etc. In addition, subjects were given a 
scale eliciting their ratings of known jazz musicians.
Several other measures were administered at this
time, after which the subjects were released. They
were told that they would be called in for a second
session which would focus on skills of musicians.


The C group subjects were merely informed that 
they had been chosen at random from the inmate 
population, completed several tests and were re- 
leased with the information that they would be 
called back again. 

The experimental task was a two-person zero-sum 
game called NIM. The game, described in Time 
Magazine (The Marienbad Game, 1962), consists of 
picking up matches until one player is left with the 
last match, this player being the loser. The outcome 
of the game can be controlled almost completely by 
use of a set of combinations. 

Negro subjects played against a white stooge who 
had memorized the necessary combinations. The 
stooge won each game including a practice trial so
that all subjects had a continuous failure experience.
The subject and the stooge were separated by a
screen with only their hands visible to each other.
The stooge was described as just another inmate 
who was not a jazz musician. The subjects were 
instructed: 


This is a simple game but it requires cleverness
and skill at outsmarting your opponent. Each of
you will take turns picking up matches. You can
take as few or as many matches as you wish, even
a whole row. [Sixteen matches are initially laid
out in rows of 7, 5, 3, and 1.] But all the matches
you take in any one turn must be taken from one
row. The player who picks up the last remaining
match loses. Try one game for practice so you can
see how it works. . . . Both of you are starting
off with chips worth $2.00. You have 20 chips,
each worth ten cents. You will bet ten cents for
each game that you play, and you can play any
number of games you want. Either of you can call
it quits at any time. However, if you are the one
to give in first then you will only get credit for
half the value of your chips. The chips are worth
cigarettes at 25 cents a pack, so you can see it is
worthwhile to win as many chips from each other
as possible. The one who doesn’t give in will get
paid the full amount that he has won.
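
The claim that the outcome of NIM "can be controlled almost completely" can be illustrated with the standard strategy for the version of the game in which the player taking the last match loses. The sketch below is an assumption about what such a "set of combinations" amounts to, not a record of the moves the stooge actually memorized; the function name `winning_move` is hypothetical.

```python
from functools import reduce
from operator import xor


def winning_move(rows):
    """Return (row_index, matches_to_take) for a winning move in last-match-loses
    NIM, or None if every move loses against correct play."""
    big = [i for i, n in enumerate(rows) if n > 1]
    ones = sum(1 for n in rows if n == 1)

    if not big:
        # Only single matches remain: the mover wins only if their number is even.
        if ones > 0 and ones % 2 == 0:
            return (rows.index(1), 1)
        return None

    if len(big) == 1:
        # Exactly one larger row: cut it to 1 or 0 so that an odd number of
        # single matches is left for the opponent.
        i = big[0]
        leave = 1 if ones % 2 == 0 else 0
        return (i, rows[i] - leave)

    # Two or more larger rows: play to make the XOR ("nim-sum") of the rows zero.
    nim_sum = reduce(xor, rows)
    if nim_sum == 0:
        return None
    for i, n in enumerate(rows):
        if n ^ nim_sum < n:
            return (i, n - (n ^ nim_sum))
    return None


# The opening layout described in the instructions.
print(winning_move([7, 5, 3, 1]))
```

Under this strategy the opening layout of 7, 5, 3, and 1 has a nim-sum of zero, so the function returns `None`: whoever moves first from that position loses if the opponent replies correctly on every turn.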


Subjects were allowed a maximum of 15 plays 
after which they were informed that the games 
would be stopped to insure their winning some 
reward. Consequently, scores ranged from 0 to 15
games. All subjects were tested during a 2-day 
period. Four of the 15 subjects tested on the first day 
were deliberately allowed to win to offset possible 
effects of communication. No differences were found 
among J group members tested on the 2 days. 

RESULTS


Because of institutional disciplinary actions, 
¥'subjects of the original 60 were lost for the 
second session of testing. Consequently, there 
were slight differences among Ns in the three 
groups. As Table 1 indicates, significant differ- 
ences were obtained among the three groups in 
the predicted directions. The Newman-Keuls 
procedure was used to determine the significance
of mean differences among the three groups 
(Winer, 1962). 

The J group persisted significantly longer than 
either the Q or C groups. The Q group did not 
differ significantly from the C group though the 
difference did approach the .10 level. 

Since there were some differences in variance 
as well as in N and direction of skew among the 
groups, a Kruskal-Wallis H was computed on 
the same data. Results indicated an H value of 
11.36, p < .01. Mann-Whitney U tests revealed 
similar significance of differences as the para- 
metric tests except that the J + Q group differ- 
ence was significant at the .06 level rather than 
TABLE 1
ANALYSIS OF VARIANCE AND MEAN DIFFERENCES AMONG THE J, Q, AND C GROUPS
IN THE NUMBER OF GAMES PLAYED IN THE NIM SITUATION

                                                   Newman-Keuls
Group   N             M             SD     F     p     differences   p
J       [illegible]   [illegible]   3.76                J + Q         .05
Q       19            8.26          6.68   5.8   .01    J + C         .01
C       16            5.06          6.19                Q + C         ns



the .05 level obtained with the Newman-Keuls 
test. 
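
For readers unfamiliar with the nonparametric check, the Kruskal-Wallis H statistic can be computed as in the following sketch. It assumes the usual formula with average ranks for ties and the standard tie correction; the scores in the usage lines are a toy example, not the data of this study, and the function names are hypothetical.

```python
from collections import Counter


def average_ranks(values):
    """Map each distinct value to the mean of the 1-based ranks it occupies."""
    positions = {}
    for rank, value in enumerate(sorted(values), start=1):
        positions.setdefault(value, []).append(rank)
    return {value: sum(r) / len(r) for value, r in positions.items()}


def kruskal_wallis_h(groups):
    """Return the tie-corrected Kruskal-Wallis H for a list of score lists."""
    pooled = [x for group in groups for x in group]
    n = len(pooled)
    rank_of = average_ranks(pooled)

    # H = 12 / (N(N+1)) * sum(R_j^2 / n_j) - 3(N+1), with R_j the rank sum of group j.
    rank_term = sum(sum(rank_of[x] for x in g) ** 2 / len(g) for g in groups)
    h = 12.0 / (n * (n + 1)) * rank_term - 3 * (n + 1)

    # Tie correction: divide by 1 - sum(t^3 - t) / (N^3 - N) over tied sets.
    correction = 1 - sum(t**3 - t for t in Counter(pooled).values()) / (n**3 - n)
    if correction == 0:
        raise ValueError("all scores are identical; H is undefined")
    return h / correction


# Toy example with three completely separated groups and no ties.
toy_h = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
print(round(toy_h, 2))
```

With three groups, H is referred to the chi-square distribution with 2 degrees of freedom, which is how an H of 11.36 corresponds to p < .01.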


Discussion


The results suggest that previous failures of
prestige manipulations to alter Negroes’ failure- 
avoidant behavior may have been due to the 
fact that such manipulations have too often 
focused on intellectual achievement, an area in 
which Negroes have low expectancies of success. 
In this experiment, competence as a jazz musician 
was emphasized. With a continuous failure ex-
perience in a seemingly simple, two-person,
biracial, competitive situation, subjects who had 
high reinforcement value for and success in jazz 
club membership, as demonstrated by the length 
of time they had remained participants in the 
club, exhibited more persistence in a task said 
to be related to competence as a jazz musician. 
Nearly 50% of the Q group persisted as long as 
the J group members. In interviews with these 
subjects, several claimed to have quit attending 
the jazz club for reasons beyond their control, 
such as time conflict with required activities. This
may explain the greater effect of the jazz cues
in the Q than in the C group.

It might be argued that the jazz musician 
sample is composed of an atypical group of Negro 
subjects who, having achieved some modicum of 
success with jazz skills, are more confident of 
themselves in general. Consequently, it could be 
hypothesized that these musicians would have 
displayed the same persistence in competition 
regardless of the experimental manipulation used 
in this study. However, it should be noted that 
only 2 of the 20 subjects in the jazz club sample 
had any experience performing outside of the 
institution setting, and their experience was 
minimal. Therefore, there is little chance that 
the subjects felt “secure” in the musician role, 
having no affirmed status except that of novice 
or amateur. Second, as mentioned above, there 
were no differences among the three groups in 
regard to past achievement histories insofar as is 
indicated by school grades completed, Stanford 
Achievement Test scores, and intelligence test 


performance. In addition, the three groups dis- 
played no differences among themselves in 
failure-avoidant behavior and external control 
expectancies while each of the groups differed 
significantly from an otherwise equivalent sample 
of white inmates. 

Another alternative that presents more of a
problem is that the subjects in the musician
sample may have felt gratified by the investi-
gators’ interest in jazz and consequently felt
obliged to comply with the experimenters’ de-
mands. The control subjects, on the other hand,
may have perceived themselves as anonymous 
guinea pigs who were forced to act as subjects 
by virtue of the fact that they were reformatory 
inmates with little right of protest. If such were 
the case then persistence in the competitive task 
could be explained by the subjects’ perceptions 
of the experimenters’ demands, and by their 
desire to cooperate with the experimenters. 

In follow-up interviews members of the J 
group frequently mentioned curiosity and interest
in the task as an explanation for their persistence.
The C subjects, on the other hand, often quoted
such maxims as “Bird in the hand . . .” to ac-
count for their early withdrawal from the com-
petitive situation. Jazz club members seemed
less concerned with the immediate material 
gains than with the task itself while the reverse 
seemed to be true of the control groups. 

One implication of these findings is that pres- 
tige suggestions need to be related to goals 
towards which subjects have some expectancy of 
success. A second implication is that Negroes 
may become more task and achievement oriented 
when they expect that their behavior can de- 
termine the occurrence of reinforcements which 
are of high value to them. The fact that Negroes 
have usually performed less adequately than 
whites on intelligence measures may reflect the 
fact that middle-class goals which require ex- 
tensive schooling are customarily unavailable to 
them. Consequently, such goals may accrue a 
negative reinforcement value as well as low ex- 
pectancies of success for Negroes. With gradual 
changes in Negroes’ opportunities for upward 


yee 
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mobility it is possible that Negroes will come to 
expect and value middle-class goals and con- 
sequently display more achievement behavior. 
PREVIOUS EXPERIENCE WITHIN THE DYAD AND COOPERATIVE
GAME BEHAVIOR¹


ALBERT A. HARRISON AND CHARLES G. McCLINTOCK
University of California, Santa Barbara


The present study was an attempt to explain cooperation within the dyad
in terms of reinforcement theory. 2 groups of 10 dyads each received monetary
gains while engaging in a cooperative reaction-time game; 2 similar groups
were not rewarded, but sustained losses. Immediately following this reaction-
time game, 1 of the rewarded groups and 1 of the nonrewarded groups played
a series of 60 consecutive Prisoner’s Dilemma games, a task in which one may
either cooperate or compete. The other rewarded and nonrewarded groups of
dyads returned after an interval of 1 wk. to participate in the Prisoner’s
Dilemma. A comparison of Ss who were rewarded during the reaction-time
game with Ss who had no previous experience within the dyad indicated that
the rewards received during the initial task increased the probability of a
cooperative response during the Prisoner’s Dilemma game, and that this effect
persevered over time. The results also indicated that losses sustained during
the reaction-time game did not increase or decrease the number of cooperative
responses in the group tested immediately but did increase such responses in
the group tested 1 wk. later.


1 This research was supported by the United States
Air Force through the Air Force Office of Scientific
Research of the Air Research and Development Com-
mand, under Contract No. AF 49(638)-794.

2 Now at the University of Michigan.


Since the appearance of Luce and Raiffa’s
Games and Decisions (1957) the Prisoner’s
Dilemma game has gained prominence within
social science research. This two-person, non-
zero-sum game provides each player with a pair
of alternative responses. One of these choices
may be considered cooperative, since it favors
gains for both the participants, while the other
may be considered competitive because it favors
high personal gains at the other player’s expense.

Several experiments have been conducted to 
determine the effects of personality and/or situa- 
tional factors upon the proportion of cooperative 
responses made over a series of successive Pris-
oner’s Dilemma games (Bixenstine, Potash, &
Wilson, 1963; Lutzker, 1960; McClintock, Har-
rison, Strand, & Gallo, 1963; Scodel, 1962).³
A review of much of the game-theoretical litera- 
ture has been presented by Rapoport and Orwant 
(1962). The present research represents an at- 
tempt to study cooperative game behavior within 
a simple reinforcement framework. The under-
lying rationale for the present study is that when
cooperative behavior is rewarded by monetary 
gain, the probability of cooperation at a later 
time will be increased.

Given the assumption that winning money re-
inforces human behavior, this experiment tests
the following hypothesis: Subjects who have en- 
gaged in an interdependent task in the past, and 
have received rewards for cooperating in the 
course of this interaction, will be more coopera-
tive while participating in the Prisoner’s Dilemma
game than subjects who have not previously 
received rewards at the task. Thus, the present 
study is essentially an attempt at inducing col- 
laborative behavior by manipulating past experi- 
ences within the dyad with one group receiving 
reward at an interdependent task and another 
group receiving losses. To determine whether the 
manipulation produced a long-term effect or a 
transitory cooperative set, a second major vari- 
able is introduced, that is, the time interval
between the initial cooperative task and the 
playing of the Prisoner’s Dilemma game. Sub- 
jects in the immediate-test groups participated 
in the Prisoner’s Dilemma immediately following 
the cooperative reaction-time game, while sub- 
jects in the delay-test groups participated 1 week 
following the experimental manipulation. Thus, 
there were four conditions: reward immediate, 
nonreward immediate, reward delay, and non-
reward delay. In addition, a fifth or control group
was used to determine the effects of previous 
interaction per se. This latter group participated 
in the Prisoner’s Dilemma game without having
previously engaged in the reaction-time game. 


METHOD 
Subjects 


The experiment involved the use of 100 naive 
subjects drawn from introductory psychology classes 


3 For a more complete description of the Prisoner’s 
Dilemma, the reader is referred to Luce and Raiffa 
(1957), Scodel (1962), and Rapoport and Orwant 
(1962). 
and residence halls at the University of California,
Santa Barbara. There were 20 subjects, 10 males 
and 10 females, in each of five groups. Like-sexed 
subjects were randomly assigned in pairs to the 
reward or nonreward treatments. Because of sched- 
uling limitations, subjects were allowed to determine 
whether they participated in the immediate-test, 
delay-test, or control conditions.⁴ Subjects in the
four treatment groups participated in two, 1-hour 
sessions. Subjects in the control group participated 
only in the Prisoner’s Dilemma session.
In addition to controlling on sex, controls were 
placed upon previous friendship and interaction be- 
tween Sessions I and II. Subjects were instructed 
not to appear at the same time as friends, and upon 
arrival a “Friendship Scale” was administered. Poten- 
tial subjects indicating previous friendship were 
deleted. Subjects who volunteered during the course 
of the experiment that they detected the experimental 
manipulation were also discarded. Discarded subjects 
were replaced, so that complete data were collected 
for 100 subjects. In order to determine whether dif- 
ferential interaction had occurred between sessions 
for subjects in the delay-test groups, an “Interaction 
Scale” was administered at the end of Session II. 


Procedure 


Session I: Reaction-time game. After introduction 
to the reaction-time game room, subjects were seated 
at opposite sides of a partition, in front of individual 
game boxes. These units each consisted of a warning 
light, a play signal, and a response button. On each 
trial a warning light came on, followed by the 
illumination of the play signal. Subjects were told 
that in order to win, they must press their response 
button as quickly as possible after the play signal 
came on, and that these responses must be made 
before an unspecified criterion time. Subjects were 
informed that if they won, each would gain $.01, 
and that on the other hand if either or both failed 
to respond rapidly enough they would lose $.01. 
Subjects were able to ascertain whether they won or 
lost on any given trial by referring to a digital 
display unit. Subjects were informed that since the 
task was totally interdependent they would win or 
lose together, having identical sums of money at 
any given time. The instructions emphasized that 
the outcomes were interdependent and that, because 
of this, the game was to be considered a cooperative 
task. Subjects were given $1.00 to start with—a score 
on the digital display—and were told that they 
would be allowed to keep what they won during the 
game. Subjects were instructed not to talk. 

In the reaction-time game “wins” and “losses” 
were randomly inserted by the experimenter into the 
sequence of 100 trials. Dyads receiving the reward 
treatment won on 85 trials, and those receiving the 
nonreward treatment lost on 85 trials. All subjects 
within an experimental condition were rewarded 
on the same trials. 

*This determination was made indirectly by the 
subjects’ selection of a 1-hour session, a continuous 
2-hour session, or two 1-hour sessions separated by 
a week. All subjects came from introductory psy- 
chology courses where they were required to partici- 
pate as subjects for 3 hours during the semester. 
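The fixed win/loss schedule of the reaction-time game can be sketched in a few lines. This is only an illustration under stated assumptions: the text says wins and losses were "randomly inserted" and that all dyads in a condition were rewarded on the same trials, but it does not say how the sequence was generated, so the seeded shuffle and the names `make_schedule` and `final_balance_cents` are hypothetical.

```python
import random

TRIALS = 100        # trials in the reaction-time game (from the text)
FAVORED = 85        # wins (reward) or losses (nonreward) per dyad
START_CENTS = 100   # each subject starts with $1.00 on the display
STAKE_CENTS = 1     # $.01 gained or lost on every trial


def make_schedule(condition, seed=0):
    """Return 100 booleans (True = win) for one treatment condition.

    Assumption: a fixed seed stands in for "all subjects within a
    condition were rewarded on the same trials". The authors' actual
    randomization procedure is not described.
    """
    if condition not in ("reward", "nonreward"):
        raise ValueError(f"unknown condition: {condition}")
    wins = FAVORED if condition == "reward" else TRIALS - FAVORED
    schedule = [True] * wins + [False] * (TRIALS - wins)
    random.Random(seed).shuffle(schedule)
    return schedule


def final_balance_cents(schedule):
    """Balance on the digital display after the last trial, in cents."""
    balance = START_CENTS
    for won in schedule:
        balance += STAKE_CENTS if won else -STAKE_CENTS
    return balance


reward = make_schedule("reward")
nonreward = make_schedule("nonreward")
print(final_balance_cents(reward))     # 170, i.e. $1.70
print(final_balance_cents(nonreward))  # 30, i.e. $0.30
```

Under these assumptions a rewarded dyad ends Session I with $1.70 per member and a nonrewarded dyad with $0.30, whatever the order of the trials.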

Following the reaction-time game, the experimenter 
informed the subjects of their number of wins and 
losses. Subjects in the two nonreward conditions 
were told that 37 of their losses were due to Player 
I’s failures, 37 of their losses due to Player II’s 
failures, and 11 due to their combined failure. 
Subjects in the two reward conditions were told 
that 7 of their losses were attributable to Player I, 
7 of their losses attributable to Player II, and 1 
loss attributable to joint failure. The assignment 
of equal fault for losses was effected to control 
individual feelings of inferiority or vindictiveness. 

After the verbal resumé of their performance, 
subjects in the reward-immediate and nonreward- 
immediate conditions participated in Session II. 
Subjects in the reward-delay and nonreward-delay 
conditions returned 1 week later to participate in the 
Prisoner’s Dilemma game. 

Session II: The Prisoner’s Dilemma. Subjects were 
seated opposite each other, separated by a partition. 
In front of them was a Prisoner’s Dilemma game 
board, $.75 in pennies, and a copy of the payoff 
matrix (Figure 1). The apparatus consisted of three 
interdependent units: an electric gameboard for each 
subject and a control unit for the experimenter. At 
the bottom of Player I’s board was an “A” and a “B” 
button, controlling Columns A and B; at the bottom 
of Player II’s board was an X and a Y button, 
controlling Rows X and Y. Above each subject’s 
pair of buttons were four lights, arranged in a 
quadrant, the quadrants corresponding to the payoff 
matrix. At the top of each light quadrant was a 
pilot light which served as a “play signal.” Subjects 
were told that when the play signal came on, each 
subject was to make his response, and that after 
each had responded, the appropriate light would 
illuminate, indicating their mutual choices. Following 
this, a cash payoff would be made to each subject. 
As in the reaction-time game, they were told that 
they would be allowed to keep whatever amount 
they ended up with. Subjects were instructed not to 
converse. Sixty consecutive Prisoner’s Dilemma games 
were played. 


                    Player I 
                    A        B 
Player II    X     3, 3     5, -1 
             Y    -1, 5     0, 0 


Fig. 1. The Prisoner’s Dilemma. 


Following the Prisoner’s Dilemma, the Interaction 
Scale was administered to the subjects in the delay- 
test conditions. Finally, the reaction-time game, the 
Prisoner’s Dilemma game, and the nature of the 
experiment were explained. Subjects were not actually 
given the amount of money they had in their 
possession at the end of the experiment, but all were 
awarded either $2.00 for participating in two sessions 
or $1.00 for participating in the one-session control 
group. 

Participants were requested not to discuss the 
games or the experiment. 


RESULTS 


Complete data were collected on 57 dyads.5 
Seven of these dyads were discarded: 2 dyads 
failed to meet the criteria with respect to the 
Friendship Scale, 4 contained one or more 
members who volunteered after Session II that 
Session I involved deception, and 1 dyad was 
discarded because of experimenter error. The 
responses of the remaining 50 dyads were used 
to evaluate the hypothesis. 


Cooperation as a Function of Treatment 


The principal hypothesis of the study, that 
individuals who have received rewards for 
participating in an interdependent task will be more 
cooperative in a subsequent Prisoner’s Dilemma 
game than those who have not previously 
received rewards, was partially supported. Dyads 
that were rewarded during the reaction-time 
game were more cooperative than dyads which 
had sustained losses, but only when the Prisoner’s 
Dilemma immediately followed the initial session. 
Dyads which were rewarded in Session I were 
also more cooperative while participating in the 
Prisoner’s Dilemma than dyads that had not 
previously interacted, that is, control group dyads. 
The mean number of cooperative responses per 
dyad per condition is given in Table 1. The 
figures are computed on the basis of the number 
of individual cooperative moves made by either 
player. 


TABLE 1 

MEAN NUMBER OF COOPERATIVE RESPONSES PER DYAD 
AS A FUNCTION OF CONDITION 

Test         | Reward    | Nonreward | Control 
             | treatment | treatment | 
Immediate    | 79.0      | 48.0      | 48.7 
Delay        | 79.9      | 73.6      | 
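The scoring rule behind Table 1, together with the payoff matrix of Figure 1, can be sketched as follows. Two points are assumptions rather than statements from the text: that the first number in each cell of Figure 1 is Player I's payoff, and that A and X are the cooperative choices (they are the pair that produces the 3, 3 cell). The function names and the 60-game record are hypothetical.

```python
# Payoffs from Figure 1, keyed by (Player I's column, Player II's row).
# Assumption: the first number in each cell is Player I's payoff.
PAYOFFS = {
    ("A", "X"): (3, 3),
    ("B", "X"): (5, -1),
    ("A", "Y"): (-1, 5),
    ("B", "Y"): (0, 0),
}

# Assumption: A and X are the cooperative choices (they yield 3, 3).
COOPERATIVE = {"A", "X"}


def is_prisoners_dilemma(temptation, reward, punishment, sucker):
    """Standard ordering conditions for a Prisoner's Dilemma."""
    return (temptation > reward > punishment > sucker
            and 2 * reward > temptation + sucker)


def dyad_score(moves):
    """Count individual cooperative moves made by either player.

    `moves` holds one (player_one_choice, player_two_choice) pair per
    game, so 60 games give a maximum dyad score of 120.
    """
    return sum((p1 in COOPERATIVE) + (p2 in COOPERATIVE) for p1, p2 in moves)


def payoff_totals(moves):
    """Sum each player's payoffs over a sequence of games."""
    total_one = sum(PAYOFFS[(p1, p2)][0] for p1, p2 in moves)
    total_two = sum(PAYOFFS[(p1, p2)][1] for p1, p2 in moves)
    return total_one, total_two


# Hypothetical record: 40 mutual cooperations, then 20 games in which
# only Player I defects.
example = [("A", "X")] * 40 + [("B", "X")] * 20
print(is_prisoners_dilemma(5, 3, 0, -1))  # True
print(dyad_score(example))                # 100
print(payoff_totals(example))             # (220, 100)
```

On this reading a dyad mean of 79.0 corresponds to roughly two thirds of the 120 possible cooperative moves, and a mean of 48.0 to 40 percent.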

5 In addition, three dyads in the delay-test groups 
were necessarily deleted because they contained one 
or more members who failed to appear for Session 
II. Two of these dyads were in the nonrewarded 
group, the other was in the rewarded group. 

The significance of these differences was tested 
using a one-way analysis of variance with five 
groups (Table 2). Although there were 20 subjects 
in each group, the data were treated in terms of 
the scores taken from the dyads, reducing the size 
of the N to 10 per group. This analysis gave an F 
that was significant at the .05 level. 


TABLE 2 

ONE-WAY ANALYSIS OF VARIANCE WITH FIVE GROUPS, 
MEAN NUMBER OF COOPERATIVE RESPONSES PER DYAD 

Source          | df | SS       | MS       | F 
Between groups  |  4 |  8,508.1 | 2,127.02 | 3.62* 
Within groups   | 45 | 26,450.9 |   587.79 | 
Total           | 49 | 34,959   |   713.44 | 

*p < .05. 
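The entries of Table 2 can be rechecked from the sums of squares and degrees of freedom alone. The short sketch below does only that arithmetic; it does not reproduce the analysis from raw scores, which are not given in the text, and the variable names are illustrative.

```python
# Values transcribed from Table 2.
ss_between, df_between = 8508.1, 4
ss_within, df_within = 26450.9, 45

# Five groups of 10 dyads each, as stated in the text.
groups, dyads_per_group = 5, 10
assert df_between == groups - 1
assert df_within == groups * (dyads_per_group - 1)

# Mean squares are sums of squares divided by their degrees of freedom.
ms_between = ss_between / df_between
ms_within = ss_within / df_within
f_ratio = ms_between / ms_within

print(f"MS between = {ms_between:.3f}")
print(f"MS within  = {ms_within:.3f}")
print(round(f_ratio, 2))                 # 3.62
print(round(ss_between + ss_within, 1))  # 34959.0
```

The recomputed mean squares agree with the tabled 2,127.02 and 587.79 to within rounding in the second decimal place, and the F ratio matches the reported 3.62.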

The Scheffé test was used to test the significance 
of the differences between selected means, 
using the .05 level of significance as criterion. The 
Scheffé tests confirmed what the data in Table 1 
suggest: 

1. There were no significant differences in the 
mean number of cooperative responses between 
the dyads of the reward-immediate, reward-delay, 
and nonreward-delay conditions (all k’s < 1, 
df = 1/18). 

2. The difference between the nonreward-immediate 
and control conditions was not significant 
(k = .09, df = 1/18). 

3. The dyads in the nonreward-immediate and 
control conditions made significantly fewer 
cooperative responses than the subjects in the other 
three conditions (k = 3.72, df = 4/45). 

4. The dyads in the nonreward-delay condition 
made more cooperative responses than the dyads 
in the nonreward-immediate condition (k = 3.34, 
df = 1/18). The latter finding, that the dyads 
in the nonreward-delay group were more cooperative 
than the dyads in the nonreward-immediate 
group, was unanticipated. 


Fig. 2. Mean number of cooperative responses per 
dyad as a function of games played and treatment 
received. (Abscissa: 10-trial blocks, 1-10 through 
51-60. Ordinate: mean number of cooperative 
responses per dyad. Curves: control; reward, 
immediate test; reward, delay test; nonreward, 
immediate test; nonreward, delay test.) 


Cooperative Behavior over Time 


The number of cooperative responses as a 
function of the number of games played and as a 
function of the treatment received is presented in 
Figure 2. The mean number of cooperative responses 
is presented over 10-trial blocks. Inspection shows 
that the reward-immediate dyads were initially the 
most cooperative, and that the control dyads were 
initially the least cooperative. All groups except 
the nonreward-immediate showed an increase in 
cooperation as a function of games played, from 
Trial 21 through Trial 40, followed by a slight 
decrease over the subsequent 20 trials. The 
nonreward-immediate dyads demonstrated a decreasing 
number of cooperative responses until Trial 40, 
whereupon a sharp increase appears. The general 
figure presented by the nonreward-immediate polygon 
stands in sharp distinction to the figures of the 
other four polygons. 


Responses to the Interaction Scale 


As a control measure the Interaction Scale was 
administered to subjects in the delay-test conditions 
to determine if they interacted differentially 
during the 1-week time interval separating Session I 
and Session II. Generally, subjects interacted 
little, if at all. There was no significant 
differential interaction recalled by subjects in the 
reward-delay as opposed to the nonreward-delay 
conditions during the week time interval separating 
Sessions I and II. Subjects in the reward-delay 
group gave a mean total estimate of 1.05 
minutes spent in social interaction with their 
partners; subjects in the nonreward-delay group 
gave a mean total estimate of 2.45 minutes spent 
in social interaction with their partners. The 
difference between these means was not significant 
using a t test (t = 1.02, df = 38). 
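The reported degrees of freedom and t value can be related to the two means with a little arithmetic. The sketch below assumes an ordinary independent-samples t test with pooled variance, which the text does not state explicitly; the standard error and pooled standard deviation it prints are therefore implied quantities under that assumption, not figures reported by the authors.

```python
import math

n_reward_delay = 20          # subjects in the reward-delay group
n_nonreward_delay = 20       # subjects in the nonreward-delay group
mean_reward_delay = 1.05     # mean minutes of recalled interaction
mean_nonreward_delay = 2.45
t_reported = 1.02

# Degrees of freedom for a two-sample t test: n1 + n2 - 2.
df = n_reward_delay + n_nonreward_delay - 2

# t = difference / standard error, so the standard error can be backed out.
mean_difference = mean_nonreward_delay - mean_reward_delay
implied_se = mean_difference / t_reported

# Assumption: pooled variance, so SE = sd * sqrt(1/n1 + 1/n2).
implied_pooled_sd = implied_se / math.sqrt(
    1 / n_reward_delay + 1 / n_nonreward_delay
)

print(df)                           # 38
print(round(implied_se, 2))         # 1.37
print(round(implied_pooled_sd, 2))  # 4.34
```

The degrees of freedom agree with the reported 38, which is consistent with the 40 individual subjects, rather than the 20 dyads, being the unit of analysis for this scale.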
DISCUSSION 


Perhaps the most salient finding of the study 
was that the nonreward-delay dyads did not differ 
from the two reward groups. The similarity of 
the reward-delay and nonreward-delay groups is 
interesting because it is attributable not to a 
lesser amount of cooperation by the reward-delay 
dyads (as compared with the reward-immediate 
dyads) but rather to a greater amount of 
cooperation by dyads in the nonreward-delay 
condition (as compared with the nonreward-immediate 
dyads). This finding cannot be explained by inferring 
that subjects in the nonreward-delay dyads 
formed friendships with their partners during 
the 1-week time interval separating the two 
sessions, since the Interaction Scale indicated little 
or no interaction. Nor can it be accounted for by 
arguing that the nonreward-delay subjects who 
were responsive to the treatment had an 
opportunity to “drop out” as opposed to 
nonreward-immediate dyads who had no such opportunity. 
Only two dyads within the nonreward-delay 
group were discarded because one or more 
members failed to appear for Session II. 

The similarity between the mean number of 
cooperative responses exhibited by dyads in the 
nonreward-immediate and control conditions is 
also of note. This would imply that the losses 
sustained by the nonreward-immediate dyads 
produced no “punishing” effect, that is, the 
monetary losses were not associated with a decrease in 
the probability of a cooperative response. The 
difference in the number of cooperative responses 
made by the dyads in the reward-immediate and 
reward-delay conditions, when contrasted with 
the number of cooperative choices made by dyads 
in the control group, would indicate that mutual 
reinforcement received during the reaction-time 
game had the hypothesized effect of producing 
more cooperative behavior, and that this effect 
persisted over time. However, the high level of 
cooperation maintained by the nonreward-delay 
group is not easily accounted for. 

The marked difference in the number of 
cooperative responses made by the nonreward-delay 
group in comparison to the nonreward-immediate 
group would seem to indicate that the initial 
effect of a nonrewarding interaction tends to 
dissipate, and that previous interaction per se 
tends to produce an increase in cooperative 
responses through time. It would be of major 
theoretical interest to determine more precisely the 
effects upon cooperative behavior in the dyad of 
varying the reinforcement parameters of the 
initial encounter, and varying the time interval 
between initial and subsequent interactions. 

The shape of the curves showing the mean 
number of cooperative responses per dyad as a 
function of the number of games played and 
treatment received (Figure 2) also demonstrates 
a temporal effect. Although the difference in the 
mean number of cooperative responses per dyad 
between the nonreward-immediate and control 
dyads is not significant, the shapes of the curves 
are strikingly dissimilar. Whereas the control 
group’s dyads leveled off in terms of the 
probability of a cooperative response during the later 
trials, the nonreward-immediate dyads demonstrated 
a sharp increase. Because the Prisoner’s Dilemma 
session terminated after 60 consecutive playings, the 
outcome of this trend can be ascertained only by 
future research. At present, this trend must be 
considered as extremely tentative. However, it is 
of note that the group which has previously 
interacted, even though this initial experience is 
nonrewarding, appears to be beginning to display a 
greater increment in the number of cooperative 
responses over time than the group which has not 
interacted previously. Exactly what occurs in 
Session I and in the time between sessions to produce 
these perplexing results can only be determined 
by future research. 